.. _2017_concept_model_vivarium_conic_lsff:
..
  Section title decorators for this document:

  ==============
  Document Title
  ==============

  Section Level 1
  ---------------

  Section Level 2
  +++++++++++++++

  Section Level 3
  ~~~~~~~~~~~~~~~

  Section Level 4
  ^^^^^^^^^^^^^^^

  Section Level 5
  '''''''''''''''

  The depth of each section level is determined by the order in which each
  decorator is encountered below. If you need an even deeper section level, just
  choose a new decorator symbol from the list here:
  https://docutils.sourceforge.io/docs/ref/rst/restructuredtext.html#sections
  And then add it to the list of decorators above.

=============================================
Vivarium CoNIC Large Scale Food Fortification
=============================================

.. contents::
  :local:

Model Overview
--------------

Objective
+++++++++

Intervention Definitions
++++++++++++++++++++++++

Questions of Interest
+++++++++++++++++++++

Scope of Modeling
+++++++++++++++++


Concept Model Diagram
---------------------

.. image:: lsff_concept_model_diagram.svg

Model Versions
--------------

.. list-table::
   :header-rows: 1

   * - Version
     - Description
     - Done
   * - v1.0_disease_only
     - Model with mortality, morbidity, diarrhea, LRI, measles, and
       neural tube defects.
     - Yes
   * - v2.0_lbwsg
     - Model v1.0 plus the low birth weight and short gestation risk
       factor.
     - Yes
   * - v3.0_paf_of_one
     - Model v2.0 plus vitamin a deficiency and iron deficiency.
     - Yes
   * - v4.0_folic_acid_fortification
     - Model v3.0 plus folic acid fortification baseline and scale up.
     - Yes
   * - v5.0_vitamin_a_fortification
     - Model v4.0 plus vitamin a fortification baseline and scale up.
     - In progress
   * - v6.0_iron_fortification
     - Model v5.0 plus iron fortification baseline and scale up.
     - Not started
   * - v7.0_scenarios
     - Full model with final stratification and all scenarios.
     - Not started.

Model results can be located at
``/share/costeffectiveness/results/vivarium_conic_lsff`` in the
appropriate model version directory.


Model Components
----------------

Time
++++

* Start and end date: **January 1, 2020 -- December 31, 2024**
* Simulation time step: **1 day** to capture short timeframe of diarrheal
  diseases and neonatal causes

Demographics
++++++++++++

* Locations: **Nigeria, India, Ethiopia**
* Population: **Prospective open cohort of 0-5 year-olds**
* Size of largest starting population: **100,000 simulants**
* Youngest start-age and oldest end-age: **0 -- 5 years**
* Exit age (at what age to stop tracking simulants): **5 years**
* Fertility: **Crude birth rate**

Output Stratification
+++++++++++++++++++++

See the :ref:`Output Stratification Groups Table <stratification_groups_table>`
and the :ref:`Raw Outputs Table <raw_outputs_table>`.

Scenarios
+++++++++

Simulate five scenarios (each nutrient separately, plus iron and
folic acid together, plus baseline):

#. Baseline (using data on existing coverage of fortification)
#. Vitamin A scale-up
#. Iron scale-up
#. Folic Acid scale-up
#. Iron + Folic Acid scale-up

GBD Causes
++++++++++

* :ref:`Measles <2017_cause_measles>`

* :ref:`Diarrheal Diseases <2017_cause_diarrhea>`

* :ref:`Lower Respiratory Infections <2017_cause_lower_respiratory_infections>`

* :ref:`Neural Tube Defects <2017_cause_neural_tube_defects>`


GBD Risks
+++++++++

* :ref:`Low Birth Weight and Short Gestation (LBWSG) <2017_risk_lbwsg>`

PAF-of-1 Cause/Risk Pairs
+++++++++++++++++++++++++

* :ref:`Vitamin A Deficiency / Vitamin A Deficiency <2017_cause_vitamin_a_deficiency>`

* :ref:`Dietary Iron Deficiency / Iron Deficiency <2017_cause_iron_deficiency>`

Risk-Outcome Relationships
++++++++++++++++++++++++++

Interventions
+++++++++++++

* :ref:`Vitamin A Fortification <vitamin_a_intervention_section>`

* :ref:`Iron Fortification <iron_intervention_section>`

* :ref:`Folic Acid Fortification <folic_acid_intervention_section>`

Stratifying Exposure Variables by Baseline Intervention Coverage
----------------------------------------------------------------

.. todo::

  I think most of the content of this seciton should eventually be pulled out
  into a separate page (somewhere in the :ref:`model_design` section) that can
  be referred to from different concept models.

From GBD we obtain overall population estimates for our outcomes of interest
(prevalence of vitamin A deficiency, birth prevalence of neural tube defects,
distribution of haemoglobin levels, and distribution of birth weights). Because
there exists baseline coverage of our interventions --- i.e. there is already
some food fortification occurring in the locations we are simulating --- these
overall population estimates include both people who are eating fortified food
and those who are not.

When we introduce additional food fortification into the population, only those
who are *not* already eating fortified food should receive the benefit.
Therefore, in order to properly measure the effects of our interventions, we
must first stratify our population into those who are already covered vs. not
covered by the forticant of interest. We then need to calculate the relevant
exposure distribution in each coverage stratum to get a complete picture of the
baseline population. For example, we will need to know the birth prevalence of
neural tube defects or the distribution of haemoglobin levels in the
subpopulation already eating fortified food as well as in the subpopulation not
eating fortified food.

These coverage-stratified exposure variables are calculated so that:

1.  The population-weighted average of the exposure distributions matches the
    overall population estimate from GBD (at least approximately), and

2.  The differential exposure between the subpopulations matches the
    corresponding effect size for our intervention (perhaps under some
    simplifying assumptions).

For example, the ratio of the birth prevalence of neural tube defects between
the fortified and unfortified groups should match the risk ratio of neural tube
defects from folic acid fortification found in the literature, and the
difference in mean haemoglobin level between the fortified and unfortified
groups should match the mean shift in haemoglobin from iron fortification found
in the literature.

When we stratify the population in this way, we are assuming that the proportion
of our outcome attributable to the absence of our intervention is the same in
our model population as it is in our study population. (A necessary condition
for this, and indeed for modeling the intervention at all, is that the effect
sizes found in the literature are generalizable to the populations we are
simulating.) For something like vitamin A deficiency, the outcome is pretty much
all attributable to vitamin A intake. For something like haemoglobin on the
other hand, the outcome is not necessarily all attributable to iron intake
(because of non-iron-responsive anemias) --- in this case we will need to do
some extra work (essentially, additional stratification) to estimate what effect
iron fortification will have on haemoglobin levels. The above assumption is
valid for situations in which there are not many factors that cause the outcome
other than the exposure at hand. However, it should be more carefully evaluated
when there are multiple factors that can cause the outcome (for example, many
things can cause low birth weight).

.. todo::

  Can we dig a little deeper into what exactly the above assumption means, and
  when it's valid vs. when it's not? How is it different from assuming the
  effect size is generalizable to our population? And in our particular case
  (LSFF simulation), what does it mean to "more carefully evaluate" the low
  birth weight example above? Is our current strategy for stratifying birth
  weight on fortification coverage appropriate, or do we need to account for
  other factors that can affect birth weight? And can we explain exactly why we
  think our approach for accounting for non-iron-responsive anemias when
  stratifying the hemoglobin distribution is valid (e.g. I'd like a coherent
  mathematical framework for how we are handling anemia, which could be
  generalized to other situations)?

  Here are some relevant comments from Ali and Nathaniel in `PR 211
  <https://github.com/ihmeuw/vivarium_research/pull/211>`_:

    AB: We generally get around this issue (i.e. assuming that the proportion of
    our outcome attributable to our exposure is the same in our model population
    as it is in our study population) in our vivarium simulations by directly
    using the PAF for a given exposure and outcome. However, we don't use the
    PAF in this approach, so that is why it requires stronger assumptions (than
    just assuming that the effect size is generalizable to our population).

    NBS: When you say we don't use the PAF in this approach, are you referring
    specifically to continuous outcomes like hemoglobin or birth weight? Because
    I thought our approach for dichotomous outcomes like VAD was essentially
    equivalent to computing the PAF. Is that accurate, or am I missing
    something?

    AB: Ah... yes, the point I was trying to make was that we're not using GBD's
    PAF. GBD's PAFs are calculated after going through a process that vets the
    RRs and adjusts for potential confounders etc. in MRBRT and assesses
    generalizability so we are relying on this analysis and don't have to make
    the assumption on our own; we just have to assume that GBD is correct : ).

  If our approach for handling dichotomous (or polytomous?) outcomes is
  equivalent to computing a PAF (or PIF = population impact fraction), should we
  be computing a PAF/PIF in the continuous case as well? What would this
  approach look like for hemoglobin or birth weight?

.. todo::

  Come up with some examples to show what would happen if we *didn't* first
  stratify by existing coverage. I.e., why is this baseline stratification
  important? It is not totally obvious to me what the effect would be if we
  didn't do it (e.g. would we end up overestimating or underestimating the true
  effect?) -- I think the net effect depends on the level of existing coverage,
  the shape of the underlying exposure distribution, and the magnitude of the
  effect size.

  Here are some examples from Ali in `PR 211
  <https://github.com/ihmeuw/vivarium_research/pull/211>`_ that can be edited to
  fit in here somewhere (Nicole also says she has an example):

    I think that the most obvious example of this is for birth weight. The
    additional 10 grams of birth weight from an intervention will be much more
    valuable to you if you are born in the 0-500 gram category than the 2500 to
    2750 category.

    So, if we DO calibrate to baseline coverage, the uncovered group will have a
    lower baseline mean birth weight (say 2500) than the covered group (say
    2750). The difference in RR between the 2500 g group and the next birth
    weight group >> the difference in RR between the 2750 group and the next
    birth weight group. Essentially, we are helping those who really need it!

    If we do NOT calibrate for baseline coverage, both groups have the same
    starting mean birth weight. In this case, the uncovered group will be
    starting slightly "better off" and therefore have less to gain from the
    intervention (we might underestimate the effect of the intervention).

    A similar situation is true for dichotomous exposures like VAD. If overall
    population prevalence of VAD is 20% (baseline covered VAD prevalence = 10%,
    baseline uncovered = 30%) and the RR for vitamin A fortification is 0.5.
    Also let's say that baseline coverage is 50%.

    So if we DO account for baseline coverage differences, we will reduce VAD by
    50% in the baseline uncovered group... 30% to 15% so that 15% of the
    population remits.

    If we DO NOT account for baseline coverage differences, we will still reduce
    VAD by 50% in the baseline uncovered group, but there will be less VAD there
    to reduce so that VAD prevalence changes from 20% to 10% so that 10% of the
    population remits.

    So in general, we will underestimate the effect of our intervention if we do
    not do this, although it will depend on the overall dynamics of the risk
    factor and the corresponding relative risks.

The details of stratifying the baseline population depend on the form of the
exposure distribution, in particular whether it is categorical or continuous.
The simplest case is for a dichotomous outcome (e.g. vitamin A deficiency or
neural tube defects). This situation has been dealt with in a number of Vivarium
models under the title "Coverage Gap Framework"; it uses the same approach GBD
uses for risk factors to compute a PAF for the "risk" of not having the
intervention we are simulating. The strategy for continuous exposure
distributions (e.g. haemoglobin or birth weight) is less well-developed -- the
current approach only approximates a true stratification of the continuous
distribution, and it has mathematical limitations that may be problematic in
certain situations. We discuss the dichotomous and continuous cases in separate
sections below.

Baseline Coverage Stratification -- Dichotomous Variables (Coverage Gap Framework)
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

This method applies to exposures with dichotomous measures such as Vitamin A deficiency or neural tube defects:

We always define the exposure as deleterious to match GBD 2017 definitions for risk exposures, so relative risks are always >1.

:math:`C_{vita_{baseline}}`: coverage of vitamin A fortified food in the population from the literature that is applied to our sim population at baseline

:math:`P_{exposure_{baseline}}`: 1-:math:`C_{vita_{baseline}}` prevalence of exposure to unfortified foods in our sim population at baseline

:math:`ϴ_{1}`: risk of vitamin A deficiency among those exposed to unfortified foods (bad food) in our sim population

:math:`ϴ_{0}`: risk of vitamin A deficiency among those unexposed to unfortified foods (eats fortified foods) in our sim population

:math:`ϴ_{GBD}`: risk of vitamin A deficiency in GBD population for age, sex, location, year

RR= reciprocal of a <1 effect size (risk ratios of prevalence) = :math:`\frac{1}{\text{0.45(95%CI: 0.19-1.05)}}`

RR= :math:`\frac{ϴ_{1}}{ϴ_{0}}` (we assume this to also be true in our sim population)

PAF= :math:`\frac{P_{exposure_{baseline}}(RR-1)}{1+P_{exposure_{baseline}}(RR-1)}`

1-PAF= :math:`\frac{1}{1+P_{exposure_{baseline}}(RR-1)}`

**Important assumptions and limitations:** This equation for PAF is valid under
the assumption of no confounding. An alternate equation for PAF should be used
when to get an unbiased PAF in the presence of confounding; however, we will
need the attributable fraction in the exposed which we do not readily have.
Hence this is a limitation. The RRs we use, and the exposure % we use are
approximating the PAFs. We make the assumption that the RRs pulled from
literature is generalizable.

.. todo::

   Reference Darrow and Steenland. Confounding and bias in the attributable
   fraction. *Epidemiology*. 2011 Jan;22(1):53-8. DOI:
   https://doi.org/10.1097/EDE.0b013e3181fce49b.


risk in (1-:math:`C_{vita}`), exposed group: :math:`ϴ_{1}= ϴ_{GBD}*(1-PAF)*RR` … equation 1

risk in (:math:`C_{vita}`), unexposed group: :math:`ϴ_{0}= ϴ_{GBD}*(1-PAF)` … equation 2

**How to apply the intervention**: the intervention increases the population
coverage of vitamin A fortified food, this value --> :math:`C_{vita}`, and
shifts the amount of people who receive equation 1 to equation 2.

Baseline Coverage Stratification -- Continuous Variables
++++++++++++++++++++++++++++++++++++++++++++++++++++++++

This method applies to continuous exposure variables such as hemoglobin or birth
weight.

The steps for performing baseline stratification for continuous variables are
outlined in the section `Baseline Calibration -- Hemoglobin and Birth Weight`_
below.

.. todo::

  Add more details to this section. In particular, make it clear that our
  current strategy is to assume that the two stratified distributions (e.g. the
  distributions of hemoglobin levels in the covered and uncovered
  subpopulations) each have the **same shape** as the overall population
  distribution from GBD. The steps outlined in the `Baseline Calibration --
  Hemoglobin and Birth Weight`_ section explain how to get the **means** of
  these distributions, which is sufficient to specify the entire distribution
  under the "same shape" assumption. Another missing piece is to specify what to
  do if shifting the distribution to the left causes nonsensical negative values
  -- one simple strategy is to just resample until you get something positive.
  This will make the mean slightly larger than it should be, but so will any
  strategy we pick.

Model Randomness
----------------

Random samples drawn from distributions of **intervention effect sizes**
should be identical across model locations for each draw.

	For example if there is an effect size of 1.2 (95% CI: 0.9, 1.5) and a
	series of effect sizes drawn for a series of 5 input draws in the Ethiopia
	model is [0.99, 1.15, 1.24, 1.12, 1.27], then this same series of effect
	sizes should be drawn for the India and Nigeria models as well.

This is to ensure that differences in intervention impact across model
locations are attributable to disease burden in each model location rather
than randomness in sampling from the effect size distribution.

Intervention Descriptions
-------------------------

.. _vitamin_a_intervention_section:

Vitamin A Fortification
+++++++++++++++++++++++

Research Considerations
~~~~~~~~~~~~~~~~~~~~~~~

In this model, the vitamin A fortification intervention affects the
**prevalence of vitamin A deficiency**. The effect size for this intervention
was obtained from a Cochrane review performed by [Hombali-et-al-2019]_ on the
fortification of staple foods with vitamin A for vitamin A deficiency.
Notably, the relative risk for vitamin A foritification on vitamin A
deficiency from this review only included data from two randomized controlled
trials and the authors of the review assessed the certainty of the evidence to
be "very low" [Hombali-et-al-2019]_. The relative risk of vitamin A deficiency
prevalence among an intervention population exposed to vitamin A fortified
staple foods relative to a control population given the same staple foods not
fortified with vitamin A from this review was **0.45 (95% CI: 0.19 - 1.05)**.

Therefore, we conducted a supplementary analysis of the effect of the
intervention by pooling the RCT studies from the Cochrane review with studies
included in the systematic review and meta-analysis performed by
[Keats-et-al-2019]_. Notably, none of the studies identified from the
[Keats-et-al-2019]_ review *directly* reported measures of relative risk of
vitamin A deficiency prevalence among the population exposed to vitamin A
fortification relative to the population unexposed to vitamin A fortification.
Therefore, we manually calculated this value based on data reported in study
tables and figures, which required visual approximations of certain values.
Notably, when this supplementary meta-analysis was performed, the resulting
relative risk was calcualted as **0.43 (95% CI: 0.28 - 0.65)**. However, when
limited to sugar and oil vehicles for the vitamin A forticant, the relative
risk was **0.36 (95% CI: 0.26 - 0.50)**. These two supplementary meta-analyses
are represented in the forest plots below.

.. image:: vitamin_a_meta.png

.. image:: vitamin_a_meta_sugar_oil.png

.. note::

  There is a spreadsheet of study summaries of the individual studies as well as PDFs of each study included in these meta-analyses hosted on the simulation science research team google drive

While the supplementary meta-analysis shown above contains more studies and
data than the [Hombali-et-al-2019]_ Cochrane review, it relies on results that were not directly
reported in the individual studies (and in some cases visaully estimated
values). **Therefore, we will conservatively use the results from the Cochrane
review, with increased certainty in the results based on the confirmatory
results from the supplementary meta-analysis.**

Notably, all of these studies included in the supplementary analysis were
conducted among children. Additionally, the study locations included
Guatemala, South Africa, Nicaragua, Indonesia, and the Phillipines. Therefore,
we concluded that it is **reasonable to assume generalizability of these
results to our model populations.**

Regarding effect sizes in young age groups, [Sandjaja-et-al-2015]_ reported
that population vitamin A fortification improved serum retinol concentrations
among infants aged 6-11 months. Therefore, **we assumed that the effect size
from the Cochrane review applies to all age groups above six months of age.**

	Notably, the effect can occur either through the direct consumption of vitamin
	A fortified foods or through the consumption of breastmilk from mothers who
	consume vitamin A fortified foods [Sandjaja-et-al-2015]_;
	[WHO-Vitamin-A-Supplementation-Guidelines]_.

For individuals aged between 0 and six months, we made the following
assumptions:

  1.  Maternal consumption of vitamin A fortified foods has no effect on
      infant vitamin A deficiency birth prevalence. This assumption is
      supported by [Dror-and-Allen-2018]_.

  2.  Maternal consumption of vitamin A fortified foods has no effect on
      infant vitamin A deficiency from 0 to six months of age. This assumption
      is largely supported by the vitamin A *supplementation* literature among
      these age groups [Imdad-et-al-2016]_ and is reflected in the
      [WHO-Vitamin-A-Supplementation-Guidelines]_.

.. note::

  PAF = 0 for all causes except for vitamin A deficiency paf-of-one cause for
  vitamin a deficiency risk factor in the early neonatal and late neonatal age
  groups. Implication for this is that excluding these age groups will only
  potentially impact the YLDs accumulated during the person-time spent in
  these age groups.

Additionally, we made assumptions regarding the response time following the
onset of exposure to vitamin A fortification, including:

  1.  Individuals will exhibit a response in vitamin A deficiency to vitamin
      A fortification between appoximately 2 and 12 months after onset of
      exposure to vitamin A fortification. There was sparce data available for
      the response time to vitamin A fortification, so we used data on the
      response time to vitamin D (another fat-soluble vitamin) supplementation
      as a proxy. The literature larely indicated that response to vitamin D
      supplementation plateaus between 2 and 12 months ([Heaney-2008]_,
      [Vieth-1999]_, [Talaat-et-al-2016]_). We assumed that the distribution of
      response times follows a lognormal distribution with a median value of
      five months, a 0.025 percentile of 2 months, and a 0.975 percentile of 12
      months.

  2.  If an individual was covered by baseline coverage of vitamin A
      fotification, we assumed that the individual was covered (via breastmilk
      or direct consumption) for long enough to exhibit a response (at least 12
      months).

 .. todo::

 	Add more detail regarding the time to response.

Effect Size - Vitamin A
~~~~~~~~~~~~~~~~~~~~~~~

In our Vivarium simulation, the effect of exposure foods **not** fortified
with vitamin A on the prevalence of vitamin A deficiency realtive to those
exposed to vitamin A fortified foods will be represented as follows:

.. math::

  RR = \frac{P(\text{VAD prevalence} \mid \text{no fortification})}
  {P(\text{VAD prevalence} \mid \text{fortification})}
  \approx \frac{1}{0.45\: (0.19, 1.05)}
  \approx 2.22\: (0.95, 5.26).

.. note::

	We are modeling the reciprocal of the relative risk reported in the
	Cochrane review.

	Additionally, this effect size crosses the null, and therefore, in some
	draws it will cause increasing coverage of the intervention to *increase*
	vitamin A deficiency prevalence. This is a limitation caused by the low
	quality evidence regarding the relative risk of vitamin A fortification on
	vitamin A prevalence. However, on average, increasing coverage of vitamin
	A fortification will decrease VAD.

To model the uncertainty in this estimate, the above RR should be drawn from a
`lognormal <https://en.wikipedia.org/wiki/Log-normal_distribution>`_
distribution with median = 2.22, 2.5\ :superscript:`th`-percentile = 0.95, and
97.5\ :superscript:`th`-percentile = 5.26. This distbibution can be created
using `SciPy's lognorm function
<https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.lognorm.html>`_
as follows:

.. code-block:: Python

  from numpy import log
  from scipy.stats import norm, lognorm

  # median and 0.975-quantile of lognormal distribution for RR
  median = 2.22
  q_975 = 5.26

  # 0.975-quantile of standard normal distribution (=1.96, approximately)
  q_975_stdnorm = norm().ppf(0.975)

  mu = log(median) # mean of normal distribution for log(RR)
  sigma = (log(q_975) - mu) / q_975_stdnorm # std dev of normal distribution for log(RR)

  # Frozen lognormal distribution for RR, representing uncertainty in our effect size
  # (s is the shape parameter)
  rr_distribution = lognorm(s=sigma, scale=median)

.. note::

	The same draws from this distribution should be applied to each model
	location as described in the `Model Randomness`_ section

.. note::

	I copied this from Nathaniel's documentation for the folic acid RR, but I
	think that the same approach is appropriate. Perhaps we can eventually
	create a separate page that lists similar strategies that we can reference
	via links.

Time to Response
~~~~~~~~~~~~~~~~

Further, the time-to-response to vitamin A fortification in years should also
be sampled such that:

.. code-block:: Python

	from numpy import log
	from scipy.stats import norm, lognorm

	# median and 0.975-quantile of lognormal distribution for RR
	median = 5/12
	q_975 = 12/12

	# 0.975-quantile of standard normal distribution (=1.96, approximately)
	q_975_stdnorm = norm().ppf(0.975)

	mu = log(median) # mean of normal distribution for log(RR)
	sigma = (log(q_975) - mu) / q_975_stdnorm # std dev of normal distribution for log(RR)

	# Frozen lognormal distribution for RR, representing uncertainty in our effect size
	# (s is the shape parameter)
	response_time_distribution = lognorm(s=sigma, scale=median)

Population Coverage Data
~~~~~~~~~~~~~~~~~~~~~~~~

The coverage algorithm for vitamin A fortification should follow the same approach described
in this concept model document for folic acid fortification (see `Population Coverage Data - Iron and Folic Acid`_).

.. list-table:: Means and 95% CI's for existing population coverage of vitamin A fortification (% of total population)
  :widths: 5 5 5 5
  :header-rows: 1

  * - Location
    - :math:`a` = Eats fortified vehicle
    - :math:`b` = Eats fortifiable vehicle
    - :math:`c` = Eats vehicle
  * - Ethiopia
    - 1.0 (see below)
    - 44 (34, 54)
    - 55 (45, 65)
  * - India
    - 24.3 (21.1, 27.9)
    - 89.4 (87.0, 91.8)
    - 100 (100, 100)
  * - Nigeria (Kano)
    - 7.6 (5.9, 9.4)
    - 35.9 (32.7, 39.1)
    - 98.4 (97.6, 99.3)
  * - Nigeria (Lagos)
    - 7.2 (5.5, 8.9)
    - 22.7 (19.9, 25.5)
    - 98.6 (97.8, 99.3)

For all values other than :math:`a` for Ethiopia, use a Beta distribution
with mean equal to the central estimate, and variance equal to the variance of
a normal distribution with the same mean and 95% confidence interval.

For the :math:`a` value for Ethiopia, assume the following:

.. math::

  a_\textit{Ethiopia} \sim \operatorname{Beta}(0.1,9.9),\quad

The mean of this `Beta distribution
<https://en.wikipedia.org/wiki/Beta_distribution>`_ will have the value shown
in the table. The density has an asymptote at 0 and an x-intercept at
1. The mean value for this parameter was chosen so that the mean of
existing fortification coverage is close to 0 (in a similar approach to
existing coverage of folic acid in Ethiopia).

.. todo::

	Cite Ethiopia data source (Assessment of Feasibility and Potential
	Benefits of Food Fortification. Addis Ababa : Government of the
	Federal Democratic Republic of Ethiopia, 2011).

Effect of Intervention on Simulants
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

As described in the research considerations section, the
intervention effect is dependent on age and time since intervention coverage.

For simulants covered by *baseline coverage*, the effect of the
vitamin A fortitication is determined as follows:

.. code-block:: Python

  # draw level

  rr = rr_distribution.rvs()

  # individual level

  if age_i < 0.5:
  	rr_i = 1
  else:
  	rr_i = rr_distribution.rvs()

For simulants covered by the *intervention scale-up*, the effect of the
vitamin A fortitication is determined as follows:

.. code-block:: Python

  # draw level

  rr = rr_distribution.rvs()

  # individual level

  response_time_i = response_time_distribution.rvs()

  if age_i < 0.5 or random_number_i > coverage(t - response_time_i)):
  	rr_i = 1
  else:
  	rr_i = rr

Where,

	- **age_i** = age of simulant in years

	- **rr_i** = relative risk to be applied to an individual simulant

	- **rr_distribution** = distribution for the relative risk of the intervention, as described above

	- **response_time_distribution** = distribution for the response time to the intervention, as described above

	- **response_time_i** = response time to the intervention assigned to an individual simulant

	- **coverage(t - response_time_i)** = population coverage of vitamin A fortification X years prior to the current time-step, where X is response_time_i

	- **random_number_i** = independent random number between 0 and 1, assigned to a simulant

Therefore, the probability that a simulant has vitamin A deficiency should be evaluated as:

	vitamin_a_deficiency_prevalence * (1 - PAF) * rr_i

Where,

	- PAF is computed based on the RR for vitamin A fortification

	- rr_i is the relative risk assigned to the individual simulant

Summary of Vitamin A Intervention Algorithm
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The pseudo-code used to implement the vitamin A intervention effect in Vivarium
is shown below. This summary was written by James and sent to Ali and Nathaniel
via Slack on March 19, 2020.


.. code-block:: Python

	# Definitions
	effectively_covered := individual is recieving fortification and it is affecting their
					probability of being vitamin a deficient.
	t                   := current time

	# Population level params
	rr                = 1 if under 6 months, else 2.22 (or whatever)
	coverage(t)       = baseline coverage of vitamin a fortification
	exposure(t)       = baseline exposure to lack of vitamin a fortification
	                  = 1 - coverage(t)
	delta_coverage(t) = intervention shift in baseline coverage of vitamin a fortification
	coverage*(t)      = coverage(t) + delta_coverage(t)
	exposure*(t)      = 1 - coverage*(t)
	time_lag          = time from when an individual starts eating fortified
				food until the effect is present
	mean_rr(t)        = rr * exposure(t) + 1 * (1 - exposure(t))
	paf(t)            = (mean_rr(t) - 1)/mean_rr(t)
	p_gbd(t)          = vitamin a deficiency exposure

	# Individual attributes
	qx_i           = propensity for an indidual to receive vitamin a fortification
	x_i(t)         = whether an individual is receiving vitamin a fortification
	               = qx_i > exposure*(t)
	               = qx_i < coverage*(t)
	tx_i           = time at which individual started receiving fortification
	               = argmin_t(x_i(t) == True)

	# Note this definition of eff_x_i depends on coverage*(t) weakly monotonically increasing
	eff_x_i(t)     = whether an individual is effectively covered
	               = (t - tx_i) > time_lag
	qp_i           = propensity for an individual to be vitamin a deficient
	rr(eff_x_i(t)) = 1 if eff_x_i(t) else rr
	p_i(t)         = probability individual is vitamin a deficient
	               = p_gbd(t) * (1 - paf(t)) * rr(eff_x_i(t))
	y_i(t)         = whether an individual is vitamin a deficient
	               = qp_i < p_i(t)

.. note::

	Perhaps need to make coverage inclusive such that: qx_i > exposure*(t) and qx_i >= coverage*(t) for future model runs

.. _iron_intervention_section:

Iron Fortification
++++++++++++++++++

Population Coverage Data and Coverage Algorithm - Iron
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The baseline coverage for iron fortification is the same as the baseline
coverage for folic acid fortification, as described below_. Additionally, the
coverage algorithm should be implemented in the same way as  for folic acid
fortification, as described here_.

Notably, iron and folic acid coverage should be correlated as specified under
the following scenarios:

- **Baseline coverage**: baseline iron and folic acid fortification coverage should be perfectly correlated so that exactly the same simulants are covered by each forticant

- **Scale-up of iron fortification**: zero correlation of scaled-up *intervention* coverage (although baseline coverage remains perfectly correlated)

- **Scale-up of folic acid fortification**: zero correlation of scaled-up *intervention* coverage (although baseline coverage remains perfectly correlated)

- **Scale-up of iron + folic acid fortification**: both baseline and scaled-up intervention coverage of iron and folic acid fortification are perfectly correlated so that exactly the same simulants are covered by each forticant

.. note::

	A limitation of our population coverage data is that we assume no baseline
	iron fortification occurs in the absence of folic acid fortification and
	vice versa.

Effect Size - Iron
~~~~~~~~~~~~~~~~~~

Iron fortification of staple food affects two outcomes in our simulation
model. The first outcome is an individual's hemoglobin concentration following
the *direct* consumption of iron fortified foods. The second outcome is a
simulant's birth weight following the *maternal* consumption of iron fortified
foods.

Hemoglobin Level
^^^^^^^^^^^^^^^^

The effect of iron fortified food consumption on children under 7 years of age
was obtained from the [Keats-et-al-2019]_ systematic review. However, the
effect size in this review was reported in standardized mean differences
rather than in units of hemoglobin concentration directly. Therefore, we
performed a separate meta-analysis of the results included in the
[Keats-et-al-2019]_ review to obtain shifts in hemoglobin concentration. This
meta-analysis is shown below.

.. image:: iron_meta.png

Notably, the above changes in hemoglobin concentration are shown in grams per
*deciliter* (dL); however, GBD models hemoglobin concentration as grams per
*liter* (L). Therfore, the effect size used in our simulation model should be
**+3.0 (95% CI: -0.2, +6.1)** grams of hemoglobin per liter of blood. This
effect size should be interpreted as the population mean shift in hemoglobin
concentrations among children less than seven years old in LMICs following
iron fortification of staple foods.

.. note::

	The confidence interval for this effect size includes the null (0),
	indicating that for some individual simulation draws, the intervention
	effect may *decrease* population mean hemoglobin concentrations. However,
	*on average*, the effect of the intervention will increase population mean
	hemoglobin levels.

To model the uncertainty in the estimate, the above mean difference (MD)
should be drawn from a normal distribution with mean = 3.0,
2.5\ :superscript:`th`-percentile = -0.2, and
97.5\ :superscript:`th`-percentile = 6.1. This
distbibution can be created using `SciPy's norm function
<https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.norm.html>`_
as follows:

.. code-block:: Python

	from scipy.stats import norm

	# mean and 0.975-quantile of normal distribution for mean difference (MD)
	mean = 3
	q_975 = 6.1

	# 0.975-quantile of standard normal distribution (=1.96, approximately)
	q_975_stdnorm = norm().ppf(0.975)

	std = (q_975 - mean) / q_975_stdnorm # std dev of normal distribution

	# Frozen normal distribution for MD, representing uncertainty in our effect size
	hb_md_distribution = norm(mean, std)

.. note::

	The same draws from this distribution should be applied to each model
	location as described in the `Model Randomness`_ section

.. note::

	This distirbution is slightly shifted to the right (0.025 percentile is
	equal to -0.1, rather than -0.2) due to rounding approximations in the
	meta-analysis for the effect size causing a slightly non-symmetrical
	confidence interval around the mean.

Birth Weight
^^^^^^^^^^^^

The effect of maternal consumption of iron fortified food on infant birth
weight was obtained from [Haider-et-al-2013]_. According to this data
source, birth weight among babies born to mothers increased, on average, by
15.1 grams (95% CI: 6.0, 24.2) per 10 mg of daily iron consumption. The
distribution of the parameter should be modeled as follows:

.. code-block:: Python

	from scipy.stats import norm

	# mean and 0.975-quantile of normal distribution for mean difference (MD)
	mean = 15.1
	q_975 = 24.2

	# 0.975-quantile of standard normal distribution (=1.96, approximately)
	q_975_stdnorm = norm().ppf(0.975)

	std = (q_975 - mean) / q_975_stdnorm # std dev of normal distribution

	# Frozen normal distribution for MD, representing uncertainty in our effect size
	bw_md_distribution = norm(mean, std)

	# random sample from effect size distribution
	bw_md_per_10_mg_iron = bw_md_distribution.rvs()

.. note::

	The same draws from this distribution should be applied to each model
	location as described in the `Model Randomness`_ section

.. note::

	The [Haider-et-al-2013]_ paper did not report if the dose of iron was measured
	as elemental iron or as an iron compound such as NaFeEDTA  (sodium ferric
	ethylenediaminetetraacetate). We operated under the  assumption that 10 mg of
	daily iron consumption, as referenced in [Haider-et-al-2013]_, represented 10
	mg of *elemental* iron.

Therefore, we investigated the amount of daily iron consumption that a
pregnant mother would likely consume through iron fortification of staple
foods in our locations of interest in order to scale the effect size
accordingly. See the table below:

.. list-table:: Daily Iron Consumption Parameters
  :widths: 5 5 5
  :header-rows: 1

  * - Location
    - | Concentration of forticant in flour
      | (X)
    - | Amount of fortifiable flour consumed daily
      | (Y)
  * - Ethiopia
    - 30 mg iron as NaFeEDTA / kg flour  [1]; use point value
    - 100 g (IQR: 77.5, 200) [2]; sample from distribution described below
  * - India
    - 14 to 21.25 mg iron as NaFeEDTA / kg flour [1]; sample from uniform distribution with range: 14 - 21.25
    - 100 g (IQR: 77.5, 200) [2*]; sample from distribution described below
  * - Nigeria
    - 40 mg iron as NaFeEDTA / kg flour [1]; use point value
    - 100 g (IQR:77.5, 200) [2*]; sample from distribution described below

.. note::

  While there was data for the concentration of iron forticants other
  than NaFeEDTA for the locations in this table, we conservatively chose
  to use the concentrations of the NaFeEDTA forticant. This is because the
  concentration of elemental iron in flour is lower when the NaFeEDTA
  forticant is used compared to other forticants.

  We assumed that the concentrations listed in the
  [Global-Fortification-Data-Exchange]_ represented miligrams of
  *elemental iron as NaFeEDTA* per kilogram of flour rather than miligrams of
  NaFeEDTA per kilogram of flour. While this was not explicitly stated on the
  [Global-Fortification-Data-Exchange]_, the available literature suggests
  that this is the measurement convention and is consistent with typical doses
  of iron via fortification using NaFeEDTA [Hurrell-et-al-2010]_.

[1] [Global-Fortification-Data-Exchange]_

[2] [Ethiopian-National-Food-Consumption-Survey]_

*In the absence of better data, we assumed that individuals in Nigeria and
India consumed the same amount of fortifiable flour per day as individuals in
Ethiopia.*

.. todo::

	Find better data and replace values for flour consumption per day for
	Nigeria and India.

The amount Z of elemental iron consumed daily, in miligrams per person, (at the
draw level) should be calculated as such (where X is mg iron as NaFeEDTA per kg
of flour and Y is grams of flour consumed daily per person, as defined in the
table above):

.. math::
  \frac{\text{Z mg iron daily}}{\text{person}} =
  \frac{\text{X mg iron as NaFeEDTA}}{\text{kg flour}} \cdot \frac{\text{Y g flour daily}}{\text{person}} \cdot \frac{\text{1 kg flour}}{\text{1,000 g flour}}

First, we must sample values from the distribution of the amount of flour
eaten per day *at the individual simulant level*. We chose to sample at the
individual simulant level rather than the draw level because the underlying
data from the [Ethiopian-National-Food-Consumption-Survey]_
represented variation in individual consumption rather than uncertainty around
a population mean value. The survey reported that the median daily consumption
was 100 g (IQR: 77.5, 200). We assumed that the minumum and maximum values in
the distribution (which were not directly reported) were zero and 350.5 grams
(75th percentile for the region with the highest value for this parameter).
Additionally, we assumed that the values followed a uniform distribution
within each quartile of consumption. Values should be sampled as described
below:

.. code-block:: Python

  import numpy as np

  q0 = 0
  q1 = 77.5
  q2 = 100
  q3 = 200
  q4 = 350.5

  random_number_i = np.random.uniform(0,1)

  if random_number_i =< 0.25:
    daily_flour_consumption_i = np.random.uniform(q0,q1)
  elif random_number_i =< 0.5:
    daily_flour_consumption_i = np.random.uniform(q1,q2)
  elif random_number_i =< 0.75:
    daily_flour_consumption_i = np.random.uniform(q2,q3)
  else:
    daily_flour_consumption_i = np.random.uniform(q3,q4)

Next, the amount of elemental iron ingested by an individual simulant should
be calculated as follows:

.. code-block:: Python

  # daily_flour_consumption_i : value from above code block
  # iron_concentration_in_flour : location-specific value from table above

  elemental_iron_consumed_per_day_i = (daily_flour_consumption_i
      * iron_concentration_in_flour / 1_000)

Then, the specific effect size of iron fortification on low birth weight can
be calculated as follows (where Z is the location-specific amount of elemental
iron in miligrams, as calculated from the equation above):

.. math::

  \text{BW increase} =
	\text{Z mg iron consumed daily} \cdot \frac{\text{15.1 g (95% CI 6.0 - 24.2) BW increase}}{\text{10 mg iron consumed daily}}

.. code-block:: Python

	# bw_md_per_10_mg_iron : defined above (random sample from effect size distribution)
	# elemental_iron_consumed_per_day_i : defined above ( iron consumed per day in mg)

	bw_shift_i = elemental_iron_consumed_per_day_i * bw_md_per_10_mg_iron / 10

See the following section to see if/how to apply the *bw_shift_i* parameter to
individual simulants.

Determining Whether A Simulant is Affected - Iron
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Hemoglobin Level
^^^^^^^^^^^^^^^^

For the purposes of our simulation, we made a few assumptions:

1.  We assumed that simulants only received an effect from iron fortification
    on their *hemoglobin levels* if they directly consumed iron fortified foods;
    they received no effect of iron fortification on their hemoglobin levels
    from maternal consumption of iron fortified foods in utero or through
    breastmilk (although maternal consumption of iron fortified foods does
    affect infant birth weight). As discussed in a review by
    [Siddappa-et-al-2007]_, there may be an association between maternal iron
    deficiency and infant iron levels at birth (measured as serum ferritin).
    However, a significant relationship between maternal iron deficiency and
    infant *hemoglobin* at birth has not been consistently demonstrated
    [Allen-2002]_, [Altinkaynak-et-al-1994]_, [Emamghorashi-and-Heidari-2004]_,
    [Erdem-et-al-2002]_, [Turkay-et-al-1995]_. Notably, there is some evidence
    of a correlation between maternal hemoglobin and infant hemoglobin in the
    first months of life [Marques-et-al-2016]_, [De-Pee-et-al-2002]_,
    [Kilbride-et-al-1999]_; however, the amount and quality of evidence is
    limited [Allen-2002]_ and the scope of our model additionally limited our
    ability to consider this correlation.

2.  We assumed that simulants begin to eat staple foods as a supplement to
    breast milk consumption at the age of six months and that the quantity of
    staple foods consumed as a proportion of total consumption increases
    linearly from six months to two years of age, at which point it reaches its
    peak and then remains constant. We based this assumption on the WHO
    guideline to a) begin to introduce complementary foods as a supplement to
    breastfeeding at six months of age (p. 10), b) continue breastfeeding until
    2 years or age or beyond (p.12), and c) to start feeding "small amounts" of
    complementary food at six months and to "increase the quantity as the child
    gets older" (p. 18) [WHO-Guidelines-For-Complementary-Feeding]_.
    Additionally, data reported by [Diana-et-al-2016]_ indicated that
    complementary feeding began at six months and increased in quantity at nine
    and twelve months among a study population in Indonesia.

.. note::

  Due to the absence of more detailed data, we did not evaluate the specific
  intake of *staple* foods among these age groups, but rather made the
  assumption that the amount of staple foods consumed was proportional to the
  amount of total complementary foods consumed as a supplement to breastmilk.

3.  We assumed that the effect of consumption of iron fortified foods on an
    individual's hemoglobin level is proportional to the amount of staple foods
    consumed as a proportion of total consumption. This assumption was made in
    the absence of supporting data.

4.  We assumed that the full effect of the iron fortification
    intervention takes six months to achieve and that the effect scales up in a
    linear fashion between the onset of exposure and six months post exposure.
    This is likely a conservative assumption, as there is evidence
    [Andersson-et-al-2010]_ that the true curve increases more steeply at first,
    then levels off before reaching the full effect at six months. Thus, the
    measured response curve is concave down, and our linear approximation lies
    entirely below this curve -- it is the secant line to the curve, with slope
    equal to the average rate of increase over the 6 month interval. Notably,
    the evidence from [Andersson-et-al-2010]_ is limited in its application to
    our model in that it evaluated iron fortification of margarine in adult
    women, not iron fortification of flour among children.

5.  We assumed that all individuals covered by baseline coverage of iron
    fortification have been covered for at least six months, and therefore
    either

    a) They have already achieved the full age-dependent effect of the
    intervention *if* they have been eating food for at least 6 months; that is,
    they receive the full effect if they are at least one year old, or

    b) If they are less than one year old, they have started or will start
    eating fortified food as soon as they turn(ed) 6 months old, hence will
    achieve the full age-dependent effect at exactly one year old.

To summarize, we are trying to account for two factors that will decrease the
reported effect size of iron fortification on hemoglobin in our simulants:

1.  (**Age effect**) Children don't start eating (fortified) solid food until 6
    months of age, at which point the amount of (fortified) food slowly
    increases to a maximum at 2 years of age, then remains constant.

2.  (**Time-lag effect**) When someone starts eating iron-fortified food, the
    measured effect on hemoglobin increases from 0 to the maximum effect size
    over a period of 6 months.

To model these two effects mathematically, we are making the following
simplifying assumptions:

1.  The above two factors each reduce the reported Hb effect size by a multiplicative factor independent of the other factor.

2.  Each multiplicative factor is a continuous piecewise linear function of time.

We will model the age effect with a continuous piecewise linear function
``hb_age_fraction(age)``, where ``age`` is the current age of the simulant, and
we will model the time-lag effect with a continuous piecewise linear function
``hb_lag_fraction(time_since_fortified)``, where ``time_since_fortified`` is the
time since the simulant started eating iron-fortified food:

.. code-block:: Python

  def hb_age_fraction(age):
  """
  Multiplier on Hb effect size due to children eating less food at younger
  ages. (`age` is current age in years).
  """
    return 0 if age<0.5 else (age-0.5)/1.5 if age<2 else 1

  def hb_lag_fraction(time_since_fortified):
  """
  Multiplier on Hb effect size due to lag in response time to iron fortification.
  The argument `time_since_fortified` is the time (in years) since a simulant
  started eating fortified food (note that a negative value of `time_since_fortified`
  indicates the child has not yet started eating fortified food).
  """
    return (0                         if time_since_fortified < 0   else
            time_since_fortified/0.5  if time_since_fortified < 0.5 else
            1)

.. Note::

  Since children don't start eating fortified food until 6 months of age, the
  argument ``time_since_fortified`` for the time-lag function will be *either*
  the time since the child's household got fortification coverage, *or* the time
  since the child turned 6 months old, *whichever is smaller*.

.. Note::

  When calibrating hemoglobin levels for baseline coverage (see the
  :ref:`baseline calibration <baseline_calibration_hb_bw_section>` section
  below), we will use **only** the age effect to compute the treatment-deleted
  hemoglobin levels of the population (the down-shift), whereas we will use
  **both** the age effect *and* the time-lag effect when applying the treatment
  to an individual simulant (the up-shift). This will cause our population's
  overall mean hemoglobin level to be slightly lower than the value in GBD, but
  not by much, and this is the simplest strategy that seems reasonable.

.. Note::

  When calibrating hemoglobin levels for baseline coverage (see the
  :ref:`baseline calibration <baseline_calibration_hb_bw_section>` section
  below), it is *possible* (but highly unlikely with the shift size we're
  dealing with) that a simulant's hemoglobin level can end up *negative* after
  the down-shift. To correct for this possibility, if any shifted hemoblobin
  levels end up below 1 g/L, reset them to 1 g/L.


.. Note::

  As described in the :ref:`Dietary Iron Deficiency / Iron Deficiency
  Documentation <2017_cause_iron_deficiency>`, the hemoglobin shift from iron
  fortification should be applied **only** to simulants who are *iron
  responsive*. This applies both to the baseline calibration (down-shift) and
  the application of the effect size to simulants with fortification coverage
  (up-shift).

.. Note::

  When determining whether a simulant gets covered by our iron fortification
  program, the child's propensity for coverage (for the hemoglobin effect)
  should be **the same** as their mother's propensity for coverage (for the
  birthweight effect). In other words, when comparing the random propensity
  against the iron fortification :ref:`coverage function
  <coverage_algorithm_section>`, the same random number should be used to
  determine whether the child's mother ate fortified food during pregnancy as is
  used to determine whether the child eats fortified food. (In the future, we
  could consider using a positive correlation instead -- see :ref:`Model
  Limitations <limitations_section>`.)

Combining everything:

.. code-block:: Python

  import numpy as np

  # At the draw level

  # Mean difference in hemoglobin concentration due to iron fortification.
  # Note: hb_md_distribution is defined above in the effect size section.
  hb_shift = hb_md_distribution.rvs()

  # At the individual simulant level, at each time step (time = t)

  # Determine whether simulant's household is receiving iron-fortified food,
  # regardless of whether the simulant is iron-responsive
  household_covered_i(t)  = time_covered_i <= t
                          = p_i < coverage(t)

  # Effect on Hb:
  # Only adjust Hb level for iron-responsive individuals
  if iron_responsive_i:

    # Shift iron-responsive simulants' Hb levels down to calibrate for baseline coverage.
    # Take into account the age-dependent effect size but not the 6-month time lag.
    hb_i = hb_gbd(age_i(t)) - baseline_coverage(t) *  hb_age_fraction(age_i(t)) * hb_shift

    # Clip Hb value to be >=1 in case shifting made it negative
    hb_i = np.clip(hb_i, 1, np.inf)

    # Only increase simulant's Hb if their household is covered
    if household_covered_i(t):

      # Fortification starts either when the household receives coverage, or when
      # the child turns 6 months old and starts eating solids, whichever is later.
      # Recall that if the child is covered in baseline, then time_covered_i =
      # float('-inf'), so coverage always starts at age 6 months in this case.
      time_since_fortified_i(t) = min(t - time_covered_i, age_i(t) - 0.5)

      # Increase simulant's Hb level by the appropriate amount, taking into
      # account both the age-dependent effect and the 6-month time lag.
      hb_i += (hb_lag_fraction(time_since_fortified_i(t))
               * hb_age_fraction(age_i(t))
               * hb_shift
              )
.. todo::

  Edit the above code to define all variables, make it valid Python code, and
  comment appropriately.
  Here are the relevant definitions from the :ref:`summary <iron_intervention_summary>` below:

  .. code-block:: Python

    ## Definitions:
    t                         := current time in years (t=0 is start of simulation)
    time_covered_i            := time at which simulant's household first receives iron-fortified food
    household_covered_i(t)    := whether child's household is receiving iron-fortified food at time t
    time_since_fortified_i(t) := amount of time since child started eating iron-fortified food
    hb_i(t)                   := simulant's hemoglobin level after adjusting for fortification

    ## Population level parameters:
    hb_shift                = effect size on hemoglobin (normally disributed, mean 3.0 g/L)
    baseline_coverage(t)    = iron fortification coverage at time t in baseline scenario
      # Note: If delta_coverage is defined to be 0 in the baseline scenario, then
      # the below strategies should work in both baseline and intervention scenarios
    delta_coverage(t)       = change in coverage from baseline
    coverage(t)             = iron fortification coverage at time t
                            = baseline_coverage(t) + delta_coverage(t)

    ## Individual level attributes:
    p_i                     = propensity of simulant and mother for exposure to lack of iron fortification (~uniform(0,1))
    iron_responsive_i       = whether individual is responsive to iron fortification
    age_i(t)                = the simulant's age at time t
    hb_gbd(age_i(t))        = hemoglobin level drawn from GBD for simulant
      # This time is float('-inf') if simulant is covered in baseline
      # and float('inf') if simulant never gets covered
    time_covered_i          = argmin_t(p_i < coverage(t) == True)

See below for a visual representation:

.. image:: iron_effect_scale_up.svg

.. todo::

  Edit the above graph to reflect the modified definition of our effect size.
  Namely:

  - The blue line represents the effect size as a function of age, ``hb_shift *
    hb_age_fraction(age)``, without taking the time lag into account.

  - To get the graph for "coverage gained at birth" (or "coverage gained before
    6 months"), modify the blue graph by making it a quadratic function between
    0.5 years and 1 year, such that the graph has slope 0 at 6 months and
    remains continuous.

  - The graph for "coverage gained at 1 year" should be slightly curved
    (quadratic concave up) between 1 year and 1.5 years.

Birth Weight
^^^^^^^^^^^^

Our model will apply the effect size of maternal consumption of iron fortified
foods on infant birth weight under the following assumptions:

1. Maternal consumption of iron fortified foods must begin at week 20
(CI: 8 - 28) of gestation and continue for a minimum of seven weeks in
order to impact infant birth weight. This assumption is based on the studies
included in [Haider-et-al-2013]_ that evaluated birth weight as an outcome.
These studies initiated iron supplementation between approximately 8 and 28
weeks of gestation, with most studies initiating approximately at week 20 of
gestation. Additionally, [Haider-et-al-2013]_ found that supplementation
duration did not affect the relationship between maternal iron consumption
and birth weight after adjusting for dose; however, the minimum duration of
iron consumption in the studies included in [Haider-et-al-2013]_ was seven
weeks, so we used this as the lower bound of necessary duration for our model.

2. A simulant's birth weight will affect their all-cause mortality rate
during the early and late neonatal periods only. This assumption is a
product of assumptions made in the modeling of the low birth weight and
short gestation (LBWSG) risk factor in GBD (see :ref:`Low Birth Weight and Short Gestation (LBWSG) <2017_risk_lbwsg>`)

Therefore, for our simulation, an infant's mother must have gained coverage
to iron fortified foods at least **twenty weeks prior to the birth of the
infant** in order for the iron fortification coverage to affect the infant's
birth weight.

.. todo::

  The algorithm description below assumes each simulant was born at 40 weeks of
  gestation. This needs to be updated to to use the actual gestational ages of
  simulants.

  Also, the below algorithm has an incorrect description of how to initialize
  the variable `mother_ate_iron_fortified_food` at the beginning of the
  simulatiton. Namely, `mother_ate_iron_fortified_food` should be initialized to
  `Unknown` for **all** simulants at the beginning of the simulation because 1)
  we do not know the joint distribution of fortification status and birthweight
  for any age group other than at birth, and 2) the variable
  `mother_ate_iron_fortified_food` is not needed for anything except adjusting
  the birthweight of simulants when they are born.

  For the algorithm to take actual gestational ages into account and to
  correctly initialize the simulation, see the :ref:`iron intervention summary
  <iron_intervention_summary>`.

.. note::

  The following was adopted from Nathaniel's description for folic acid.

1.  **Define variables:** Each simulant needs an attribute
    `mother_ate_iron_fortified_food`, which will be `True` if the simulant's
    mother ate iron fortified food starting at least twenty weeks (0.384 years)
    before the simulant was born, `False` if not, and `Unknown` if we don't
    know.

2.  **Initialize the simulation:** At the start of the simulation for the early
    neonatal and late neonatal age groups (IDs 2 and 3), set
    `mother_ate_iron_fortified_food = True` for a proportion of the population
    equal to baseline coverage of iron fortification at simulation start,
    :math:`C(0)`. Set `mother_ate_iron_fortified_food = False` for a proportion
    of the population equal to 1 - baseline coverage of iron fortification at
    simulation start, :math:`1 - C(0)`. Set
    `mother_ate_iron_fortified_food = Unknown` for all simulants in the
    post-neonatal and 1-4 year age groups (age_group_ids 4 and 5);
    this attribute will not be used for these age groups because the
    LBWSG risk factor does not apply.

3.  **Initialize simulants born into the simulation:** For each simulant born
    at time :math:`t` (in years), the probability that the simulant's mother
    started eating iron-fortified food at nine months prior is equal to the
    population coverage nine months prior, :math:`C(t-0.384)`. Therefore, upon
    the simulant's birth, do the following:

  a.  Generate a uniform random number :math:`u\sim
      \operatorname{Uniform}([0,1])`.

  b.  If :math:`u<C(t-0.384)`, `mother_ate_iron_fortified_food = True`;
      else `mother_ate_iron_fortified_food = False`.

Note that we are assuming that once someone (in this case the baby's mother)
starts eating fortified food, they will continue to eat the fortified food
forever. Additionally, we assume that those covered by baseline coverage of
iron fortification have been covered for at least nine months + age of the
simulant (in the case of simulants initialized in the early neonatal age
groups).

Then, for simulants with the attribute `mother_ate_iron_fortified_food = True`,
the effect size of iron fortification on birth weight, calculated as described
above (`Effect Size - Iron`_), should be applied as an **additive shift** to
the simulant's birth weight (in grams). This may then impact their LBWSG risk
category.

Application of Effect to Simulants - Iron
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Hemoglobin Level
^^^^^^^^^^^^^^^^

The age- and time-dependent effect of iron fortification on a simulant's
hemoglobin level (as documented in the sections above) should be **additively**
applied to a simulant's hemoglobin level, as sampled as described in the
:ref:`Dietary Iron Deficiency / Iron Deficiency Documentation <2017_cause_iron_deficiency>`
given that the simulant's value for *iron_responsive_i* (as described in the
:ref:`Dietary Iron Deficiency / Iron Deficiency Documentation <2017_cause_iron_deficiency>`)
is equal to 1. The age- and time-dependent intervention effect should be
applied *each time* that a simulant's hemoglobin level is sampled. If a
simulant's value for *iron_responsive_i* = 0, the effect of the intervention
should **not** be applied to that simulant.

.. note::

  See Baseline Calibration section below for special considerations regarding baseline coverage

Birth Weight
^^^^^^^^^^^^

The time-dependent effect of iron fortification on a simulant's birth weight
(as documented in the sections above) should be **additively** applied to a
simulant's birth weight after it is sampled as a value *in grams* from the
continuous distribution of birth weight from the low birthweight short
gestation risk factor. The simulant's birthweight post-application of the
effect size will then be used to determine the simulant's LBWSG risk category.

.. note::

  See Baseline Calibration section below for special considerations regarding baseline coverage

.. todo:: Reference the LBWSG risk factor page when complete

.. _baseline_calibration_hb_bw_section:

Baseline Calibration -- Hemoglobin and Birth Weight
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

A simulant's individual hemoglobin/birthweight value *in the baseline scenario* should be assigned as follows:

.. math::

  π_i =	\hat{π}_{GBD}(i) - αp + αx_i

Where,

.. list-table:: Parameter Definitions
  :widths: 5 15
  :header-rows: 1

  * - Parameter
    - Definition
  * - :math:`p`
    - Proportion covered by iron fortification in the baseline scenario
  * - :math:`α`
    - Effect size (age-dependent, sampled from distribution as described above)
  * - :math:`π_{GBD}`
    - Mean birthweight or hemoglobin from GBD
  * - :math:`\hat{π}_{GBD}(i)`
    - Unadjusted individual simulant birthweight/hemoglobin sampled from a draw of GBD's exposure distribution
  * - :math:`x_{i}`
    - Whether an individual simulant is covered (1=yes, 0=no)
  * - :math:`π_{i}`
    - Draw of birthweight/hemoglobin distribution according to GBD

See the proofs for this approach below.

.. image:: baseline_calibration_proofs.PNG

.. todo::

  Update the above description to reflect the slightly more complicated
  situations we actually have for Hb and birthweight. Namely:

  * The birthweight effect size is not constant but varies between simulants.
    Therefore, when we down-shift to account for baseline coverage at the
    population level, we use the **average** effect size for the population,
    whereas when we up-shift individual birthweights based on coverage status,
    we use each simulant's individual effect size.

  * The effect size for Hb depends on the simulant's age and has a time lag
    during which the effect scales up from zero to the full age-dependent effect
    size. Therefore, when we down-shift to account for baseline coverage at the
    population level, we use only the age-dependent effect size, whereas when we
    up-shift individual Hb levels based on coverage status, we use both the
    age-dependent effect size **and** the time lag for the individual simulant
    based on when they got coverage.

  For the algorithms to account for these complexities, see the :ref:`iron
  intervention summary <iron_intervention_summary>` below.

.. _iron_intervention_summary:

Summary of Iron Intervention Algorithm
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The following Python-style pseudocode summarizes all the pieces of the iron
intervention algorithm (for both birthweight and hemoglobin) and is intended to
resolve any ambiguities that may remain in the descriptions above. However, some
details for implementation are only provided above, and if any ambiguities or
inconsistencies remain in the algorithm below, the above descriptions may help
clarify the intent.

.. code-block::

  import numpy as np

  ## Definitions:
  t                         := current time in years (t=0 is start of simulation)
  n_weeks                   := n weeks, measured in years (e.g. n*7/365.25)
  time_covered_i            := time at which simulant's household first receives iron-fortified food
  mother_fortified_i        := whether simulant's mother got sufficient iron fortification during pregnancy
  household_covered_i(t)    := whether child's household is receiving iron-fortified food at time t
  time_since_fortified_i(t) := amount of time since child started eating iron-fortified food
  birthweight_i             := simulant's birthweight after adjusting for mother's iron consumption
  hb_i(t)                   := simulant's hemoglobin level after adjusting for fortification

  ## Population level parameters:
  hb_shift                = effect size on hemoglobin (normally disributed, mean 3.0 g/L)
  bw_response             = dose-response on birthweight (normally distributed, mean 1.51 g/mg)
  baseline_coverage(t)    = iron fortification coverage at time t in baseline scenario
    # Note: If delta_coverage is defined to be 0 in the baseline scenario, then
    # the below strategies should work in both baseline and intervention scenarios
  delta_coverage(t)       = change in coverage from baseline
  coverage(t)             = iron fortification coverage at time t
                          = baseline_coverage(t) + delta_coverage(t)
    # Multiplier on hb_shift due to age
  hb_age_fraction(age)    = 0 if age<0.5 else (age-0.5)/1.5 if age<2 else 1
    # Multiplier on hb_shift due to lag in response time
    # Negative time_since_fortified indicates child has not yet started eating fortified food
  hb_lag_fraction(time_since_fortified)
                          = (0 if time_since_fortified < 0 else
                             time_since_fortified/0.5 if time_since_fortified < 0.5 else
                             1)
  iron_concentration      = concentration X of iron in the country's fortified flour
  flour_consumption_dist  = distribution of flour consumption Y among women who eat flour
  mean_flour_consumption  = mean flour consumption among women who eat flour
                          = flour_consumption_dist.mean()
  mean_bw_shift           = mean birthweight shift among babies whose mothers ate fortified flour
                          = bw_response * iron_concentration * mean_flour_consumption

  ## Individual level attributes:
  p_i                     = propensity of simulant and mother for exposure to lack of iron fortification (~uniform(0,1))
  gestational_age_i       = gestational age of simulant drawn from GBD (in years)
  birthweight_gbd_i       = birthweight drawn from GBD for simulant
  iron_responsive_i       = whether individual is responsive to iron fortification
  age_i(t)                = the simulant's age at time t
  hb_gbd(age_i(t))        = hemoglobin level drawn from GBD for simulant
    # This time is float('-inf') if simulant is covered in baseline
    # and float('inf') if simulant never gets covered
  time_covered_i          = argmin_t(p_i < coverage(t) == True)

  ## At beginning of simulation (t=0):
  mother_fortified_i    = 'unknown'
    # Use GBD birthweight distribution at start of sim, because we don't have
    # the need or the data to stratify by baseline coverage after birth.
  birthweight_i         = birthweight_gbd_i

  ## When simulant is born in the simulation (t = time of birth):
  critical_time       = t - gestational_age_i + 20_weeks
  mother_fortified_i  = ((time_covered_i <= critical_time) and
                         (t-time_covered_i) >= 7_weeks)
    # Effect on birthweight:
    # Shift everyone's birthweight down to calibrate for baseline coverage
  birthweight_i       = birthweight_gbd_i - baseline_coverage(t) * mean_bw_shift
    # If simulant's mother was fortified, shift their birthweight up accordingly
  if mother_fortified_i:
    daily_flour_i     = average daily flour consumption Y of simulant's mother
                      = flour_consumption_dist.rvs()
    bw_shift_i        = birthweight shift Z given flour consumption Y
                      = bw_response * iron_concentration * daily_flour_i
    birthweight_i     += bw_shift_i

  ## At each simulation time step (including t=0 or when simulant is born):
    # Determine whether simulant's household is receiving iron-fortified food,
    # regardless of whether the simulant is iron-responsive
  household_covered_i(t)  = time_covered_i <= t
                          = p_i < coverage(t)
    # Effect on Hb:
    # Only adjust Hb level for iron-responsive individuals
  if iron_responsive_i:
      # Shift iron-responsive simulants' Hb levels down to calibrate for baseline coverage.
      # Take into account the age-dependent effect size but not the 6-month time lag.
    hb_i = hb_gbd(age_i(t)) - baseline_coverage(t) *  hb_age_fraction(age_i(t)) * hb_shift
      # Clip Hb value to be >=1 in case shifting made it negative
    hb_i = np.clip(hb_i, 1, np.inf)
      # Only increase simulant's Hb if their household is covered
    if household_covered_i(t):
        # Fortification starts either when the household receives coverage, or when
        # the child turns 6 months old and starts eating solids, whichever is later.
        # Recall that if the child is covered in baseline, then time_covered_i =
        # float('-inf'), so coverage always starts at age 6 months in this case.
      time_since_fortified_i(t) = min(t - time_covered_i, age_i(t) - 0.5)
        # Increase simulant's Hb level by the appropriate amount, taking into
        # account both the age-dependent effect and the 6-month time lag.
      hb_i += (hb_lag_fraction(time_since_fortified_i(t))
               * hb_age_fraction(age_i(t))
               * hb_shift
              )

.. _folic_acid_intervention_section:

Folic Acid Fortification
++++++++++++++++++++++++

.. _below:

Population Coverage Data - Iron and Folic Acid
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Our `coverage algorithm`_ for folic acid and iron will use three parameters,
:math:`a`, :math:`b`, and :math:`c`, which describe the existing level of
fortification as well as the potential level of fortification we can hope to
achieve. The food vehicle we are using for folic acid and iron is wheat flour.

.. list-table:: Definitions of the coverage parameters :math:`a`, :math:`b`, and :math:`c`
  :widths: 1 5
  :header-rows: 1

  * - Parameter
    - Definition
  * - :math:`a`
    - Proportion of the population who eats a fortified version of the vehicle
  * - :math:`b`
    - Proportion of the population who eats a fortifiable version of the vehicle
  * - :math:`c`
    - Proportion of the population who eats the vehicle

Note that by definition we must have :math:`a\le b\le c`. We will assume that
:math:`a`, :math:`b`, and :math:`c` are random variables distributed as
described in the following subsections. The following table provides estimates
of :math:`a`, :math:`b`, and :math:`c` for our locations (data sources listed
below). The food vehicle referred to in the table is wheat flour.

.. list-table:: Means and 95% CI's for existing population coverage of folic acid fortification (% of total population)
  :widths: 5 5 5 5
  :header-rows: 1

  * - Location
    - :math:`a` = Eats fortified vehicle
    - :math:`b` = Eats fortifiable vehicle
    - :math:`c` = Eats vehicle
  * - Ethiopia
    - 1.0
    - 15.0 (10.0, 20.0)
    - 28.0 (23.0, 33.0)
  * - India (Rajsathan)
    - 6.3 (4.8, 7.9)
    - 7.1 (5.6, 9.1)
    - 83.2 (79.5, 86.5)
  * - Nigeria (Kano)
    - 22.7 (20.0, 25.5)
    - 83.8 (81.4, 86.2)
    - 83.9 (81.5, 86.3)
  * - Nigeria (Lagos)
    - 5.4 (3.8, 6.9)
    - 13.8 (11.5, 16.1)
    - 14.2 (11.8, 16.5)

The above data for India and Nigeria is from Table 4 in [Aaron-et-al-2017]_. The
numbers for Ethiopia were chosen so that (i) the mean of existing wheat flour
fortification coverage is close to 0, based on
[GFDx-Ethiopia-Fortification-Dashboard]_, (ii) the percentage of people eating
wheat flour is 28%, based on [Ethiopian-Federal-Ministry-of-Health-2011]_, (iii)
between 53% and 55% of the wheat flour is fortifiable, based data from
[Ethiopian-Federal-Ministry-of-Health-2011]_ and
[GFDx-Ethiopia-Fortification-Dashboard]_, and (iv) the 95% confidence intervals
for :math:`b` and :math:`c` have a width of 10% (chosen arbitrarily).

Marginal distributions of :math:`a`, :math:`b`, and :math:`c`
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

**For Ethiopia parameter** :math:`a`, assume

.. math::

  a_\textit{Ethiopia} \sim \operatorname{Beta}(0.1,9.9).

This `Beta distribution <https://en.wikipedia.org/wiki/Beta_distribution>`_ will
have a mean of 1% as in the table, since

.. math::

  E[a_\textit{Ethiopia}] = \frac{\alpha}{\alpha+\beta}
  = \frac{0.1}{0.1+9.9} = 0.01.

The density has an asymptote at 0 and an x-intercept at 1.

.. warning::

  We may need to change the parameters of this distribution (and/or the
  distributions of b and c for Ethiopia) to make sure that we can find a joint
  distribution that guarantees a<b<c almost surely. In particular, the
  distribution functions need to satisfy :math:`F_a(x)\ge F_b(x)\ge F_c(x)`.

.. todo::

  Check whether our definitions of the distributions of a,b, and c for Ethiopia
  are compatible with the requirement that a<b<c. Once we finalize the
  distributions, add graphs of the beta distributions of a, b, and c for
  Ethiopia.


.. _GFDx Ethiopia Dashboard: https://fortificationdata.org/country-fortification-dashboard/?alpha3_code=ETH&lang=en

**For all the remaining parameters**, use a Beta
distribution with mean equal to the central estimate, and variance equal to the
variance of a normal distribution with the same mean and 95% confidence
interval. Here is Python code for achieving this:

.. code-block:: Python

  # https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.norm.html
  # https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.beta.html
  import scipy.stats.norm
  import scipy.stats.beta

  def beta_a_b_from_mean_var(mean, variance):
  """
  Returns the parameters a=alpha and b=beta for a beta distribution
  with the specified mean and variance.
  """
    if mean <= 0 or mean >= 1:
        raise ValueError("Mean must be in the interval (0,1)")
    if variance >= mean*(1-mean):
        raise ValueError("Variance too large")

    # For derivations of these formulas, see:
    # https://en.wikipedia.org/wiki/Beta_distribution#Mean_and_variance
    a = mean*(mean*(1-mean)/variance - 1)
    b = (1-mean)*(mean*(1-mean)/variance - 1)
    return a, b

  def normal_stdev_from_mean_quantile(mean, quantile, quantile_rank):
  """
  Computes the standard deviation of a normal distribution that has the
  specified mean and quantile.
  """
    # If q = quantile, mu = mean, and sigma = std deviation, then
    # q = mu + q'*sigma, where q' is the standard normal quantile
    # and q is the transformed quantile, so sigma = (q-mu)/q'
    return (quantile - mean) / scipy.stats.norm().ppf(quantile_rank)

  def beta_from_mean_approx_quantile(mean, approx_quantile, quantile_rank):
    """
    Returns a scipy.stats Beta distribution with the specified mean and a
    quantile of rank quantile_rank approximately equal to approx_quantile.
    This is achieved by specifying that the variance of the Beta distribution
    is equal to the variance of a normal distribution with the same mean and
    the specified quantile.
    """
    variance = normal_stdev_from_mean_quantile(mean, approx_quantile, quantile_rank)**2
    a,b = beta_a_b_from_mean_var(mean, variance)
    return scipy.stats.beta(a,b)

  # Example usage - distribution of parmeter a for India (Rajsathan)
  mean = 6.3 / 100
  q_975 = 7.9 / 100

  india_a_distribution = beta_from_mean_approx_quantile(mean, q_975, 0.975)

Here are the graphs of the Beta distributions for India (Rajasthan), Nigeria
(Kano), and Nigeria (Lagos):

.. image:: coverage_india_nigeria.svg

Obtaining national estimates of :math:`a`, :math:`b`, and :math:`c`
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

For **Ethiopia**, the estimates in the table are already at the national level.

For **India**, we will assume that the national values for :math:`a`, :math:`b`,
and :math:`c` are the same as those in Rajasthan listed in the table.

Fpr **Nigeria**, to estimate the national coverage levels :math:`a`, :math:`b`,
and :math:`c`, we will take a population-weighted average of the corresponding
values for Kano and Lagos. Kano has a population of about 4 million, and Lagos
has a population of about 21 million, so we have

.. math::

  a_\textit{Nigeria}
  = \tfrac{4}{25} a_\textit{Kano} + \tfrac{21}{25} a_\textit{Lagos},

and similarly for :math:`b` and :math:`c`.

`Couple <https://en.wikipedia.org/wiki/Coupling_(probability)>`_ the random
variables :math:`a_\textit{Kano}` and :math:`a_\textit{Lagos}` `comonotonically
<comonotonicity_>`_, i.e. by giving them the same `percentile rank`_ (and
similarly for :math:`b` and :math:`c`). This can be done by using `inverse
transform sampling`_ to generate :math:`a_\textit{Kano}` and
:math:`a_\textit{Lagos}` from a single uniform random variable :math:`u`. This
is the same strategy described below for coupling :math:`a`, :math:`b`, and
:math:`c` for each location, so only one uniform random variable :math:`u` will
be needed to generate all six subnational numbers for Nigeria, which will then
be averaged as described above to obtain the national estimates.

The comonotone coupling strategy will create greater uncertainty in the weighted
average :math:`a_\textit{Nigeria}` than if we sampled the two estimates
indepdently, and more uncertainty seems like a good idea since we're trying to
estimate an average for the entire country based on only two data points.
Moreover, this coupling seems plausible since the data for Kano and Lagos were
from the same paper and therefore could have a similar bias.

.. todo::

  List the means of a, b, and c for Nigeria, for easier validation later.
  Also, draw histograms for the distributions.

Joint distribution of :math:`a`, :math:`b`, and :math:`c`
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

To ensure that :math:`a\le b\le c` for each location, we will use the
`comonotone coupling <comonotonicity_>`_ of the three random variables. That is,
for each location, sample :math:`a`, :math:`b`, and :math:`c` so that they all
have the same `percentile rank`_ in their respective distributions. This can be
done by using `inverse transform sampling`_ to generate all three variables
(:math:`a`, :math:`b`, and :math:`c`) from a single uniform random variable
:math:`u`.

*Between* countries, the random vectors :math:`(a,b,c)` should be independent.
That is, the three random vectors :math:`(a,b,c)_\textit{Ethiopia}`,
:math:`(a,b,c)_\textit{India}`, and :math:`(a,b,c)_\textit{Nigeria}` should be
mutually independent, where :math:`(a,b,c)_\text{[country\_name]}` indicates
the national estimate  obtained as described above.

.. todo::

  Check whether a, b, and c are actually `stochastically ordered <stochastic
  ordering_>`_, i.e. their distribution functions satisfy :math:`F_a(x)\ge
  F_b(x)\ge F_c(x)` for all x. If not, then then the comonotone coupling won't
  work for guaranteeing :math:`a\le b\le c`, and neither will any other. This
  could particularly be a problem for Ethiopia since the distribution for
  :math:`a` has a different form from the others.

  Experiment with different joint distributions to see what what happens to the
  overall uncertainty. For example, it may be better to generate a,b,c
  independently, then resample until a<b<c (i.e. use the independent coupling
  and condition on the event that a<b<c), because I think that will produce more
  variation in the differences c-b and b-a, which may better reflect our
  uncertainty. Note that then the marginal distributions won't exactly match the beta distributions listed above, but that's less important than making sure we have :math:`a\le b\le c` and we are getting reasonable estimates for our overall uncertainty.

.. _percentile rank: https://en.wikipedia.org/wiki/Percentile_rank
.. _comonotonicity: https://en.wikipedia.org/wiki/Comonotonicity
.. _inverse transform sampling: https://en.wikipedia.org/wiki/Inverse_transform_sampling
.. _stochastic ordering: https://en.wikipedia.org/wiki/Stochastic_ordering

.. _here:
.. _coverage_algorithm_section:

Coverage Algorithm
~~~~~~~~~~~~~~~~~~

.. todo::

  Add a section to describe baseline coverage, and edit this section
  appropriately, as per James's recommendation in `PR 170
  <https://github.com/ihmeuw/vivarium_research/pull/170>`_:

    This is fine, because we talked an I know what you mean. I have a very
    strong preference for the following framing though:

    1.  Provide a description of how to find coverage in the baseline scenario.
        (and background info as necessary).
    2.  In a separate section, describe the intervention as a change in
        baseline coverage. (e.g. your intervention provides a delta_C(t),
        describe that).

    This framing is a little more complicated, but handles far more of our
    intervention cases. Frequently baseline coverage will have it's own time
    trends or other complicating factors. This framing provides a very clear
    description on exactly what your intervention is, as well.

  Also, add plots of coverage to show the time trend for a few versions of a, b,
  c.

Let :math:`C(t)` denote the proportion of the population receiving fortified
food (i.e. the population coverage) after :math:`t` years, and let :math:`t_0`
denote the time at which our folic acid intervention starts. Our folic acid
intervention algorithm is described by the following formula:

.. math::

  C(t) =
  \begin{cases}
  a & \text{if $t<t_0$,}\\
  c - (c-b)(1-r)^{t-t_0} & \text{if $t\ge t_0$,}
  \end{cases}

where :math:`a`, :math:`b`, and :math:`c` are the randomly sampled population
coverage data estimates from the `previous section <population coverage data -
iron and folic acid_>`_, and :math:`r \in [0,1]` is a user-defined parameter
representing the proportion of people each year who start off eating an
unfortifiable version of the vehicle but can be convinced to switch to the
fortified vehicle.

By default, we will start the intervention at :math:`t_0 = 1 \text{ year}`, and we'll assume :math:`r = 0.1` (i.e. an additional 10% of unfortifiable food will be converted to fortified food each year). We may later want to specify different values of these parameters for different locations.

In words, our intervention algorithm does the following:

1.  Before time :math:`t_0`, the population coverage is the proportion
    :math:`a` of people who are already eating the fortified vehicle.

2.  At time :math:`t_0`, the population coverage jumps immediately from
    :math:`a` to :math:`b`, i.e. the proportion of people who are already eating
    a fortifiable version of the vehicle. The rationale for this instantaneous
    jump is that the government begins enforcing fortification laws at time
    :math:`t_0`.

3.  After time :math:`t_0`, we run a campaign to convince people to switch to
    fortified food, and the rate of conversion is controlled by the parameter
    :math:`r`, with values close to 0 modeling a slow conversion, and values
    close to 1 modeling a fast conversion.

4.  If we let the simulation run forever, the population coverage would
    approach the value :math:`c`, i.e. the proportion of people who eat the
    vehicle.

Determining Whether a Simulant Is Affected - Folic Acid
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Folic acid `reduces the birth prevalence <effect size - folic acid_>`_ of :ref:`neural tube
defects (NTDs) <2017_cause_neural_tube_defects>`. In order for a newborn to have
a reduced risk of NTDs, the baby's mother needs sufficient folate intake *three
months prior to pregnancy*. Therefore, only babies born to mothers who have been
eating folic-acid-fortified food for at least a year will have any benefit from
our intervention.

With this in mind, here is the algorithm to determine whether a simulant is affected by the folic acid intervention:

1.  **Define variables:** Each simulant needs an attribute
    `mother_ate_folate_fortified_food`, which will be `True` if the simulant's
    mother ate folic-acid-fortified food starting at least a year before the
    simulant was born, `False` if not, and `Unknown` if we don't know.

2.  **Initialize the simulation:** At the start of the simulation, set
    `mother_ate_folate_fortified_food = Unknown` for all simulants; this
    attribute will not be used after the simulant is born, so the value is
    irrelevant for all simulants already alive at the start of the simulation.

3.  **Initialize simulants born into the simulation:** For each simulant born
    at time :math:`t` (in years), the probability that the simulant's mother
    started eating folic-acid-fortified food at least a year ago is equal to the
    population coverage one year ago, :math:`C(t-1)`. Therefore, upon the
    simulant's birth, do the following:

  a.  Generate a uniform random number :math:`u\sim
      \operatorname{Uniform}([0,1])`.

  b.  If :math:`u<C(t-1)`, set `mother_ate_folate_fortified_food = True`;
      otherwise set `mother_ate_folate_fortified_food = False`.

Note that we are assuming that once someone (in this case the baby's mother)
starts eating fortified food, they will continue to eat the fortified food
forever.

The variable `mother_ate_folate_fortified_food` will be used to determine the
probability that the newborn has a neural tube defect. We will describe how to
do this below.

Effect Size - Folic Acid
~~~~~~~~~~~~~~~~~~~~~~~~

Folic acid fortification reduces the birth prevalence of :ref:`neural tube
defects (NTDs) <2017_cause_neural_tube_defects>`. The effect size is measured as
a risk ratio (RR), where we think of "no fortification" as a risk factor:

.. math::

  RR = \frac{P(\text{born with NTD} \mid \text{no fortification})}
  {P(\text{born with NTD} \mid \text{fortification})}
  \approx \frac{1}{0.59\: (0.49, 0.70)}
  \approx 1.71\: (1.43, 2.04).

We are estimating this effect size as the reciprocal of the odds ratio (OR) of
:math:`0.59\: (0.49, 0.70)` found in the [Keats-et-al-2019]_ review; this odds
ratio is the ratio of the odds of being born with NTDs in the fortified
population to the odds of being born with NTDs in the unfortified population.
Since the prevalence of NTDs is small, the odds ratio is very close to the
risk ratio.

To model the uncertainty in the estimate, the above RR should be drawn from a
`lognormal <https://en.wikipedia.org/wiki/Log-normal_distribution>`_
distribution with median = 1.71, 2.5\ :superscript:`th`-percentile = 1.43, and
97.5\ :superscript:`th`-percentile = 2.04. This distbibution can be created
using `SciPy's lognorm function
<https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.lognorm.html>`_
as follows:

.. code-block:: Python

  from numpy import log
  from scipy.stats import norm, lognorm

  # median and 0.975-quantile of lognormal distribution for RR
  median = 1.71
  q_975 = 2.04

  # 0.975-quantile of standard normal distribution (=1.96, approximately)
  q_975_stdnorm = norm().ppf(0.975)

  mu = log(median) # mean of normal distribution for log(RR)
  sigma = (log(q_975) - mu) / q_975_stdnorm # std dev of normal distribution for log(RR)

  # Frozen lognormal distribution for RR, representing uncertainty in our effect size
  # (s is the shape parameter; the scale parameter is exp(mu), which equals the median)
  rr_distribution = lognorm(s=sigma, scale=median)

.. note::

	The same draws from this distribution should be applied to each model
	location as described in the `Model Randomness`_ section


Determining Whether a Simulant Is Born with an NTD
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The procedure here uses the standard coverage gap framework:

1.  Compute a PAF based on the above RR
2.  When a simulant is born, check whether `mother_ate_folate_fortified_food` is
    `True`. If so, the simulant has an NTD with probability equal to the
    risk-deleted prevalence birth_prevalence_of_NTDs_in_GBD * (1-PAF).
    Otherwise, their risk of an NTD is increased by a factor of RR, to
    birth_prevalence_of_NTDs_in_GBD * (1-PAF) * RR.
3.  Once the simulant's probability of an NTD is determined, decide whether
    they actually have one using standard inverse transform sampling for the
    Bernoulli random variable representing the simulant's
    `has_neural_tube_defect` boolean attribute.

.. todo::

  Write this out more explicitly. Perhaps it would be better to merge
  `Determining Whether a Simulant Is Affected - Folic Acid`_ into this section.

Desired Model Outputs
---------------------

.. list-table:: Priorities
  :widths: 1 5
  :header-rows: 1

  * - Priority
    - Definition
  * - High
    - Needed for client
  * - Med
    - Needed for primary verification & validation
  * - Low
    - Needed for additional V&V or research

.. csv-table:: Output Summary
  :file: output_summary.csv
  :widths: 1 1 1 1 1 1 1
  :header-rows: 1

.. _stratification_groups_table:

.. csv-table:: Output Stratification Groups
  :file: stratification_groups.csv
  :widths: 1 1 1 1 1 1
  :header-rows: 1

.. csv-table:: Final Outputs
  :file: final_outputs.csv
  :widths: 1 1 1 1 1 1
  :header-rows: 1

.. _raw_outputs_table:

.. csv-table:: Raw Outputs
  :file: raw_outputs.csv
  :widths: 1 1 1 1 1 1
  :header-rows: 1

.. note::

  To compute the `variance <https://en.wikipedia.org/wiki/Variance>`_ of a
  random variable :math:`Y` in the population (e.g. a risk exposure variable
  like hemoglobin level or birthweight), there are (at least) two possible
  options for the raw outputs to report in ``output.hdf``. Both options require
  calculating the *mean* of :math:`Y` as well as the variance, as well as the
  *size of the population* for which the mean and variance are computed:

  1.  Directly record the *mean* and *variance* of :math:`Y` for the population
      in each random seed, then use the `law of total variance
      <https://en.wikipedia.org/wiki/Law_of_total_variance>`_ (equivalently, the
      `formula for the variance of a mixture distribution
      <https://en.wikipedia.org/wiki/Mixture_distribution#Moments>`_) to compute
      the variance of the population in each draw when aggregating over random
      seeds. That is, compute the variance of the means of :math:`Y` over the
      random seeds and the mean of the variances of :math:`Y` over the random
      seeds, and add these together. The mean of the variances and the variance
      of the means are both weighted averages, so for each random seed we need
      the *size of the population* for which the mean and variance are being
      computed in order to weight the components appropriately.

  2.  Record the *first moment* :math:`\sum_i y_i` and *second moment*
      :math:`\sum_i y_i^2` of :math:`Y` with respect to population measure, then
      compute the variance of :math:`Y` using the formula
      :math:`\operatorname{Var}(Y) = \operatorname{E}(Y^2) -
      (\operatorname{E}Y)^2`. With this option, the first and second moments can
      be aggregated over random seeds simply by summing. Again, for each random
      seed we also need the *size of the population* for which the mean and
      variance are being computed, as the population size appears in the
      denominator. **Caution:** It is possible that this method can become
      `numerically unstable
      <https://en.wikipedia.org/wiki/Algorithms_for_calculating_variance>`_ if
      the population is very large or the values of :math:`Y` are very large,
      though typically this should not be a problem.

  **To do:** Include more explicit formulas to make it clearer exactly how the
  above two options work. Also, be clearer with notation for population
  parameters vs. statistical estimates. Perhaps researchers should also figure
  out if the law of total variance (approach 1) is preferred to the moments
  approach (2) because of the numerical stability issue. Also, somewhere
  (perhaps not here) describe the alternative definitions of mean and variance
  as person-time-weighted averages.

.. todo::

  - Finish the above tables and add clarifying descriptions:

    - Add birthweight mean and variance to Raw Outputs table
    - Add "low birthweight proportion" to Final Outputs table, and add
      corresponding numerator and denominator to Raw Outputs
    - Add coverage person-time to Raw Outputs table and Output Summary table
    - Add incidence and remission to Final Outputs table and corresponding
      transition counts to Raw Outputs table
    - Add "Additional Stratification" variables to Raw Outputs table
    - Change definitions of stratification groups to match our actual
      outputs (which are defined at time of measurement rather than simulant
      initialization)
    - Split the "Live births with {cause state}" entry of Raw Outputs into two
      rows, one for NTDs and one for LRI (to allow for different specification
      of Additional Stratification)
    - Be consistent with "first moment" and "second moment" terminology
    - Add a note about which measures fall into the generic category of "state
      person time"
    - Add note to clarify that simulants "at a particular age" means simulants
      who turned that age between the last timestep and the current timestep
    - Add note to clarify that all measures are for *live* simulants (i.e. make
      sure to filter to "alive" simulants when doing stratification)
    - Similar to the note about variance, specify more carefully how to compute
      means and how to either aggregate them as population-weighted averages or report raw moments and populations

  - Take into account Ali's feedback from PR 226:

      I wonder if there should be an additional column in the Priorities table
      that indicates what stage of the model building process an output for V&V
      is needed. I know we had previously discussed that once we validated a
      cause model that it likely isn't necessary to report the state person time
      for that cause any more. I'm not sure what the best way to include this
      information might be, but maybe some columns that have the status of V&V
      for each model build by the software engineers and then we can update the
      table as "verified; no longer needed"? And others that we might want to
      keep in as "verified; still needed". Just a thought! Otherwise I think
      this looks good.

Observers
+++++++++

Verification and Validation Strategy
------------------------------------

Design Decisions and Rationales
-------------------------------

Do not explicitly model neonatal disorders, meningitis, or other LBWSG-affected causes
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

For context, here are `treemaps for Nigeria and India on GBD Compare
<http://ihmeuw.org/51tj>`_ showing that the majority of DALYs attributable to
low birth weight and short gestation in the under-5 age groups are due to
neonatal disorders, LRI, diarrhea, and meningitis.

Although the sub-causes of  :ref:`neonatal disorders
<2017_cause_neonatal_disorders>` account for the majority of disease burden in
the neonatal age groups, we are not explicitly modeling the neonatal causes in
this simulation. Instead, the relative risks from :ref:`low birth weight and
short gestation <2017_risk_lbwsg>` will directly affect each simulant's
mortality rate, which indirectly accounts for mortality due to neonatal
disorders and other causes like meningitis. Since iron fortification affects
birth weight and hence the relative risk of mortality that a simulant
experiences, this approach allows us to count how many deaths can be averted
from an increased birth weight resulting from iron fortification. This approach
ignores morbidity from neonatal disorders, meningitis, and other causes in the
neonatal age groups, but YLDs for neonates are negligible compared to YLLs (on
the order of 100 times smaller), so ignoring YLDs due to neonatal disorders and
other causes is not a concern.

Additional reasons for excluding neonatal disorders, meningitis, and other
causes affected by low birth weight and short gestation are as follows:

* The relative risks for :ref:`low birth weight and short gestation
  <2017_risk_lbwsg>` are not cause-specific, but rather apply to all-cause
  mortality. Thus it is simpler and more consistent with the input data to group
  all the LBWSG-affected causes together rather than modeling them explicitly,
  unless those causes need to interact with other model components (as is the
  case with :ref:`diarrheal diseases <2017_cause_diarrhea>` and :ref:`lower
  respiratory infections <2017_cause_lower_respiratory_infections>`, which are
  affectd by :ref:`vitamin A deficiency <2017_cause_vitamin_a_deficiency>`).

* Neonatal disorders is potentially problematic because:

  #. We know its (birth) prevalence does not include (birth) prevalence of
     :ref:`other neonatal disorders <2017_cause_neonatal_other>`, for which
     there is no data, and

  #. We know the relative risks are wrong for :ref:`preterm birth
     <2017_cause_neonatal_preterm>` because this sub-cause is PAF-of-1 with
     LBWSG.

* Meningitis is potentially problematic because we have not successfully
  modeled it before. In our initial attempt to include meningitis in the BEP
  model, the simulated meningitis CSMR significantly underestimated the GBD CSMR
  for meningitis in all age groups.

* Other causes affected by LBWSG, such as encephalitis, upper respiratory
  infections, and otitis media, have very few DALYs attributable to LBWSG. Thus
  it is not worth the effort to model these causes explicitly, though averted
  YLLs due to these cause will in theory be captured by the above strategy of
  using the LBWSG relative risks to affect the overall mortality rate of
  simulants.

.. _limitations_section:

Model Limitations
-----------------

* For iron fortification, we are assuming the child's propensity for getting
  fortified food is the same as their mother's. It may be better to model these
  propensities as positively correlated but not necessarily identical.

* For the iron effect on birthweight:

  - We are assuming that flour consumption is independent of birthweight as
    sampled by GBD. Thus we are not accounting for confounding by overall
    caloric intake, which is likely correlated with both flour consumption as
    well as birthweight.

  - We are also assuming that fortification is independent of flour
    consumption, given whether or not you eat flour. Thus we are not accounting
    for confounding due to factors like urban vs. rural settings, which could be
    correlated with both flour consumption and fortification.

  - Ideally, we would want to get correlations between birthweight, flour
    consumption, and fortification in our populations, so that we could create a
    joint distribution to appropriately sample all three variables at once.
    Accounting for this correlation would likely reduce the impact from
    increased birthweight.

* GBD data for neural tube defects, particularly for India, is low relative to
  estimates of NTD burden from other data sources such as
  [Kancherla-et-al-2018-India]_, [Dixon-et-al-2019]_,
  [Kancherla-et-al-2018-Global]_, [Kancherla-et-al-2019]_. Therefore, we are
  likely underestimating the impact of folic acid fortification in our
  simulation, which relies upon GBD data.

* GBD does not model folate deficiency, so we need to be careful that our
  populations are comparable to the Keats study populations when applying the
  effect size of folic acid fortification directly to the birth prevalence of
  neural tube defects. (E.g., could there be some reason besides lack of
  folate-rich food for a high rate of neural tube defects?)

* The fortification scale-up strategy is relatively simple and does not capture
  the numerous complexities of actually implementing a large-scale fortification
  program. This design is deliberate, as the model is intended to provide an
  upper bound on the potential impact fortification could have, by providing a
  flexible framework to specify the maximum possible population coverage one
  could hope to achieve with a given fortification strategy. Estimating what
  this maximum possible coverage would be for a particular fortification
  strategy is outside the scope of this model.

* The model does not currently track individual sequelae of any cause (except
  for anemia severity in the Iron Deficiency model), only overall DALYs due to
  each cause, averaging over all sequalae.

* The model does not currently allow modeling iron fortification and vitamin A
  fortification together, because of different strategies for tracking anemia due to vitamin A deficiency in the two interventions.

References
----------

.. [Aaron-et-al-2017]

  View `Aaron et al. 2017`_

    Grant J Aaron, Valerie M Friesen, Svenja Jungjohann, Greg S Garrett,
    Lynnette M Neufeld, Mark Myatt, Coverage of Large-Scale Food Fortification
    of Edible Oil, Wheat Flour, and Maize Flour Varies Greatly by Vehicle and
    Country but Is Consistently Lower among the Most Vulnerable: Results from
    Coverage Surveys in 8 Countries, The Journal of Nutrition, Volume 147, Issue
    5, May 2017, Pages 984S–994S, https://doi.org/10.3945/jn.116.245753

.. _Aaron et al. 2017: https://doi.org/10.3945/jn.116.245753

.. [Allen-2002]

  View `Allen 2002`_

    Lindsay H Allen, Anemia and iron deficiency: effects on pregnancy outcome,
    The American Journal of Clinical Nutrition, Volume 71, Issue 5, May 2000,
    Pages 1280S–1284S, https://doi.org/10.1093/ajcn/71.5.1280s

.. _`Allen 2002`: https://doi.org/10.1093/ajcn/71.5.1280s

.. [Altinkaynak-et-al-1994]

  View `Altinkaynak et al. 1994`_

    Altinkaynak S, Alp H, Bastem A, Selimoğlu M, Energin M (1994). Serum ferritin and hemoglobin levels of mothers and their newborns. Turk J Pediatr. 1994 Oct-Dec;36(4):289-93.

.. _`Altinkaynak et al. 1994`: https://www.ncbi.nlm.nih.gov/pubmed/7825234

.. [Andersson-et-al-2010]

  View `Andersson et al. 2010`_

    Maria Andersson, Winfried Theis, Michael B Zimmermann, Jasmin Tajeri Foman, Martin Jäkel, Guus SMJE Duchateau, Leon GJ Frenken, Richard F Hurrell, Random serial sampling to evaluate efficacy of iron fortification: a randomized controlled trial of margarine fortification with ferric pyrophosphate or sodium iron edetate, The American Journal of Clinical Nutrition, Volume 92, Issue 5, November 2010, Pages 1094–1104, https://doi.org/10.3945/ajcn.2010.29523

.. _`Andersson et al. 2010`: https://doi.org/10.3945/ajcn.2010.29523

.. [De-Pee-et-al-2002]

  View `De Pee et al. 2002`_

    De Pee S, Bloem MW, Sari M, Kiess L, Yip R, Kosen S. The high prevalence of low hemoglobin concentration among Indonesian infants aged 3-5 months is related to maternal anemia. J Nutr. 2002 Aug;132(8):2215-21.

.. _`De Pee et al. 2002`: https://doi.org/10.1093/jn/132.8.2215

.. [Diana-et-al-2016]

  View `Diana et al. 2016`_

    Diana A, Mallard SR, Haszard JJ, Purnamasari DM, Nurulazmi I, Herliani PD,
    Nugraha GI, Gibson RS, Houghton L. Consumption of fortified infant foods
    reduces dietary diversity but has a positive effect on subsequent growth in
    infants from Sumedang district, Indonesia. PLoS One. 2017 Apr
    20;12(4):e0175952. doi: 10.1371/journal.pone.0175952. eCollection 2017.

.. _`Diana et al. 2016`: https://www.ncbi.nlm.nih.gov/pubmed/28426828

.. [Dixon-et-al-2019]

  View `Dixon et al. 2019`_

    Dixon, M, Kancherla, V, Magana, T, Mulugeta, A, Oakley, GP. High potential
    for reducing folic acid‐preventable spina bifida and anencephaly, and
    related stillbirth and child mortality, in Ethiopia. Birth Defects Research.
    2019; 111: 1513– 1519. https://doi.org/10.1002/bdr2.1584

.. _Dixon et al. 2019: https://doi.org/10.1002/bdr2.1584

.. [Dror-and-Allen-2018]

  View `Dror and Allen 2018`_

    Dror, D. K., & Allen, L. H. (2018). Retinol-to-fat ratio and retinol
    concentration in human milk show similar time trends and associations with
    maternal factors at the population level: a systematic review and
    meta-analysis. Advances in Nutrition, 9(suppl_1), 332S-346S.

.. _`Dror and Allen 2018`: https://doi.org/10.1093/advances/nmy021

.. [Emamghorashi-and-Heidari-2004]

  View `Emamghorashi and Heidari 2004`_

    Emamghorashi F, Heidari T (2004). Iron status of babies born to iron-deficient anaemic mothers in an Iranian hospital. East Mediterr Health J. 2004 Nov;10(6):808-14.

.. _`Emamghorashi and Heidari 2004`: https://www.ncbi.nlm.nih.gov/pubmed/16335768

.. [Erdem-et-al-2002]

  View `Erdem et al. 2002`_

    Erdem A, Erdem M, Arslan M, Yazici G, Eskandari R, Himmetoglu O (2002). The effect of maternal anemia and iron deficiency on fetal erythropoiesis: comparison between serum erythropoietin, hemoglobin and ferritin levels in mothers and newborns. J Matern Fetal Neonatal Med. 2002 May;11(5):329-32.

.. _`Erdem et al. 2002`: https://www.ncbi.nlm.nih.gov/pubmed/?term=12389675

.. [Ethiopian-Federal-Ministry-of-Health-2011]

  View `Ethiopian Federal Ministry of Health 2011`_

    Ethiopian Federal Ministry of Health. Assessment of Feasibility and
    Potential Benefits of Food Fortification. 2011.

.. _Ethiopian Federal Ministry of Health 2011: http://www.ffinetwork.org/about/calendar/2011/documents%202011/Ethiopia.pdf

.. [Ethiopian-National-Food-Consumption-Survey]

  View `Ethiopian National Food Consumption Survey`_

    Ethiopian Public Health Institute (2013). Ethiopian National Food Consumption Survey. Addis Ababa, Ethiopia. [table 18; women]

.. _`Ethiopian National Food Consumption Survey`: https://www.ephi.gov.et/images/pictures/National%20Food%20Consumption%20Survey%20Report_Ethiopia.pdf

.. [Global-Fortification-Data-Exchange]

  View `Global Fortification Data Exchange <https://tinyurl.com/wka9mgh>`_

    Global Fortification Data Exchange. Map: Nutrient Levels in Fortification Standards (mid-range or average). Accessed 4/7/2020. [http://www.fortificationdata.org.]

..
  .. _Global Fortification Data Exchange: https://tinyurl.com/wka9mgh

.. [GFDx-Ethiopia-Fortification-Dashboard]

  View `GFDx Ethiopia Fortification Dashboard`_

    Global Fortification Data Exchange. Ethiopia Fortification Dashboard.
    Accessed 4/28/2020. http://www.fortificationdata.org

.. _GFDx Ethiopia Fortification Dashboard: https://tinyurl.com/rdm4wza

.. [Haider-et-al-2013]

  View `Haider et al. 2013`_

    Haider, B. A., Olofin, I., Wang, M., Spiegelman, D., Ezzati, M., & Fawzi, W. W. (2013). Anaemia, prenatal iron use, and risk of adverse pregnancy outcomes: systematic review and meta-analysis. Bmj, 346, f3443.

.. _`Haider et al. 2013`: https://doi.org/10.1136/bmj.f3443

.. [Heaney-2008]

  View `Heaney 2008`_

    Heaney R. P. (2008). Vitamin D in health and disease. Clinical journal of the American Society of Nephrology : CJASN, 3(5), 1535–1541. https://doi.org/10.2215/CJN.01160308

.. _`Heaney 2008`: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4571146/

.. [Hombali-et-al-2019]

  View `Hombali et al. 2019`_

    Hombali  AS, Solon  JA, Venkatesh  BT, Nair  NS, Peña‐Rosas  JP. Fortification of staple foods with vitamin A for vitamin A deficiency. Cochrane Database of Systematic Reviews 2019, Issue 5. Art. No.: CD010068. DOI: 10.1002/14651858.CD010068.pub2.

.. _`Hombali et al. 2019`: https://doi.org/10.1002/14651858.CD010068.pub2

.. [Hurrell-et-al-2010]

  View `Hurrell et al. 2010`_

    Hurrell, R., Ranum, P., de Pee, S., Biebinger, R., Hulthen, L., Johnson, Q., & Lynch, S. (2010). Revised Recommendations for Iron Fortification of Wheat Flour and an Evaluation of the Expected Impact of Current National Wheat Flour Fortification Programs. Food and Nutrition Bulletin, 31(1_suppl1), S7–S21. https://doi.org/10.1177/15648265100311S102

.. _`Hurrell et al. 2010`: https://doi.org/10.1177/15648265100311S102

.. [Imdad-et-al-2016]

  View `Imdad et al. 2016`_

    Imdad A, Ahmed Z, Bhutta ZA. Vitamin A supplementation for the prevention of morbidity and mortality in infants one to six months of age. Cochrane Database of Systematic Reviews 2016, Issue 9. Art. No.: CD007480. DOI: 10.1002/14651858.CD007480.pub3.

.. _`Imdad et al. 2016`: https://doi.org/10.1002/14651858.CD007480.pub3

.. [Kancherla-et-al-2018-Global]

  View `Kancherla et al. 2018 (Global)`_

    Kancherla, V, Wagh, K, Johnson, Q, Oakley, GP. A 2017 global update on folic
    acid‐preventable spina bifida and anencephaly. Birth Defects Research. 2018;
    110: 1139– 1147. https://doi.org/10.1002/bdr2.1366

.. _Kancherla et al. 2018 (Global): https://doi.org/10.1002/bdr2.1366

.. [Kancherla-et-al-2018-India]

  View `Kancherla et al. 2018 (India)`_

    Kancherla, V, Oakley, GP. Total prevention of folic acid‐preventable spina
    bifida and anencephaly would reduce child mortality in India: Implications
    in achieving Target 3.2 of the Sustainable Development Goals. Birth Defects
    Research. 2018; 110: 421– 428. https://doi.org/10.1002/bdr2.1175

.. _Kancherla et al. 2018 (India): https://doi.org/10.1002/bdr2.1175

.. [Kancherla-et-al-2019]

  View `Kancherla et al. 2019`_

    Kancherla, V., Carmichael, S.L., Feldkamp, M.L. and Berry, R.J. (2019),
    Teratology society position statement on surveillance and prevalence
    estimation of neural tube defects. Birth Defects Research, 111: 5-8.
    https://doi.org/10.1002/bdr2.1434

.. _Kancherla et al. 2019: https://doi.org/10.1002/bdr2.1434

.. [Keats-et-al-2019]

  View `Keats et al. 2019`_

    Keats, E. C., Neufeld, L. M., Garrett, G. S., Mbuya, M. N., & Bhutta, Z. A. (2019). Improved micronutrient status and health outcomes in low-and middle-income countries following large-scale fortification: evidence from a systematic review and meta-analysis. The American journal of clinical nutrition, 109(6), 1696-1708.

.. _`Keats et al. 2019`: https://doi.org/10.1093/ajcn/nqz023

.. [Kilbride-et-al-1999]

  View `Kilbride et al. 1999`_

    J Kilbride, T G Baker, L A Parapia, S A Khoury, S W Shuqaidef, D Jerwood, Anaemia during pregnancy as a risk factor for iron-deficiency anaemia in infancy: a case-control study in Jordan., International Journal of Epidemiology, Volume 28, Issue 3, Jun 1999, Pages 461–468, https://doi.org/10.1093/ije/28.3.461

.. _`Kilbride et al. 1999`: https://doi.org/10.1093/ije/28.3.461

.. [Marques-et-al-2016]

  View `Marques et al. 2016`_

    Marques Rde F, Taddei JA, Konstantyner T, Marques AC, Braga JA (2016). Correlation between hemoglobin levels of mothers and children on exclusive breastfeeding in the first six months of life. J Pediatr (Rio J). 2016 Sep-Oct;92(5):479-85. doi: 10.1016/j.jped.2015.11.006. Epub 2016 May 4.

.. _`Marques et al. 2016`: https://doi.org/10.1016/j.jped.2015.11.006

.. [Sandjaja-et-al-2015]

  View `Sandjaja et al. 2015`_

    Sandjaja, Jus’at, I., Jahari, A. B., Htet, M. K., Tilden, R. L., Soekarjo, D., Utomo, B., ... & Korenromp, E. L. (2015). Vitamin A-fortified cooking oil reduces vitamin A deficiency in infants, young children and women: results from a programme evaluation in Indonesia. Public health nutrition, 18(14), 2511-2522.

.. _`Sandjaja et al. 2015`: https://doi.org/10.1017/S136898001400322X

.. [Siddappa-et-al-2007]

  View `Siddappa et al. 2007`_

    Siddappa AM, Rao R, Long JD, Widness JA, Georgieff MK (2007).The assessment of newborn iron stores at birth: a review of the literature and standards for ferritin concentrations. Neonatology. 2007;92(2):73-82. Epub 2007 Mar 14.

.. _`Siddappa et al. 2007`: https://www.ncbi.nlm.nih.gov/pubmed/17361090

.. [Talaat-et-al-2016]

  View `Talaat et al. 2016`_

    Talaat IM, Kamal NM, Alghamdi HA, Alharthi AA, Alshahrani MA. A randomized clinical trial comparing 3 different replacement regimens of vitamin D in clinically asymptomatic pediatrics and adolescents with vitamin D insufficiency. Ital J Pediatr. 2016;42(1):106. Published 2016 Dec 7. doi:10.1186/s13052-016-0314-z

.. _`Talaat et al. 2016`: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5142392/

.. [Turkay-et-al-1995]

  View `Turkay et al. 1995`_

    Türkay S, Tanzer F, Gültekin A, Bakici MZ (1995). The influence of maternal iron deficiency anaemia on the haemoglobin concentration of the infant. J Trop Pediatr. 1995 Dec;41(6):369-71. doi: 10.1093/tropej/41.6.369.

.. _`Turkay et al. 1995`: https://www.ncbi.nlm.nih.gov/pubmed/8606448

.. [Vieth-1999]

  View `Vieth 1999`_

    Vieth, R. (1999). Vitamin D supplementation, 25-hydroxyvitamin D concentrations, and safety. The American journal of clinical nutrition, 69(5), 842-856.

.. _`Vieth 1999`: https://www.ncbi.nlm.nih.gov/pubmed/10232622

.. [WHO-Vitamin-A-Supplementation-Guidelines]

  View `WHO Vitamin A Supplementation Guidelines in Infants 1-5 Months of Age`_

    World Health Organization (2011). Vitamin A supplementation for infants 1–5 months of age: Guideline. ISBN: 978-92-4-150181-1

.. _`WHO Vitamin A Supplementation Guidelines in Infants 1-5 Months of Age`: https://www.who.int/nutrition/publications/micronutrients/guidelines/vas_infants_1-5/en/

.. [WHO-Guidelines-For-Complementary-Feeding]

  View `WHO Guiding Principles for Complementary Feeding of the Breastfed Child`_

  World Health Organization; Pan American Health Organization. Division of Health Promotion and Protection - Food and Nutrition Program. WHO Guiding Principles for Complementary Feeding of the Breastfed Child. 2003.

.. _`WHO Guiding Principles for Complementary Feeding of the Breastfed Child`: https://www.who.int/maternal_child_adolescent/documents/a85622/en/