Before diving in and talking immediately about the Young, Geier, & Geier study, “Thimerosal exposure in infants and neurodevelopmental disorders: an assessment of computerized medical records in the Vaccine Safety Datalink,” it’s probably a good idea to back up a bit and talk about confounding and confounders. The authors make no mention of confounding and take no account of potential confounding in their analysis. This is one of the many flaws of the study.

Ironically, no epidemiologist has contributed more to modern epidemiological thinking about confounding than Sander Greenland. Professor Greenland has published around 50 methodological papers about confounding and confounders, including five papers specifically on the problems of confounding in ecological study designs. The irony here is that Prof Greenland is an expert witness for the Petitioners in the Autism Omnibus. Given that Prof Greenland has written so much about confounding, you might guess that there are hundreds of other methodological papers related to the subject, and you would be right. But there’s no need to get bogged down in an advanced discussion of confounding, even though the philosophically inclined might be fascinated by new thinking on counterfactual confounding and causality.

For the purposes of our discussion today, we will use the classical definition of confounding accepted by most epidemiologists: Let X be an independent variable, or exposure, and Y be a dependent variable, or disease outcome.*
Z is a potential confounder if:
(1) Z is causally related to the disease outcome Y or the diagnosis of disease outcome Y. Put another way, Z is a risk factor for the disease or for diagnosis of the disease.
(2) Z is associated with the exposure.**
If you put #1 and #2 above together, confounding can be thought of as a “mixing of effects.” You’ve hypothesized a causal association between X and Y, but some other risk factor Z may be responsible part (or all) of the association under investigation. So a potential confounder must have the potential to provide an alternative explanation for the observed association. (There’s a nice little lesson plan for teaching “Confounding in Epidemiology” to high school students at the College Board website.)

Now let’s get away from theoretical discussion and talk about studying thimerosal exposure and neurodevelopmental disorders, with a specific focus on autism. What are the potential confounders that might need to be considered when investigating this association? To guide us in thinking about confounding, the usual approach would be to use knowledge of risk factors for autism, knowledge of vaccine utilization, and previous literature. We also need to consider the nature of the data. If the investigation includes data on children collected over several years, it’s usually a good idea to consider date of birth as a potential confounder. In the Vaccine Safety Datalink (VSD) data, we know that the frequency of autism was increasing over time, between 1990 and 1996. We also know that Hg exposure varied during that time period. These latter two relationships are shown clearly in Figure 1 of the Young et al paper. I would argue, therefore, that in the VSD data its absolutely essential to consider date of birth as a confounder. At the very least you would need to take birth cohort into account. Indeed, given the marked increase in autism rates over time, you would want very “tight control” for date of birth, so it might even be a good idea to consider month of birth as a control variable in the analysis. Indeed, in the the Verstraeten et al. study of the VSD, the investigators used “proportional hazards models stratified by…year and month of birth.” Whatever else one thinks of the Verstraeten et al. study, any epidemiologist would agree that this was the correct approach to control for confounding in the VSD. Now it’s impossible to know for absolutely certain that a potential confounder is an actual confounder until you analyze the data. But in the VSD, the autism time trend is so strong that you have to consider annual birth cohort as a confounder, and month of birth would be even better.

There’s lots of other variables that could be considered as confounders in looking at the association between thimerosal exposure and neurodevelopmental disorders. For example, in Heron and Golding’s analysis of the British ALSPAC data the investigators used nine confounders in their multivarate analysis (birth weight, gestational age at birth, maternal education, child’s gender, parity, housing tenure, midpregnancy maternal smoking , child’s ethnicity, and breastfeeding for 3 months or more.) In Thompson et al’s follow-up study of children from the VSD, there is a huge list of confounders In Table F of their Supplementary Appendix.

Unfortunately, Young et al. did not have access to this kind of detailed data from the VSD — nothing even close to it. Given the limited amount of data available to them, all the more reason to carefully consider confounding in the data that they did have. They did have data on year of birth from both the exposure file and the outcome file. Young et al. analyzed the VSD using ecological regression analysis with birth cohorts as units of analysis, but the only variables in the regression were autism rate and “average Hg dose per person.” A simple approach would have been to add year of birth (1990, 1991, 1992…, 1996) in to the regression analysis as a control variable. This would not have completely controlled for confounding, but it would have been a start.

I can’t be absolutely sure, since they don’t describe any details or give any statistical references, but I think Young et al’s ecological regression equation is really quite simple:
Log(autism prevalence rate) = A + B(average Hg exposure in 100 microgram intervals), where A is an intercept term and the units of analysis are birth cohorts. Without going into the mathematical derivations, take my word for it that the Risk Ratio = exp(B). If we look at the first risk ratio in Table 3, which is 2.87, Young et al. would interpret it thus: In the birth to 7 month period, the rate of autism was approximately 2.9 times higher given a 100 microgram increase in Hg exposure in thimerosal-containing vaccines. This 2.87 was derived from a poisson regression in which the slope was 0.46 (i.e., the log of 2.87). As a slight modification of the statement in my previous post, picture a scatter plot of 7 points were the X axis is average Hg dose, the Y axis is log(prevalence rate), and the 7 points are where the mean Hg dose for each birth cohort intersects the log(prevalence rate) for that birth cohort. The slopes of the lines for autism are 0.46 (for exposure from birth to 7 months) and 0.42 for (exposure from birth to 13 months).

Pretty steep slopes and, therefore, apparently strong associations. But there’s no attempt to control for, or adjust for, the confounding effect of birth cohort. Just one look at Figure 1 (or a basic knowledge about trends in autism) tells you the regression coefficients (slopes) are being driven by increases in autism risk over time. Given the increase in frequency of autism (and other neurodevelopmental disabilities) during time time period, you could do an ecological regression analysis of almost any factor that varied over time and you would find an an association with autism. I would bet that you could enter number of sushi bars per capita into an ecological regression and you’d find an association with autism rates.

And please note, as I alluded to briefly in my last post, that one of the major problems of ecological analyses like this one is that, if confounding variables are left uncontrolled, risk ratio estimates tend to be hugely and spuriously magnified. In other words, the 2.87 is wrong and much too big. This “ecological magnification” bias will be the topic of my next post.

Oh, and by the way, let’s not forget that Young, Geier, and Geier “cooked” the data on the number of cases in such a way that they would get stronger effects. I mention this again because all of hese flaws combine together to make one virtually unbelievable paper.

If you have any questions, please don’t hesitate to comment.

*Please understand that “disease” is a general term used here for the sake of methodological discussion only. It could be argued that autism and ASD are not “diseases.” Also, these definitions (of confounding, etc.) work for research on outcomes that are clearly not diseases. For example, I have a colleague who just submitted a paper for publication on causes of happiness during pregnancy.

**There is one other “rule” about confounding that doesn’t apply to our discussion of the Young et al. paper, but it’s worth mentioning for completeness:
(3) Z cannot be a confounder if it’s association with the exposure is entirely due to the causal effects of the exposure on Z. Thus, for example, Z may not be a confounder if it is an intermediate variable in the causal pathway between exposure and outcome, i.e. exposure X affects Z, which affects the outcome Y. Please don’t worry too much about this criterion #3, because it’s not relevant to our discussion today.

Sphere: Related Content

24 Responses to “New Study on Thimerosal and Neurodevelopmental Disorders: II. What Happened to Control for Confounding?”  

  1. 1 Kev

    Thanks again for outlining this in such a clear, accessible way. It really does help.

  2. 2 Matt

    It is unfortunate that the VSD only dates back to 1991. It would be interesting to plot the trend of autism vs. time for earlier birth cohorts. In the CDDS data (friend to the armchair hack epidemiologist, but also free and public) it is clear that there is a significant increase in the administrative ‘prevalence’ of autism in the mid-t0-late 1990’sfor kids who were already over 10 years old!

    As I recall from one graph on gmwm, there was a 50% increase in the administrative obsevered in the 1995-20000 time frame for kids born in about 1982.

    Without something like increased awareness, these numbers don’t make any sense.

  3. 3 Joseph

    “I would argue, therefore, that in the VSD data its absolutely essential to consider date of birth as a confounder.”

    Even stratification by year of birth would’ve been better than not controlling at all, although there might not be enough cases each year to get good statistical significance.

    Since they did a Poisson regression, plugging in year of birth in the equation could not have been too difficult.

    Another thing they could’ve done, and I can’t say they didn’t do this, is to run the analysis on a different cohort to confirm. I’m sure VSD has data after 1996. They could’ve done, say, 1994 to 2000.

  4. 4 Matt


    I think they are working with a limited dataset of the VSD. I seem to recall that it only goes up to year 2,000. That’s why they had to impute the diagnoses.

    My guess is that this is some sort of end-run attempt. They don’t have access to the full VSD, so they got acccess to this older dataset that may not have been covered by the newer rules.

  5. 5 Joseph

    Oh, right, I seem to recall something of a soap opera involving the Geiers and the VSD. They renamed some files with the wrong extension or something. Let’s not forget that while this is probably the first time they have actually analyzed VSD data, it’s not the first time they have published a paper where they claimed they did.

  6. 6 kristina

    Thanks for explaining all this, esp. for this statement:

    “you could do an ecological regression analysis of almost any factor that varied over time and you would find an an association with autism”

    Though you might want to be careful about noting sushi bars—someone will says it’s the mercury in the tuna…..

  7. 7 Higgamus Hoggamus

    Thanks for elucidating exactly why this latest Geier piece is a piece of trash. I see it as results for hire. I figure that they figured it was a smart way to self-promote some bogus “treatments”. And it was until people like you come along.

  8. 8 B. Martin, MD

    Appreciate the ongoing explanations of study methodology. I’m providing a counterpart/complementary dissection (also in installments) of the Young-Geier study at, from the perspective of a trained neurologist (the journal’s intended audience).

  9. 9 Insider


    Have linked you to PharmaGossip and welcomed you.

    Hope your traffic increases.



  10. 10 crankynick

    Dude, you rock - this is the most helpful, useful analysis on a study I’ve seen in ages.

    And as a health and pharma journo who’s always trying to learn how to do a better job, this is enormously helpful - not just as a dissection of this study, but as a damn fine reminder of what I should be looking for every time I pick up a report on a study.

    I got here via pharmagossip, but I’m staying for life…

  11. 11 Sander Greenland

    The petitioners asked me what I thought of the Young et al. study, and also earlier studies by the Geiers. I told them I thought they were unreliable. Then at the trial, the government (respondent) lawyer asked what I thought of them, I replied the same, that I was willing to disregard them if everyone else agreed to. My testimony for the petitioners was restricted to a few narrow points. Chief among them: Based on the figures given by the respondent’s expert Dr. Eric Fombonne (who said clearly regressive autism accounts for no more than 6% of cases), the existing epidemiologic data could not rule out relative risks as high as 2 linking thimerosal at the US schedule dosing to clearly regressive autism. At the trial, when the government asked if I believed that thimerosal caused autism in general, I said that IF it had any effect (emphasizing the “if”), I would bet it was limited to a very small subgroup.

  12. 12 B. Martin, MD

    I’ve posted a second installment (, reviewing the M&M of Young et al. There are a few questions to you, EpiWonk, which may be relevant and useful–or may indicate my methodologic ignorance.

  13. 13 EpiWonk

    @B. Martin, MD: In your post you say, “There are several uses of the term ‘ecological’ in the article to describe this study and others. Perhaps EpiWonk can provide some insight into the word’s meaning in this context; the distinction’s lost on me.” This is the subject of my next post, in particular the weakness of an “ecologic” analysis of the VSD. In the meantime, readers may be interested in the following document,, in which an expert panel convened by NIH’s National Insititute of Environmental Health Sciences concluded that “…The consensus was that such a design [an ecologic analysis] would have limited value and be potentially misleading.”

  14. 14 DLC

    I’d like to see someone actually do a study on this topic without the fudge factoring and data cooking.

  15. 15 Jennyalice

    Thank you for your clear analysis. You can easily point out the many flaws of the ’study’,
    but do so without the vitriole so often found in many discussions (mostly from the anti-vaccine folk).
    A breath of fresh air for me.

  16. 16 Martin

    Oh o ho! very nice site!t

  17. 17 mypadmedia

    Go ahead and generate the idea,it is going to surely have you feeling better,lake don’t value other ancestors imagination We write decent material!and also keep in mind that..any time another person can come and let you know which he/she adored your current guide you may sense similar to “God”!and as well,who cares about you company have composed a manuscript by using precisely the same issue,you’re a completely different particular person and i am sure which usually you are about to generate will probably be particular!great beginners luck!

  18. 18 clothing

    This kind of info just as if I study created me get more perception , more or less annoyed when someone offline. I need over to verification on the part of your extensive compass similar to info to talk about along with them verification

  19. 19 fashion

    This kind of info just as if I study created me get more perception , more or less annoyed when someone offline. I need over to thank you for your broad range of info to share with each other thank you

  20. 20 testing

    This web site is really a stroll-by way of for all the data you needed about this and didn’t know who to ask. Glimpse here, and also you’ll definitely uncover it.

  21. 21 Vimax no Brasil

    Attention-grabbing weblog by an elegant templates, it may possibly be superior if almost all the current subject matter on your blogging site is manufactured unique with intriguing topics that will give you far more site visitors to your web page.

  22. 22 e-cigarette

    It’s truly very difficult in this active life to listen news on TV, thus I simply use web for that reason, and obtain the latest information.

  23. 23 how can I use anti inflammatory supplements in my daily diet

    Hello there! I know this is somewhat off topic but I was
    wondering which blog platform are you using for this site?
    I’m getting fed up of Wordpress because I’ve had issues with hackers and I’m looking at options for another platform.

    I would be great if you could point me in the direction of
    a good platform.

  1. 1 The Definitive Reference Guide to Debunking the Vaccine-Autism Myth | Angry Autie

Leave a Reply