Thursday, November 21, 2024

The megafat could be the healthiest


Typically obesity leads to health problems via insulin resistance (). Excess calories are stored as fat in fat cells up to a certain point. Beyond this point fat cells start rejecting fat. This is the point where fat cells become insulin resistant.

When they become insulin resistant, fat cells no longer respond to the insulin-mediated signal that they should store fat. Fat then increases in circulation and starts getting stored in tissues other than fat cells, including organ tissues (visceral fat). When the organ in question is the liver, this is called non-alcoholic fatty liver disease.

This progression happens with most people, but not with those who can progress to extremely high body fat levels (). Those people are the “megafat-prone” (MP). In the MP, fat cells take a long time to start rejecting fat. So the MP can keep on gaining body fat, often with no sign of diabetes at body fat levels that would have caused serious harm to most people.

One could say that the MP are extremely metabolically resilient. By not becoming insulin resistance as they gain more and more body fat, the MP are somewhat similar to sumo wrestlers (photo below from Nationalgeographic.com); although the main reason why sumo wrestlers do not develop insulin resistance is vigorous exercise. Visceral fat is very easy to "mobilize" through vigorous exercise; this being the basis for the "fat-but-fit" phenomenon (). There are two interesting, and also speculative, inferences that can be made based on all of this.



One is that the MP could potentially be the healthiest people among us. This is due to their extreme metabolic resilience, which should be fairly protective if they can avoid getting up to the unhealthy point of body fat for them. In fact, they could be overweight or even obese and fairly healthy, at least in terms of degenerative diseases. This is a genetic predisposition, which is likely to run in families.

The other inference is that the MP would probably not look “ripped” at relatively low weights. Since their body fat cells have above average insulin sensitivity at high body fat levels, one would expect that high insulin sensitivity to remain at low body fat levels. Insulin sensitivity is strongly associated with longevity ().

So, bringing all of this together, here are two apparent paradoxes. That person who already gained a lot of body fat and is an MP, showing no health problems at or near obesity, could be the healthiest among us. And that person who cannot look ripped at low body fat levels, no matter how hard he or she tries, may be one of the 2 percent or so of the population who will live beyond 90.

Unfortunately it is hard to tell whether someone is MP or not until the person actually becomes megafat. And if you are MP and actually become megafat, the afterlife will very likely arrive sooner rather than later.

Thursday, October 31, 2024

Want to make coffee less acidic? Add cream to it

The table below is from a 2008 article by Ehlen and colleagues (), showing the amount of erosion caused by various types of beverages, when teeth were exposed to them for 25 h in vitro. Erosion depth is measured in microns. The third row shows the chance probabilities (i.e., P values) associated with the differences in erosion of enamel and root.


As you can see, even diet drinks may cause tooth erosion. That is not to say that if you drink a diet soda occasionally you will destroy your teeth, but regular drinking may be a problem. I discussed this study in a previous post (). After that post was published here some folks asked me about coffee, so I decided to do some research.

Unfortunately coffee by itself can also cause some erosion, primarily because of its acidity. Generally speaking, you want a liquid substance that you are interested in drinking to have a pH as close to 7 as possible, as this pH is neutral (). Tap and mineral water have a pH that is very close to 7. Black coffee seems to have a pH of about 4.8.

Also problematic are drinks containing fermentable carbohydrates, such as sucrose, fructose, glucose, and lactose. These are fermented by acid-producing bacteria. Interestingly, when fermentable carbohydrates are consumed as part of foods that require chewing, such as fruits, acidity is either neutralized or significantly reduced by large amounts of saliva being secreted as a result of the chewing process.

So what to do about coffee?

One possible solution is to add heavy cream to it. A small amount, such as a teaspoon, appears to bring the pH in a cup of coffee to a little over 6. Another advantage of heavy cream is that it has no fermentable carbohydrates; it has no carbohydrates, period. You will have to get over the habit of drinking sweet beverages, including sweet coffee, if you were unfortunate enough to develop that habit (like so many people living in cities today).

It is not easy to find reliable pH values for various foods. I guess dentistry researchers are more interested in ways of repairing damage already done, and there doesn't seem to be much funding available for preventive dentistry research. Some pH testing results from a University of Cincinnati college biology page were available at the time of this writing; they appeared to be reasonably reliable the last time I checked them ().

Sunday, September 29, 2024

Body fat and disease: How much body fat can I lose in one day?

Body fat is not an inert deposit of energy. It can be seen as a distributed endocrine organ. Body fat cells, or adipocytes, secrete a number of different hormones into the bloodstream. Major hormones secreted by adipose tissue are adiponectin and leptin.

Estrogen is also secreted by body fat, which is one of the reasons why obesity is associated with infertility. (Yes, abnormally high levels of estrogen can reduce fertility in both men and women.) Moreover, body fat secretes tumor necrosis factor, a hormone that is associated with generalized inflammation and a number of diseases, including cancer, when in excess.

The reduction in circulating tumor necrosis factor and other pro-inflammatory hormones as one loses weight is one reason why non-obese people usually experience fewer illness symptoms than those who are obese in any given year, other things being equal. For example, the non-obese will have fewer illness episodes that require full rest during the flu season. In those who are obese, the inflammatory response accompanying an illness (which is necessary for recovery) will often be exaggerated.

The exaggerated inflammatory response to illness often seen in the obese is one indication that obesity in an unnatural state for humans. It is reasonable to assume that it was non-adaptive for our Paleolithic ancestors to be unable to perform daily activities because of an illness. The adaptive response would be physical discomfort, but not to the extent that one would require full rest for a few days to fully recover.

Inflammation markers such as C-reactive protein are positively correlated with body fat. As body fat increases, so does inflammation throughout the body. Lipid metabolism is negatively affected by excessive body fat, and so is glucose metabolism. Obesity is associated with leptin and insulin resistance, which are precursors of diabetes type 2.

Some body fat is necessary for survival; that is normally called essential body fat. The table below (from Wikipedia) shows various levels of body fat, including essential levels. Also shown are body fat levels found in athletes, as well as fit, “not so fit” (indicated as "Acceptable"), and obese individuals. Women normally have higher healthy levels of body fat than men.


If one is obese, losing body fat becomes a very high priority for health reasons.

There are many ways in which body fat can be measured.

When one loses body fat through fasting, the number of adipocytes is not actually reduced. It is the amount of fat stored in adipocytes that is reduced.

How much body fat can a person lose in one day?

Let us consider a man, John, whose weight is 170 lbs (77 kg), and whose body fat percentage is 30 percent. John carries around 51 lbs (23 kg) of body fat. Standing up is, for John, a form of resistance exercise. So is climbing stairs.

During a 24-hour fast, John’s basal metabolic rate is estimated at about 2,550 kcal/day. This is the number of calories John would spend doing nothing the whole day. It can vary a lot for different individuals; here it is calculated as 15 times John’s weight in lbs.

The 2,550 kcal/day is likely an overestimation for John, because the body adjusts its metabolic rate downwards during a fast, leading to fewer calories being burned.

Typically women have lower basal metabolic rates than men of equal weight.

For the sake of discussion, we expect each gram of John’s body fat to contribute about 8 kcals of energy, assuming a rate of conversion of body fat to calories of about 90 percent.

Thus during a 24-hour fast John burns about 318 g of fat, or about 0.7 lbs. In reality, the actual amount may be lower (e.g., 0.35 lbs), because of the body's own down-regulation of its basal metabolic rate during a fast. This down-regulation varies widely across different individuals, and is generally small.

Many people think that this is not much for the effort. The reality is that body fat loss is a long term game, and cannot be achieved through fasting alone; this is a discussion for another post.

It is worth noting that intermittent fasting (e.g., one 24-hour fast per week) has many other health benefits, even if no overall calorie restriction occurs. That is, intermittent fasting is associated with health benefits even if one fasts every other day, and eats twice one's normal intake on the non-fasting days.

Some of the calories being burned during John's 24-hour fast will be from glucose, mostly from John’s glycogen reserves in the liver if he is at rest. Muscle glycogen stores, which store more glucose substrate (i.e., material for production of glucose) than liver glycogen, are mobilized primarily through anaerobic exercise.

Very few muscle-derived calories end up being used through the protein and glycogen breakdown pathways in a 24-hour fast. John’s liver glycogen reserves, plus the body’s own self-regulation, will largely spare muscle tissue.

The idea that one has to eat every few hours to avoid losing muscle tissue is complete nonsense. Muscle buildup and loss happen all the time through amino acid turnover.

Net muscle gain occurs when the balance is tipped in favor of buildup, to which resistance exercise and the right hormonal balance (including elevated levels of insulin) contribute.

One of the best ways to lose muscle tissue is lack of use. If John's arm were immobilized in a cast, he would lose muscle tissue in that arm even if he ate every 30 minutes.

Longer fasts (e.g., lasting multiple days, with only water being consumed) will invariably lead to some (possibly significant) muscle breakdown, as muscle is the main store of glucose-generating substrate in the human body.

In a 24-hour fast (a relatively short fast), the body will adjust its metabolism so that most of its energy needs are met by fat and related byproducts. This includes ketones, which are produced by the liver based on dietary and body fat.

How come some people can easily lose 2 or 3 pounds of weight in one day?

Well, it is not body fat that is being lost, or muscle. It is water, which may account for as much as 75 percent of one’s body weight.

References:

Elliott, W.H., & Elliott, D.C. (2009). Biochemistry and molecular biology. New York: NY: Oxford University Press.

Fleck, S.J., & Kraemer, W.J. (2004). Designing resistance training programs. Champaign, IL: Human Kinetics.

Large, V., Peroni, O., Letexier, D., Ray, H., & Beylot, M. (2004). Metabolism of lipids in human white adipocyte. Diabetes & Metabolism, 30(4), 294-309.

Thursday, August 29, 2024

Compensatory adaptation as a unifying concept: Understanding how we respond to diet and lifestyle changes

Trying to understand each body response to each diet and lifestyle change, individually, is certainly a losing battle. It is a bit like the various attempts to classify organisms that occurred prior to solid knowledge about common descent. Darwin’s theory of evolution is a theory of common descent that makes classification of organisms a much easier and logical task.

Compensatory adaptation (CA) is a broad theoretical framework that hopefully can help us better understand responses to diet and lifestyle changes. CA is a very broad idea, and it has applications at many levels. I have discussed CA in the context of human behavior in general (Kock, 2002), and human behavior toward communication technologies (Kock, 2001; 2005; 2007). Full references and links are at the end of this post.

CA is all about time-dependent adaptation in response to stimuli facing an organism. The stimuli may be in the form of obstacles. From a general human behavior perspective, CA seems to be at the source of many success stories. A few are discussed in the Kock (2002) book; the cases of Helen Keller and Stephen Hawking are among them.

People who have to face serious obstacles sometimes develop remarkable adaptations that make them rather unique individuals. Hawking developed remarkable mental visualization abilities, which seem to be related to some of his most important cosmological discoveries. Keller could recognize an approaching person based on floor vibrations, even though she was blind and deaf. Both achieved remarkable professional success, perhaps not as much in spite but because of their disabilities.

From a diet and lifestyle perspective, CA allows us to make one key prediction. The prediction is that compensatory body responses to diet and lifestyle changes will occur, and they will be aimed at maximizing reproductive success, but with a twist – it’s reproductive success in our evolutionary past! We are stuck with those adaptations, even though we live in modern environments that differ in many respects from the environments where our ancestors lived.

Note that what CA generally tries to maximize is reproductive success, not survival success. From an evolutionary perspective, if an organism generates 30 offspring in a lifetime of 2 years, that organism is more successful in terms of spreading its genes than another that generates 5 offspring in a lifetime of 200 years. This is true as long as the offspring survive to reproductive maturity, which is why extended survival is selected for in some species.

We live longer than chimpanzees in part because our ancestors were “good fathers and mothers”, taking care of their children, who were vulnerable. If our ancestors were not as caring or their children not as vulnerable, maybe this blog would have posts on how to control blood glucose levels to live beyond the ripe old age of 50!

The CA prediction related to responses aimed at maximizing reproductive success is a straightforward enough prediction. The difficult part is to understand how CA works in specific contexts (e.g., Paleolithic dieting, low carbohydrate dieting, calorie restriction), and what we can do to take advantage (or work around) CA mechanisms. For that we need a good understanding of evolution, some common sense, and also good empirical research.

One thing we can say with some degree of certainty is that CA leads to short-term and long-term responses, and that those are likely to be different from one another. The reason is that a particular diet and lifestyle change affected the reproductive success of our Paleolithic ancestors in different ways, depending on whether it was a short-term or long-term change. The same is true for CA responses at different stages of one’s life, such as adolescence and middle age; they are also different.

This is the main reason why many diets that work very well in the beginning (e.g., first months) frequently cease to work as well after a while (e.g., a year).

Also, CA leads to psychological responses, which is one of the key reasons why most diets fail. Without a change in mindset, more often than not one tends to return to old habits. Hunger is not only a physiological response; it is also a psychological response, and the psychological part can be a lot stronger than the physiological one.

It is because of CA that a one-month moderately severe calorie restriction period (e.g., 30% below basal metabolic rate) will lead to significant body fat loss, as the body produces hormonal responses to several stimuli (e.g., glycogen depletion) in a compensatory way, but still “assuming” that liberal amounts of food will soon be available. Do that for one year and the body will respond differently, “assuming” that food scarcity is no longer short-term and thus that it requires different, and possibly more drastic, responses.

Among other things, prolonged severe calorie restriction will lead to a significant decrease in metabolism, loss of libido, loss of morale, and physical as well as mental fatigue. It will make the body hold on to its fat reserves a lot more greedily, and induce a number of psychological responses to force us to devour anything in sight. In several people it will induce psychosis. The results of prolonged starvation experiments, such as the Biosphere 2 experiments, are very instructive in this respect.

It is because of CA that resistance exercise leads to muscle gain. Muscle gain is actually a body’s response to reasonable levels of anaerobic exercise. The exercise itself leads to muscle damage, and short-term muscle loss. The gain comes after the exercise, in the following hours and days (and with proper nutrition), as the body tries to repair the muscle damage. Here the body “assumes” that the level of exertion that caused it will continue in the near future.

If you increase the effort (by increasing resistance or repetitions, within a certain range) at each workout session, the body will be constantly adapting, up to a limit. If there is no increase, adaptation will stop; it will even regress if exercise ceases altogether. Do too much resistance training (e.g., multiple workout sessions everyday), and the body will react differently. Among other things, it will create deterrents in the form of pain (through inflammation), physical and mental fatigue, and even psychological aversion to resistance exercise.

CA processes have a powerful effect on one’s body, and even on one’s mind!

References:

Kock, N. (2001). Compensatory Adaptation to a Lean Medium: An Action Research Investigation of Electronic Communication in Process Improvement Groups. IEEE Transactions on Professional Communication, 44(4), 267-285.

Kock, N. (2002). Compensatory Adaptation: Understanding How Obstacles Can Lead to Success. Infinity Publishing, Haverford, PA. (Additional link.)

Kock, N. (2005). Compensatory adaptation to media obstacles: An experimental study of process redesign dyads. Information Resources Management Journal, 18(2), 41-67.

Kock, N. (2007). Media Naturalness and Compensatory Encoding: The Burden of Electronic Media Obstacles is on Senders. Decision Support Systems, 44(1), 175-187.

Friday, July 26, 2024

Large LDL and small HDL particles: The best combination

High-density lipoprotein (HDL) is one of the five main types of lipoproteins found in circulation, together with very low-density lipoprotein (VLDL), intermediate-density lipoprotein (IDL), low-density lipoprotein (LDL), and chylomicrons.

After a fatty meal, the blood is filled with chylomicrons, which carry triglycerides (TGAs). The TGAs are transferred to cells from chylomicrons via the activity of enzymes, in the form of free fatty acids (FFAs), which are used by those cells as sources of energy.

After delivering FFAs to the cells, the chylomicrons progressively lose their TGA content and “shrink”, eventually being absorbed and recycled by the liver. The liver exports part of the TGAs that it gets from chylomicrons back to cells for use as energy as well, now in the form of VLDL. As VLDL particles deliver TGAs to the cells they shrink in size, similarly to chylomicrons. As they shrink, VLDL particles first become IDL and then LDL particles.

The figure below (click on it to enlarge), from Elliott & Elliott (2009; reference at the end of this post), shows, on the same scale: (a) VLDL particles, (b) chylomicrons, (c) LDL particles, and (d) HDL particles. The dark bar at the bottom of each shot is 1000 A in length, or 100 nm (A = angstrom; nm = nanometer; 1 nm = 10 A).


As you can see from the figure, most of the LDL particles shown are about 1/4 of the length of the dark bar in diameter, often slightly more, or about 25-27 nm in size. They come in different sizes, with sizes in this range  being the most common. The smaller and denser they are, the more likely they are to contribute to the formation of atherosclerotic plaque in the presence of other factors, such as chronic inflammation. The larger they become, which usually happens in diets high in saturated fat, the less likely they are to form plaque.

Note that the HDL particles are rather small compared to the LDL particles. Shouldn’t they cause plaque then? Not really. Apparently they have to be small, compared to LDL particles, to do their job effectively.

HDL is a completely different animal from VLDL, IDL and LDL. HDL particles are produced by the liver as dense disk-like particles, known as nascent HDL particles. These nascent HDL particles progressively pick up cholesterol from cells, as well as performing a number of other functions, and “fatten up” with cholesterol in the process.

This process also involves HDL particles picking up cholesterol from plaque in the artery walls, which is one of the reasons why HDL cholesterol is informally called “good” cholesterol. In fact, neither HDL nor LDL are really cholesterol; HDL and LDL are particles that carry cholesterol, protein and fat.

As far as particle size is concerned, LDL and HDL are opposites. Large LDL particles are the least likely to cause plaque formation, because LDL particles have to be approximately 25 nm in diameter or smaller to penetrate the artery walls. With HDL the opposite seems to be true, as HDL particles need to be small (compared with LDL particles) to easily penetrate the artery walls in order to pick up cholesterol, leave the artery walls with their cargo, and have it returned back to the liver.

Another interesting aspect of this cycle is that the return to the liver of cholesterol picked up by HDL appears to be done largely via IDL and LDL particles (Elliott & Elliott, 2009), which get the cholesterol directly from HDL particles! Life is not that simple.

Reference:

William H. Elliott & Daphne C. Elliott (2009). Biochemistry and Molecular Biology. 4th Edition. New York: NY: Oxford University Press.

Thursday, June 27, 2024

Sensible sun exposure

Sun exposure leads to the production in the human body of a number of compounds that are believed to be health-promoting. One of these is known as “vitamin D” – an important hormone precursor ().

About 10,000 IU is considered to be a healthy level of vitamin D production per day. This is usually the maximum recommended daily supplementation dose, for those who have low vitamin D levels.

How much sun exposure, when the sun is at its peak (around noon), does it take to reach this level? Approximately 10 minutes.

We produce about 1,000 IU per minute of sun exposure, but seem to be limited to 10,000 IU per day. This assumes a level of skin exposure comparable to that of someone wearing a bathing suit.

Contrary to popular belief, this does not significantly decrease with aging. Among those aged 65 and older, pre-sunburn full-body exposure to sunlight leads to 87 percent of the peak vitamin D production seen in young subjects ().

Evolution seems to have led to a design that favors chronic (every day or so) but relatively brief sun exposure. Most of the sun rays are of the UVA type. However it is the UVB rays, which peak when the sun is high, that stimulate vitamin D production the most. The UVA rays in fact deplete vitamin D. Therefore, after 10 minutes of sun exposure per day when the sun is high, we would be mostly depleting vitamin D by sunbathing when the sun is low.

There is a lot of research that suggests that extended sun exposure also causes skin damage, even exposure below skin cancer levels. Also, anecdotally there are many reports of odd things happening with people who sunbathe for extended periods of time at the pool. Examples are moles appearing in odd places like the bottom of the feet, cases of actinic keratosis, and even temporary partial blindness.


Source: Lifecasting.org

There is something inherently unnatural about sunbathing at the pool, and exponentially more so in tan booths. Hunter-gatherers enjoy much sun exposure by generally avoiding the sun; particularly from the front, as this impairs the vision.

Pools often have reflective surfaces around them, so that people will not burn their feet. They cause glare, and over time likely contribute to the development of cataracts.

When you go to the pool, put your hands perpendicular to your face below you nose so that much of the light coming from those reflective surfaces does not hit your eyes directly. If you do this, you’ll probably notice that the main source of glare is what is coming from below, not from above.

In the African savannas, where our species emerged, this type of reflective surface has no commonly found analog. You don't have to go to the pool to find all kinds of sources of unnatural glare in urban environments.

Snow is comparable. Hunter-gatherers who live in areas permanently or semi-permanently covered with snow, such as the traditional Inuit, have a much higher incidence of cataracts than those who don’t.

So, what would be some of the characteristics of sensible sun exposure during the summer, particular at pools? Considering all that is said above, I’d argue that these should be in the list:

- Standing and moving while sunbathing, as opposed to sitting or lying down.

- Sunbathing for about 10 minutes, when the sun is high, staying mostly in the shade after 10 minutes or so of exposure.

- Wearing eye protection, such as polarized sunglasses.

- Avoiding the sun hitting you directly in the face, even with eye protection, as the facial skin is unlikely to have the same level of resistance to sun damage as other parts that have been more regularly exposed in our evolutionary past (e.g., shoulders).

- Covering those areas that get sunlight perpendicularly while sunbathing when the sun is high, such as the top part of the shoulders if standing in the sun.

Doing these things could potentially maximize the benefits of sun exposure, while at the same time minimizing its possible negative consequences.

Wednesday, May 29, 2024

The China Study again: A multivariate analysis suggesting that schistosomiasis rules!

In the comments section of Denise Minger’s post on July 16, 2010, which discusses some of the data from the China Study (as a follow up to a previous post on the same topic), Denise herself posted the data she used in her analysis. This data is from the China Study. So I decided to take a look at that data and do a couple of multivariate analyzes with it using WarpPLS (warppls.com).

First I built a model that explores relationships with the goal of testing the assumption that the consumption of animal protein causes colorectal cancer, via an intermediate effect on total cholesterol. I built the model with various hypothesized associations to explore several relationships simultaneously, including some commonsense ones. Including commonsense relationships is usually a good idea in exploratory multivariate analyses.

The model is shown on the graph below, with the results. (Click on it to enlarge. Use the "CRTL" and "+" keys to zoom in, and CRTL" and "-" to zoom out.) The arrows explore causative associations between variables. The variables are shown within ovals. The meaning of each variable is the following: aprotein = animal protein consumption; pprotein = plant protein consumption; cholest = total cholesterol; crcancer = colorectal cancer.


The path coefficients (indicated as beta coefficients) reflect the strength of the relationships; they are a bit like standard univariate (or Pearson) correlation coefficients, except that they take into consideration multivariate relationships (they control for competing effects on each variable). A negative beta means that the relationship is negative; i.e., an increase in a variable is associated with a decrease in the variable that it points to.

The P values indicate the statistical significance of the relationship; a P lower than 0.05 means a significant relationship (95 percent or higher likelihood that the relationship is real). The R-squared values reflect the percentage of explained variance for certain variables; the higher they are, the better the model fit with the data. Ignore the “(R)1i” below the variable names; it simply means that each of the variables is measured through a single indicator (or a single measure; that is, the variables are not latent variables).

I should note that the P values have been calculated using a nonparametric technique, a form of resampling called jackknifing, which does not require the assumption that the data is normally distributed to be met. This is good, because I checked the data, and it does not look like it is normally distributed. So what does the model above tell us? It tells us that:

- As animal protein consumption increases, colorectal cancer decreases, but not in a statistically significant way (beta=-0.13; P=0.11).

- As animal protein consumption increases, plant protein consumption decreases significantly (beta=-0.19; P<0.01). This is to be expected.

- As plant protein consumption increases, colorectal cancer increases significantly (beta=0.30; P=0.03). This is statistically significant because the P is lower than 0.05.

- As animal protein consumption increases, total cholesterol increases significantly (beta=0.20; P<0.01). No surprise here. And, by the way, the total cholesterol levels in this study are quite low; an overall increase in them would probably be healthy.

- As plant protein consumption increases, total cholesterol decreases significantly (beta=-0.23; P=0.02). No surprise here either, because plant protein consumption is negatively associated with animal protein consumption; and the latter tends to increase total cholesterol.

- As total cholesterol increases, colorectal cancer increases significantly (beta=0.45; P<0.01). Big surprise here!

Why the big surprise with the apparently strong relationship between total cholesterol and colorectal cancer? The reason is that it does not make sense, because animal protein consumption seems to increase total cholesterol (which we know it usually does), and yet animal protein consumption seems to decrease colorectal cancer.

When something like this happens in a multivariate analysis, it usually is due to the model not incorporating a variable that has important relationships with the other variables. In other words, the model is incomplete, hence the nonsensical results. As I said before in a previous post, relationships among variables that are implied by coefficients of association must also make sense.

Now, Denise pointed out that the missing variable here possibly is schistosomiasis infection. The dataset that she provided included that variable, even though there were some missing values (about 28 percent of the data for that variable was missing), so I added it to the model in a way that seems to make sense. The new model is shown on the graph below. In the model, schisto = schistosomiasis infection.


So what does this new, and more complete, model tell us? It tells us some of the things that the previous model told us, but a few new things, which make a lot more sense. Note that this model fits the data much better than the previous one, particularly regarding the overall effect on colorectal cancer, which is indicated by the high R-squared value for that variable (R-squared=0.73). Most notably, this new model tells us that:

- As schistosomiasis infection increases, colorectal cancer increases significantly (beta=0.83; P<0.01). This is a MUCH STRONGER relationship than the previous one between total cholesterol and colorectal cancer; even though some data on schistosomiasis infection for a few counties is missing (the relationship might have been even stronger with a complete dataset). And this strong relationship makes sense, because schistosomiasis infection is indeed associated with increased cancer rates. More information on schistosomiasis infections can be found here.

- Schistosomiasis infection has no significant relationship with these variables: animal protein consumption, plant protein consumption, or total cholesterol. This makes sense, as the infection is caused by a worm that is not normally present in plant or animal food, and the infection itself is not specifically associated with abnormalities that would lead one to expect major increases in total cholesterol.

- Animal protein consumption has no significant relationship with colorectal cancer. The beta here is very low, and negative (beta=-0.03).

- Plant protein consumption has no significant relationship with colorectal cancer. The beta for this association is positive and nontrivial (beta=0.15), but the P value is too high (P=0.20) for us to discard chance within the context of this dataset. A more targeted dataset, with data on specific plant foods (e.g., wheat-based foods), could yield different results – maybe more significant associations, maybe less significant.

Below is the plot showing the relationship between schistosomiasis infection and colorectal cancer. The values are standardized, which means that the zero on the horizontal axis is the mean of the schistosomiasis infection numbers in the dataset. The shape of the plot is the same as the one with the unstandardized data. As you can see, the data points are very close to a line, which suggests a very strong linear association.


So, in summary, this multivariate analysis vindicates pretty much everything that Denise said in her July 16, 2010 post. It even supports Denise’s warning about jumping to conclusions too early regarding the possible relationship between wheat consumption and colorectal cancer (previously highlighted by a univariate analysis). Not that those conclusions are wrong; they may well be correct.

This multivariate analysis also supports Dr. Campbell’s assertion about the quality of the China Study data. The data that I analyzed was already grouped by county, so the sample size (65 cases) was not so high as to cast doubt on P values. (Having said that, small samples create problems of their own, such as low statistical power and an increase in the likelihood of error-induced bias.) The results summarized in this post also make sense in light of past empirical research.

It is very good data; data that needs to be properly analyzed!