About: Nutrition-equivalent food breakdown
Back to the tool!
What is this?
Most likely, you're sometimes a little unclear about what you're
putting into your body. These charts show the
nutritional equivalents of a large collection of foods.
- Don't eat meat for health reasons? Then you'd probably better avoid falafels and coffeecake, since these effectively have chunks of pork in them.
- Enjoy eating crackers? Then you should probably choose the low-sodium, low-fat kind, since otherwise you're eating fast food.
- How much fast food does your baby food have in it?
Data
USDA Foodgroups
This data is from the USDA's nutritional
information database.
This database contains 7412 foods. The food groups are:
- Legumes and Legume Products
- Beef Products
- Soups, Sauces, and Gravies
- Fats and Oils
- Baked Products
- Meals, Entrees, and Sidedishes
- Fast Foods
- Beverages
- Sweets
- Spices and Herbs
- Vegetables and Vegetable Products
- Nut and Seed Products
- Pork Products
- Finfish and Shellfish Products
- Dairy and Egg Products
- Breakfast Cereals
- Ethnic Foods
- Baby Foods
- Poultry Products
- Cereal Grains and Pasta
- Snacks
- Lamb, Veal, and Game Products
- Sausages and Luncheon Meats
- Fruits and Fruit Juices
Latent Foodgroups
What if we want to discover good labels for
foods, to best explain their nutritional information? Using the
tool described in the Methodology section, we have have discovered a
set of 24 (and 10) nutritional "themes" found in common foods
themes.
The 24 themes are:
- Unsaturated vegetable oil (nuts and mostly unsaturated vegetable oils) (monounsaturated fatty acids, polyunsaturated fatty acids, beta-sitosterol, arginine, niacin, saturated fat, thiamine, copper)
- Fortified cereal (e.g., multigrain cheerios)
- Starch nutrients (Folate, starch, cystine, phenylalanine, serine, glutamic acid, thiamin, tryptophan, proline, pantothenic acid)
- Good meat nutrients (thiamin, calcium, iron, riboflavin, niacin, vitamin b6, phosphorus)
- Mustard oil (mono) (vitamin D, monounsaturated fat, vitamin A RAE, cholesterol, tocopherol)
- Flaxseed oil (mono, poly) (polyunsaturated fat, phytosterols, monounsaturated fat, vitamin K, saturated fat)
- Tangerines (beta cryptoxanthin, beta carotene, lutein, vitamin A IU, vitamin A RAE, fiber, vitamin C)
- Carbohydrates and thickeners (gelatin, cornstarch) (Carbohydrates, fiber, NaCl, Energy, Ash, phosphorus, total fat, selenium)
- Salt (NaCl, NaHCO3) (NaCl, water, maltose, ash, fiber, protein, potassium, calcium, vitamin A, vitamin D, fats)
- Fiber and minerals (foods: potatoes, wheat bran, tea, spices) (fiber, manganese, copper, magnesium)
- Liver (vitamins A, B12, D, copper)
- Industrial Coconut/Palm oil (Saturated fats, total fat, phytosterols, vitamin K, monounsaturated fat, vitamin E)
- Decaffeinated softdrinks (with fluoride) (vitamin C, water, fluoride, vitamin K, vitamin A)
- Desalted egg whites (lysine, methionine, histidine, glycine, alanine, leucine)
- Animal byproduct nutrients (cholesterol, vitamin b12, zinc, selenium)
- "Alcoholic / caffeinated drinks" (ethyl alcohol, caffeine, theobromine, lycopene, water)
- Greens (e.g., turnip, kale, cress) (lutein, beta carotene, vitamin K, galactose, vitamin A, fiber, folate)
- Cheese (cholesterol, retinol, calcium, saturated fat)
- Beets (beta tocopherol, vitamin E, betaine, phytosterols, gamma tocopherol)
- Cupuassu oil (undifferentiated, monounsaturated and saturated fats, cholesterol)
- Trans fats (hydrogenated and soybean oils)
- Raw sugars (sugar, lactose, sucrose, fiber)
- Carrots (beta carotene, fructose, glucose, vitamin A, maltose, fiber)
- Fish oil (cholesterol, vitamin D, unsaturated fat, saturated fat, vitamin b12)
Interestingly, when using 10 themes, the only vegetable foodgroup to
appear was the carrots/squash foodgroup; other vegetables, such as
broccoli, were combinations of other categories; there was also no
dairy group and no fruits group; these appear to have been mixed up
by the additives in cereals and sweetened beverages. Instead, the
discovered food groups appear to be largely a result of manufactured
foods and the manufacturing decisions that go into these foods (such
as hydrogenated soybean oils vs. palm kernel and coconut oils).
- Lard
- Fortified cereal
- Lean meat
- Walrus liver
- Flavored water drinks
- Commodity shortening, (hydrogenated) soybean
- Carrots or Squash
- Industrial shortening, (hydrogenated) palm or coconut
- Gin
- Dried tea (and spices)
Methodology
We found these breakdowns using topic models. Topic models are a
way of finding collections of themes in text documents. Here we
treated foods as documents and nutritional information as word
counts. Generally, topic models can discover these themes
automatically. To keep a common vocabulary, we chose to define
these themes (formally, "topics") based on food group labels
provided by the USDA.
This is useful because it is not necessarily based on preconceived
notions of the major food groups; instead, it defines food groups,
or "nutritional themes", that best explain the variance in the
dataset. In statistical terms, this is a bit like applying
principal component analysis.
To convert nutrients to words, we first normalized all
nutrients to have mean 100 (unitless) and then rounded nutrient
weights to the nearest integer.
Contact
Sean Gerrish