AI takes on meal planning: LLMs now dissect dishes to boost health accuracy

The research leveraged GPT-4o to generate 15 detailed meal plans, ensuring a diverse range of breakfast, lunch, and dinner options featuring compound foods. These meals were then processed by GPT-4o, Llama-3 (70B), and Mixtral (8x7B) to test their ability to decompose and map ingredients against entries in the USDA FoodData Central repository. The approach enabled accurate nutrient aggregation, creating a detailed picture of each meal’s macronutrient and micronutrient composition. By manually validating outputs with nutritionists, the study ensured objective, real-world performance assessment.


CO-EDP, VisionRICO-EDP, VisionRI | Updated: 30-04-2025 17:28 IST | Created: 30-04-2025 17:28 IST
AI takes on meal planning: LLMs now dissect dishes to boost health accuracy
Representative Image. Credit: ChatGPT

In the evolving world of personalized health and nutrition, artificial intelligence is reshaping how meal planning is approached. A new research paper published in Nutrients titled "Improving Personalized Meal Planning with Large Language Models: Identifying and Decomposing Compound Ingredients" explores how large language models (LLMs) are bringing unprecedented precision to the decomposition of complex meal ingredients.

Conducted by a multidisciplinary team from institutions including the University of Maribor, the University of Florida, and Indiana University, the study offers a critical analysis of how GPT-4o, Llama-3 (70B), and Mixtral (8x7B) can transform dietary planning by breaking down compound food items into their basic components for detailed nutritional analysis.

How can decomposing ingredients improve precision nutrition and meal planning?

Understanding the exact composition of meals is vital for individuals managing allergies, intolerances, or specific dietary goals. Traditional meal planning often lists complex food items without breaking them down into their fundamental ingredients, making it difficult to identify problematic components or accurately calculate nutritional values. The study highlights that Large Language Models are now capable of identifying and decomposing compound ingredients like "Chicken Cacciatore" or "Vegetable Ratatouille" into basic ingredients, thus allowing for far more precise nutritional profiling.

The research leveraged GPT-4o to generate 15 detailed meal plans, ensuring a diverse range of breakfast, lunch, and dinner options featuring compound foods. These meals were then processed by GPT-4o, Llama-3 (70B), and Mixtral (8x7B) to test their ability to decompose and map ingredients against entries in the USDA FoodData Central repository. The approach enabled accurate nutrient aggregation, creating a detailed picture of each meal’s macronutrient and micronutrient composition. By manually validating outputs with nutritionists, the study ensured objective, real-world performance assessment.

The findings demonstrate that decomposing compound foods is not just an academic exercise; it has practical applications for enhancing diet personalization, institutional meal services in schools and hospitals, and even public health initiatives aimed at combating diet-related diseases. The models can help identify allergens, optimize macronutrient balance, and customize meals to individual health needs, elevating meal planning from estimation to science-driven precision.

How do different LLMs perform in decomposing compound ingredients?

The study compared three major LLMs to evaluate their effectiveness in ingredient decomposition. Results showed that Llama-3 (70B), an open-source model developed by Meta AI, performed the best, achieving an impressive F1-score of 0.894 and an accuracy of 0.893. GPT-4o followed with an F1-score of 0.842 and an accuracy of 0.835. In contrast, Mixtral (8x7B) lagged behind, with a significantly lower F1-score of 0.690 and an accuracy of 0.666.

Statistical analysis confirmed that Llama-3 (70B) and GPT-4o performed significantly better than Mixtral across evaluators, although the difference between Llama-3 (70B) and GPT-4o was not statistically significant. Llama-3 (70B) was especially consistent, successfully matching basic ingredient weights to compound ingredient expectations 87% of the time, compared to 73% for GPT-4o and only 55% for Mixtral.

The study also pointed out that all models occasionally failed to include minor but important elements like seasonings and oils. However, Llama-3 (70B) showed slightly better culinary awareness, including missing ingredients like olive oil in ratatouille recipes. Mixtral tended to overestimate ingredient weights more frequently, leading to inaccuracies in nutrient profiling.

Performance disparities were also evident in how models handled nutrient estimations. All models showed consistency in macronutrient estimates, but slight deviations were observed, especially in fat and carbohydrate calculations. Llama-3 (70B) once again outperformed, providing more accurate breakdowns while minimizing major overestimations in ingredient weights.

What are the challenges and future directions for AI-enhanced meal planning?

While the findings are promising, the study does not shy away from highlighting the existing limitations. Despite improvements, none of the tested models consistently included essential micro-ingredients like salt, pepper, and sugar with full precision, which could significantly impact the nutritional profile of a meal. Additionally, all three models occasionally struggled with accurately reflecting the weight loss that occurs during cooking processes, an important factor for precise energy density calculations.

The research also acknowledges that the limited dataset, just 15 meal plans, may restrict the generalizability of the results. Future studies will need to incorporate a broader range of cuisines and food cultures to better understand the models' adaptability across diverse eating patterns. The paper suggests that expanding into global cuisines could uncover hidden biases or gaps in ingredient recognition, which is crucial for developing universally applicable dietary tools.

Another challenge lies in model explainability. The "black box" nature of LLMs can make it difficult to understand how specific decomposition decisions are made. Increasing transparency and integrating explainable AI methods would strengthen trust among nutritionists, healthcare providers, and end-users.

In a nutshell, the study underscores the immense potential of integrating ingredient decomposition into dietary planning applications, health apps, and even institutional food services. With enhanced decomposition accuracy, AI-driven platforms could offer unparalleled personalization, guiding individuals towards healthier choices based on precise nutrient intake tailored to their medical, cultural, and personal needs.

  • FIRST PUBLISHED IN:
  • Devdiscourse
Give Feedback