98%
921
2 minutes
20
Food is the cornerstone of both survival and social life. With the increasing complexity of global dietary cultures, there is a growing demand for food intelligence to enable tasks like recipe recommendations and diet-disease correlation discovery. To address this, we introduce the food-oriented large language model (LLM) FoodSky, which offers fine-grained perception and reasoning on food data. We constructed a food corpus, FoodEarth, from various authoritative sources to enhance FoodSky's knowledge. We also developed the topic-based selective state space model and hierarchical topic retrieval augmented generation algorithms to improve FoodSky's ability to capture fine-grained food semantics and generate context-aware food-relevant text. Extensive experiments show that FoodSky significantly outperforms general-purpose LLMs on the Chinese National Chef Examination and Dietetic Examination, achieving an accuracy of 83.3% and 91.2%, respectively. Beyond enhancing culinary creativity and promoting healthier eating patterns, FoodSky aims to establish a new benchmark for domain-specific LLMs in addressing real-world food-related challenges.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12142648 | PMC |
http://dx.doi.org/10.1016/j.patter.2025.101234 | DOI Listing |
Patterns (N Y)
May 2025
Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China.
Food is the cornerstone of both survival and social life. With the increasing complexity of global dietary cultures, there is a growing demand for food intelligence to enable tasks like recipe recommendations and diet-disease correlation discovery. To address this, we introduce the food-oriented large language model (LLM) FoodSky, which offers fine-grained perception and reasoning on food data.
View Article and Find Full Text PDFFood recognition plays an important role in food choice and intake, which is essential to the health and well-being of humans. It is thus of importance to the computer vision community, and can further support many food-oriented vision and multimodal tasks, e.g.
View Article and Find Full Text PDF