HomePage |
Paper |
Demo |
Citation
This repository is the official implementation of FoodSky.
FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination
Pengfei Zhou, Weiqing Min*, Chaoran Fu, Ying Jin, Mingyu Huang, Xiangyang Li, Shuhuan Mei, and Shuqiang Jiang*
* Correponding authors.
FoodSky is a fundamental LLM specifically designed for the world of food, marking a significant step forward in food computing. As the first Chinese LLM built just for food-related topics, FoodSky uses a vast collection of food data called FoodEarth, which includes everything from recipes to nutritional information, sourced from trusted books and websites. This model isn't just about handling huge amounts of data—it’s smart in understanding and discussing food like a professional. Whether it's passing chefs' exams or providing diet advice, FoodSky shows impressive skills, making it a reliable resource for anyone in the food industry, from chefs to dietitians. Its aim is to make working with food more creative and healthy, helping users across various scenarios with accurate and culturally aware advice. This tool is set to change how we interact with food information, blending detailed food knowledge with cutting-edge technology to better serve food enthusiasts and professionals alike.
Please refer to Quick Start Guide for detailed instructions on evaluation, training, and other resources.
- NLP Metrics: For BLEU, GLEU and ROUGE evaluation, check the
inference_nlp
directory. Example usage ininference_nlp/all_test.sh
. - CDE Testing: Code execution evaluation is available in
test/cde-eval/test.sh
. - GPT Evaluations: GPT model evaluation examples can be found in
test/gpt-eval
.
Refer to train/train_scripts/README.md
for detailed training instructions covering:
- Resource optimization (Full-tuning, LoRA, QLoRA)
- Practical tricks (FlashAttention-2, RoPE scaling, etc.)
- Experiment monitoring tools
- Release the test data and test code.
- Release the inference code.
- Release the training code and models.
- Release a version of FoodEarth dataset.
- Release a bilingual version of FoodSky and FoodEarth.
We provide the FoodSky models and FoodEarth dataset in Zenodo. To prevent server overload, our demo system requires authentication. You can try the experience account using username: test, password: 123456. If you are interested in further accessing our work, please contact us through the following channels to request credentials:
- 📧 Email: [email protected]
- 🌐 Homepage: http://123.57.42.89/FoodComputing__Home.html
If you feel FoodSky insteresting, feel free to use the following BibTeX entry to cite our paper. Thanks!
@misc{zhou2024foodskyfoodorientedlargelanguage,
title={FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination},
author={Pengfei Zhou and Weiqing Min and Chaoran Fu and Ying Jin and Mingyu Huang and Xiangyang Li and Shuhuan Mei and Shuqiang Jiang},
year={2024},
eprint={2406.10261},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2406.10261},
}