AI RESEARCH

Epicure: Navigating the Emergent Geometry of Food Ingredient Embeddings

arXiv CS.CL

arXiv:2605.22391v1 Announce Type: cross. We present Epicure, a family of three sibling skip-gram ingredient embeddings retrained from scratch on a multilingual recipe corpus. We aggregate 4.14M recipes from 11 sources spanning nine languages (English, Chinese, Russian, Vietnamese, Spanish, Turkish, Indonesian, German, and Indian-English) and normalise the raw ingredient strings to 1,790 canonical entries via an LLM-augmented pipeline.
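
To make the two stages named in the abstract concrete (normalising raw ingredient strings, then preparing skip-gram training data), here is a minimal Python sketch. It is not the authors' pipeline: the `CANONICAL` alias table is a hypothetical toy stand-in for the LLM-augmented normalisation, the unit and quantity stripping rules are illustrative, and treating each recipe as an unordered bag in which every other ingredient is context is an assumption the abstract does not state.

```python
import re
from itertools import permutations

# Hypothetical alias table: a toy stand-in for the LLM-augmented
# normalisation step, which the abstract does not describe in detail.
CANONICAL = {
    "scallion": "green onion",
    "scallions": "green onion",
    "spring onion": "green onion",
    "tomatoes": "tomato",
    "garlic cloves": "garlic",
}


def normalise(raw: str) -> str:
    """Lowercase, drop quantities and a few units, then map aliases to a canonical name."""
    text = raw.lower()
    text = re.sub(r"[\d/.,]+", " ", text)  # strip numbers and punctuation
    text = re.sub(r"\b(cups?|tbsp|tsp|g|kg|ml|chopped)\b", " ", text)  # illustrative unit/prep words
    text = " ".join(text.split())  # collapse whitespace
    return CANONICAL.get(text, text)


def skipgram_pairs(recipe: list[str]) -> list[tuple[str, str]]:
    """Return (target, context) pairs, treating the recipe as an unordered bag (assumption)."""
    ingredients = sorted({normalise(r) for r in recipe})
    return list(permutations(ingredients, 2))


if __name__ == "__main__":
    recipe = ["2 scallions, chopped", "spring onion", "3 garlic cloves"]
    for target, context in skipgram_pairs(recipe):
        print(target, "->", context)
```

Pairs produced this way could then be fed to any skip-gram trainer; deduplicating after normalisation keeps surface variants of the same ingredient from generating self-pairs.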