Lexicons and grammars for language processing: industrial or handcrafted products?

arXiv:2606.03412v1 Announce Type: new In recent years, the use of linguistic data for language processing has increased steadily. Such data are now commonly called language resources. Most of the language resources used for this purpose are collections of texts, such as the Brown Corpus and the Penn Treebank, but electronic lexicons (WordNet, FrameNet, VerbNet, ComLex, Lexicon-Grammar...) and formal grammars (TAG...) have also been developed recently. Lexicons and grammars are still mostly constructed by hand, whereas the construction of corpora has always been highly automated.