This site contains the technical documentation of the GiellaLT infrastructure, developed and used by Divvun, Giellatekno and others. It is an open source website providing analysers and tools for a wide range of languages, as well as a ready-made setup for adding more languages.
Computer tools supported by our infrastructure include:
- linguistic analysers (morphology, syntax)
- spell checkers and grammar checkers
- morphologically enabled e-dictionaries
- machine translation
- speech synthesis
We write our morphologies as finite state transducers in the formalisms lexc, twolc and xfst rewrite rules, and compile them into computer programs for language analysis with the compilers hfst or foma. Our syntaxes we write in constraint grammar, and we compile our constraint grammars with vislcg3. The installation of these compilers is documented on the Getting Started page.
All our resources, infrastructure and linguistic content alike, are available under various open source licenses, mostly GPL or MIT.
The linguistic source code is found in the present GitHub organisation (giellalt). In addition to that, we maintain the following GitHub organisations, mostly with more technical content: borealium, divvun, divvungiellatekno, giellatekno. Another relevant GitHub organisation is apertium.
You may also cooperate with us, e.g. use our servers for your language, embark on a shared project, etc. In order to do that, please contact us, or write to us directly on our Zulip instance.