Hi all,
this issue stems from two facts:
So it is only natural to try to leverage them for automating docs translation.
Possible workflow:
- Keep the English version of the docs a ground truth.
- Use GitHub Actions to automate everything.
- When commit is detected in the English version, GA calls an LLM (e.g. Gemini Free Tier) to translate the commited file into other languages.
- GA commits directly or opens a PR with the changes for manual approval.
Translation does not need to be done from scratch. One could prompt the LLM to reuse the existing non-English translation as a baseline for the new translation. This could allow for automatic docs translation, but still curated by humans. For example, if a docs contributor spots an error in the LLM translation, they could fix it in the markdown of that language and the LLM would take that into account in the next translation.
Does that make sense?
Caveats:
- I'm not sure how cross-file linking would work. Cross linking to another file is not a problem because as far as I can tell all languages share the same file names. But linking to another file header might break from time to time because header names might change with translations. But in any case, that is an error that is easily detected during docs compilation and can easily be fixed manually .
- If GA fails and some file remains untranslated, I'm unsure what would be the best way to keep track of it so that it is also translated next time the GA runs.
Hi all,
this issue stems from two facts:
So it is only natural to try to leverage them for automating docs translation.
Possible workflow:
Translation does not need to be done from scratch. One could prompt the LLM to reuse the existing non-English translation as a baseline for the new translation. This could allow for automatic docs translation, but still curated by humans. For example, if a docs contributor spots an error in the LLM translation, they could fix it in the markdown of that language and the LLM would take that into account in the next translation.
Does that make sense?
Caveats: