Add Korean. #50

Merged
mattico merged 2 commits into mattico:master from doosik71:korean on Nov 13, 2022

@doosik71
Contributor

This PR adds a Korean language module.
The stemmer still needs to be implemented.
The module is simple, but it is better than nothing.
The Korean stop words come from https://github.com/stopwords-iso/stopwords-ko/blob/master/stopwords-ko.txt
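
For readers unfamiliar with this kind of module, here is a minimal sketch of a language module that filters stop words and leaves stemming unimplemented. It is an illustration only, not the code in this PR: the names StopWordFilter, stem, and process are hypothetical, and the sample words in the usage note are a few common Korean conjunctions chosen for the example rather than entries copied from the linked list.

```rust
use std::collections::HashSet;

/// Hypothetical stop-word filter (an assumption for illustration,
/// not the crate's actual API).
pub struct StopWordFilter {
    words: HashSet<&'static str>,
}

impl StopWordFilter {
    /// Builds the filter from a slice of stop words.
    pub fn new(words: &[&'static str]) -> Self {
        Self {
            words: words.iter().copied().collect(),
        }
    }

    /// Returns `None` for a stop word and `Some(token)` otherwise.
    pub fn filter(&self, token: String) -> Option<String> {
        if self.words.contains(token.as_str()) {
            None
        } else {
            Some(token)
        }
    }
}

/// Placeholder stemmer: returns the token unchanged, mirroring
/// "stemmer needs implementation".
pub fn stem(token: String) -> String {
    token
}

/// Splits on whitespace, drops stop words, then applies the
/// (no-op) stemmer to each remaining token.
pub fn process(filter: &StopWordFilter, text: &str) -> Vec<String> {
    text.split_whitespace()
        .map(|t| t.to_string())
        .filter_map(|t| filter.filter(t))
        .map(stem)
        .collect()
}
```

With a filter built from the sample words "그리고" and "그러나", processing the text "사과 그리고 배" would keep "사과" and "배" and drop "그리고". Whitespace splitting is a simplification here; real Korean text would likely need a more careful tokenizer.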

Thanks.

@mattico
Owner

mattico commented Nov 10, 2022

I'd be happy to merge this once CI is passing!

@doosik71
Contributor Author

doosik71 commented Nov 11, 2022

The previous commit failed because the test files were missing.
So I added two test files (tests/data/ko.in.txt and tests/data/ko.out.txt) to pass the --all-features test.
The Korean sentences in ko.in.txt come from http://guny.kr/stuff/klorem/, which generates Korean Lorem Ipsum.
Thanks.

mattico merged commit a8f9a48 into mattico:master on Nov 13, 2022
@mattico
Owner

mattico commented Nov 13, 2022

Thanks!
