You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- text normalisation (including post-OCR/HTR correction)
34
+
- optical character recognition & handwriting recognition
35
+
- machine translation
36
+
- language modelling
37
+
- text analysis
38
+
- text retrieval, indexing, and querying (raw text, querying of annotations is covered by the [annotation group](https://github.com/CLARIAH/IG-Annotation))
39
+
- (list is not exhaustive)
40
+
41
+
Though our scope is not limited to Dutch, it is probably fair to say that Dutch, Flemish and Frisian, merit most
42
+
attention, as we are a project in the Netherlands.
23
43
24
44
Aspects that are outside the scope of this Interest Group (because they are covered by other IGs):
25
45
26
46
- manual text annotation (covered by the [annotation group](https://github.com/CLARIAH/IG-Annotation))
27
47
- annotation models and formats (covered by the [annotation group](https://github.com/CLARIAH/IG-Annotation))
48
+
- speech recognition (covered by the AV group)
28
49
29
50
## Communication
30
51
31
52
We use the following communication channel:
32
53
33
-
- slack (to be announced)
54
+
-[slack](clariah-workspace.slack.com) (if you don't have access yet, please contact one of the coordinators)
34
55
35
56
## Tasks
36
57
37
-
Text processing problems that we try to tackle are:
38
-
39
-
- automatic linguistic enrichment for multiple languages and multiple time periods
40
-
- named entity extraction
41
-
- dependency relations
42
-
- part-of-speech tagging
43
-
- lemmatisation
44
-
- sentiment analysis
45
-
- text normalisation (including post-OCR/HTR correction)
46
-
- interoperability
47
-
- choosing standards
48
-
- ... (todo)
58
+
1. Provide [an inventory](docs/inventory.md) of current text processing tools, services and models in CLARIAH,
59
+
either developed in CLARIAH (WP3 or WP6), or third party projects that are adopted as solutions.
60
+
2. Specify what requirements we want text processing solutions to adhere to for CLARIAH, to facilitate interoperability
61
+
between tools/services. Indicate to what extent the existing solutions adhere to these requirements.
0 commit comments