Sentence Segmentation

We will need to be able to extract all sentences that use the word *Galaxy* from an input document.  This implies that we are able to split an input document on sentence boundaries.

NLTK will be sufficient for testing and development, but may not be sufficient (time or space) in large scale production. Consider using Stanford CoreNLP, Apache OpenNLP, or something else as a standalone service for common tasks like tokenization and sentence splitting.  The Lappsgrid can provide [standalone Dockerized services](https://github.com/lappsgrid-incubator/nlp-aas) for this that communicate via REST or AMQP. 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sentence Segmentation #4

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Sentence Segmentation #4

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions