Skip to content

argosopentech/sbd

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SBD

Sentence boundary detection for machine translation

Benchmarks

Model Average Accuracy Average Runtime (seconds)
Spacy en_core_web_sm 0.924311498164287 0.0250468651453654
Spacy xx_sent_ud_sm 0.924311498164287 0.00476229190826416
Argos Translate 2 Beta 0.515548280365557 1.87798078854879
Stanza en 0.924311498164287 0.0219400326410929

Libraries tested

Data

The data is from Wikipedia and I (native English speaker) manually split the sentences. Currently only English data is used but this script can easily be extended for more languages.

https://en.wikipedia.org/

About

Sentence boundary detection for machine translation

Resources

License

Stars

Watchers

Forks

Contributors 2

  •  
  •