Skip to content
/ tcsuite Public

Suite for gathering, processing and serving textual content

License

Notifications You must be signed in to change notification settings

qwwqe/tcsuite

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

42 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

tcsuite

Suite for gathering, processing and serving textual content

TODO

  • Method and interface comments
  • zh-TW tokenizer
  • Unify object instantiation and implementation interfaces for sub-components (lexicon/zhtwlexicon, fetcher/libertyfetcher, tokenizer/zhtwtokenizer, etc)
  • Add callback to Fetchers (for immediate tokenization)
  • Add tokenization tables to DB
  • Improve efficiency in conversion from original_content to tokenized_content

About

Suite for gathering, processing and serving textual content

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages