algorithm - Sentence Extraction -
let's have following:
- a list of sentences (with no punctuation)
- a long string of words (with no punctuation)
my goal segment string chunks of sentences, excluding words don't form sentence.
i going think of ad-hoc solution this, felt may well-known problem (i mean problem it's advisable implement efficient existing algorithm, shortest path problem, etc...)
so, there algorithms this?
edit: sample sentences:
hello how want hungry want eat thirsty want drink name morning night sample input string:
do how eat so, how you should output.
(the input sample may strange because it's speech recognizer output)
thanks.
there few freely available tools discussed here:
http://metaoptimize.com/qa/questions/7001/the-best-sentence-segmentation-tool-until-now
Comments
Post a Comment