algorithm - Sentence Extraction -


let's have following:

  • a list of sentences (with no punctuation)
  • a long string of words (with no punctuation)

my goal segment string chunks of sentences, excluding words don't form sentence.

i going think of ad-hoc solution this, felt may well-known problem (i mean problem it's advisable implement efficient existing algorithm, shortest path problem, etc...)

so, there algorithms this?

edit: sample sentences:

hello  how  want  hungry  want eat  thirsty  want drink  name  morning  night 

sample input string:

do how eat 

so, how you should output.

(the input sample may strange because it's speech recognizer output)

thanks.

there few freely available tools discussed here:

http://metaoptimize.com/qa/questions/7001/the-best-sentence-segmentation-tool-until-now


Comments

Popular posts from this blog

linux - xterm copying to CLIPBOARD using copy-selection causes automatic updating of CLIPBOARD upon mouse selection -

qt - Errors in generated MOC files for QT5 from cmake -