Summary: | Extract dates for KOrganizer automatically from emails | ||
---|---|---|---|
Product: | [Applications] kontact | Reporter: | Kurt Bennater <curdyben> |
Component: | general | Assignee: | kdepim bugs <kdepim-bugs> |
Status: | RESOLVED DUPLICATE | ||
Severity: | wishlist | CC: | kde-2011.08 |
Priority: | NOR | ||
Version: | unspecified | ||
Target Milestone: | --- | ||
Platform: | openSUSE | ||
OS: | Linux | ||
Latest Commit: | Version Fixed In: |
Description
Kurt Bennater
2005-04-16 17:36:54 UTC
Some people from Stanford have actually been looking into an even more ambitious project, the fully automated extraction of events from emails, see the first two links at http://nlp.stanford.edu/courses/cs224n/2004/ . At least the extraction of dates seems to have worked well (as explained in http://nlp.stanford.edu/courses/cs224n/2004/jblack-final-report.pdf ). A rudimentary perl tool for date extraction and a parser for normalizing the output can be found at http://www.shlrc.mq.edu.au/masters/students/rdennys/PerlAsst3.html , other perl tools include http://search.cpan.org/~mcmahon/Date-PeriodParser-0.03/PeriodParser.pm (understands even expressions like "tomorrow") and for multi-language support: http://search.cpan.org/~gbarr/TimeDate-1.16/lib/Date/Parse.pm The KDE desktop search engine Kat includes code which can identify the language a document is written in. Perhaps the same works for emails? So I guess, at least some important parts for this feature are already there... |