Jump to content

Document classification

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by The Anome (talk | contribs) at 10:37, 27 December 2004 (new stub). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)

Document classification is a problem in computer science. The task is to assign a document to one or more categories, based on its contents alone.

Document classification techniques include:

and approaches based on natural language processing.