Publications
DESIRE Toolkit Components - Matcher
Department(s):
Publication year: 2000
Language: English
Document type: Report
Publisher: DESIRE EU project
Additional information: Software tool for automated subject classification
Abstract
The Matcher tool implements a subject classification process based on a subject-specific thesaurus in which terms are intellectually mapped to categories or subject classes. The classification process consists of several steps. First, the document to be classified is fetched. Text is then extracted from the document, and all thesaurus terms are matched against it. Heuristic processing rules are applied to the results of the matching process. Finally, the outcome is formatted either for presentation or for storage in a database.
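The steps described in the abstract can be illustrated with a minimal Python sketch. This is not the Matcher tool's actual code: the thesaurus entries, the weights, the `min_score` threshold, and every function name below are hypothetical and chosen only to show the fetch, extract, match, heuristics, and format stages in order.

```python
import re
import urllib.request
from collections import defaultdict

# Hypothetical thesaurus: term -> (subject class, weight).
# The real tool uses an intellectually built, subject-specific thesaurus.
THESAURUS = {
    "web crawling": ("Information retrieval", 2.0),
    "thesaurus": ("Knowledge organization", 1.0),
    "classification": ("Knowledge organization", 1.5),
}


def fetch(url):
    """Step 1: fetch the document to be classified (assumes UTF-8 content)."""
    with urllib.request.urlopen(url) as response:
        return response.read().decode("utf-8", errors="replace")


def extract_text(html):
    """Step 2: crude text extraction: drop tags, collapse whitespace, lowercase."""
    text = re.sub(r"<[^>]+>", " ", html)
    return re.sub(r"\s+", " ", text).strip().lower()


def match_terms(text, thesaurus):
    """Step 3: match every thesaurus term and sum weighted hits per subject class."""
    scores = defaultdict(float)
    for term, (subject, weight) in thesaurus.items():
        hits = len(re.findall(r"\b" + re.escape(term) + r"\b", text))
        if hits:
            scores[subject] += hits * weight
    return dict(scores)


def apply_heuristics(scores, min_score=2.0):
    """Step 4: an assumed heuristic: drop weak classes, rank the rest by score."""
    kept = {subject: score for subject, score in scores.items() if score >= min_score}
    return sorted(kept.items(), key=lambda item: item[1], reverse=True)


def format_result(ranked):
    """Step 5: format for presentation as tab-separated lines."""
    return "\n".join(f"{subject}\t{score:.1f}" for subject, score in ranked)


def classify(html, thesaurus=THESAURUS):
    """Run steps 2 to 4 on an already fetched document."""
    return apply_heuristics(match_terms(extract_text(html), thesaurus))
```

For a document already in memory, `format_result(classify(html))` yields one line per retained subject class. The same ranked list could be written to a database instead of being formatted as text.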
Defence
Keywords
- Technology and Engineering
- focussed web crawling
- Automated classification
Other
Published

