Webbläsaren som du använder stöds inte av denna webbplats. Alla versioner av Internet Explorer stöds inte längre, av oss eller Microsoft (läs mer här: * https://www.microsoft.com/en-us/microsoft-365/windows/end-of-ie-support).

Var god och använd en modern webbläsare för att ta del av denna webbplats, som t.ex. nyaste versioner av Edge, Chrome, Firefox eller Safari osv.

DESIRE Toolkit Components - Matcher

Författare

  • Anders Ardö

Summary, in English

The Matcher tool implements a subject classification process using a subject-specific thesaurus by which terms are intellectually mapped to categories or subject classes. The classification process is made up of several steps. First, the document to be classified is fetched. Text is extracted from this document, and all thesaurus terms are matched to it. Some heuristic processing rules are applied to the results from the matching process. Finally, the outcome is formatted either for presentation or for storing in a database.

Publiceringsår

2000

Språk

Engelska

Dokumenttyp

Rapport

Förlag

DESIRE EU project

Ämne

  • Electrical Engineering, Electronic Engineering, Information Engineering

Nyckelord

  • focussed web crawling
  • Automated classification

Status

Published