Evaluating stages of development in second language French: A machine-learning approach
Författare
Redaktör
- Joakim Nivre
- Heiki-Jaan Kalep
- Kadri Muischnek
- Mare Koit
Summary, in English
This paper describes a system to define and
evaluate development stages in second language
French. The identification of such
stages can be formulated as determining the
frequency of some lexical and grammatical
features in the learners’ production and how
they vary over time. The problems in this
procedure are threefold: identify the relevant
features, decide on cutoff points for the
stages, and evaluate the degree of success of
the model.
The system addresses these three problems.
It consists of a morphosyntactic analyzer
called Direkt Profil and a machine-learning
module connected to it. We first describe the
usefulness and rationale behind its development.
We then present the corpus we used
to develop the analyzer. Finally, we present
new and substantially improved results on
training machine-learning classifiers compared
to previous experiments (Granfeldt et
al., 2006). We also introduce a method to
select attributes in order to identify the most
relevant grammatical features.
evaluate development stages in second language
French. The identification of such
stages can be formulated as determining the
frequency of some lexical and grammatical
features in the learners’ production and how
they vary over time. The problems in this
procedure are threefold: identify the relevant
features, decide on cutoff points for the
stages, and evaluate the degree of success of
the model.
The system addresses these three problems.
It consists of a morphosyntactic analyzer
called Direkt Profil and a machine-learning
module connected to it. We first describe the
usefulness and rationale behind its development.
We then present the corpus we used
to develop the analyzer. Finally, we present
new and substantially improved results on
training machine-learning classifiers compared
to previous experiments (Granfeldt et
al., 2006). We also introduce a method to
select attributes in order to identify the most
relevant grammatical features.
Publiceringsår
2007
Språk
Engelska
Publikation/Tidskrift/Serie
NODALIDA 2007 PROCEEDINGS
Fulltext
- Available as PDF - 103 kB
- Download statistics
Dokumenttyp
Konferensbidrag
Förlag
University of Tartu
Ämne
- Computer Science
- Languages and Literature
Conference name
NODALIDA 2007
Conference date
0001-01-02
Status
Published
Projekt
- Direct Profile: A program for analysing developmental sequences and developmental stages in written learner French
Forskningsgrupp
- Fransk språkvetenskap
ISBN/ISSN/Övrigt
- ISBN: 978-9985-4-0514-7