Do better IR tools improve the accuracy of engineers’ traceability recovery?
Författare
Summary, in English
model of IR evaluation for performance assessment. We conducted a pilot experiment using printed candidate lists from the tools RETRO and ReqSimile to investigate how different quality levels of tool output affect the tracing accuracy of engineers. Statistical testing of equivalence, commonly used in medicine, has been conducted to analyze the data. The low number of subjects in this pilot experiment resulted neither
in statistically significant equivalence nor difference. While our results are not conclusive, there are indications that it is worthwhile to investigate further into the actual value of improving tool support for semi-automatic traceability recovery. For example, our pilot experiment showed that the effect size of using RETRO versus ReqSimile is of practical
significance regarding precision and F-measure. The interpretation
of the effect size regarding recall is less clear. The experiment needs to be replicated with more subjects and on varying tasks to draw firm conclusions.
Avdelning/ar
Publiceringsår
2011
Språk
Engelska
Sidor
23-30
Publikation/Tidskrift/Serie
[Host publication title missing]
Fulltext
- Available as PDF - 608 kB
- Download statistics
Dokumenttyp
Konferensbidrag
Förlag
Association for Computing Machinery (ACM)
Ämne
- Computer Science
Nyckelord
- requirements traceability
- information retrieval
- controlled experiment
- equivalence testing
Conference name
MALETS 2011: International Workshop on Machine Learning Technologies in Software Engineering
Conference date
2011-11-12
Status
Published
Projekt
- Embedded Applications Software Engineering