A machine learning-based approach for demarcating requirements in textual specifications

Sallam Abualhaija, Chetan Arora, Mehrdad Sabetzadeh, Lionel C. Briand, Eduardo Vaz

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

A simple but important task during the analysis of a textual requirements specification is to determine which statements in the specification represent requirements. In principle, by following suitable writing and markup conventions, one can provide an immediate and unequivocal demarcation of requirements at the time a specification is being developed. However, neither the presence nor a fully accurate enforcement of such conventions is guaranteed. The result is that, in many practical situations, analysts end up resorting to after-the-fact reviews for sifting requirements from other material in a requirements specification. This is both tedious and time-consuming. We propose an automated approach for demarcating requirements in free-form requirements specifications. The approach, which is based on machine learning, can be applied to a wide variety of specifications in different domains and with different writing styles. We train and evaluate our approach over an independently labeled dataset comprised of 30 industrial requirements specifications. Over this dataset, our approach yields an average precision of 81.2% and an average recall of 95.7%. Compared to simple baselines that demarcate requirements based on the presence of modal verbs and identifiers, our approach leads to an average gain of 16.4% in precision and 25.5% in recall.

Original languageEnglish
Title of host publicationProceedings - 2019 IEEE 27th International Requirements Engineering Conference, RE 2019
EditorsDaniela Damian, Anna Perini, Seok-Won Lee
PublisherIEEE Computer Society
Pages51-62
Number of pages12
ISBN (Electronic)9781728139128
DOIs
Publication statusPublished - Sep 2019
Externally publishedYes
Event27th IEEE International Requirements Engineering Conference, RE 2019 - Jeju Island, Korea, Republic of
Duration: 23 Sep 201927 Sep 2019

Publication series

NameProceedings of the IEEE International Conference on Requirements Engineering
Volume2019-September
ISSN (Print)1090-705X
ISSN (Electronic)2332-6441

Conference

Conference27th IEEE International Requirements Engineering Conference, RE 2019
Country/TerritoryKorea, Republic of
CityJeju Island
Period23/09/1927/09/19

Keywords

  • Machine Learning
  • Natural Language Processing
  • Requirements Identification and Classification
  • Textual Requirements

Fingerprint

Dive into the research topics of 'A machine learning-based approach for demarcating requirements in textual specifications'. Together they form a unique fingerprint.

Cite this