Extracting facts from open source software

Research output: Contribution to conferencePaper

46 Citations (Scopus)


Open source software systems are becoming increasingly important these days. Many companies are investing in open source projects and lots of them are also using such software in their own work. But because open source software is often developed without proper management the quality and reliability of the code may be uncertain. The quality of the code needs to be measured and this can be done only with the help of proper tools. In this paper we will describe a framework called Columbus with which we calculate the object oriented metrics validated by Basili et al. for illustrating how fault-proneness detection from the open source web and e-mail suite called Mozilla can be done. We will also compare the metrics of several versions of Mozilla to see how the predicted fault-proneness of the software system changed during its development. The Columbus frame-work has been further developed recently with a compiler wrapping technology that now gives us the possibility of automatically analyzing and extracting information from software systems without modifying any of the source code or makefiles. We will also introduce our fact extraction process here to show what logic drives the various tools of the Columbus framework and what steps need to be taken to obtain the desired facts.

Original languageEnglish
Number of pages10
Publication statusPublished - Dec 1 2004
EventProceedings - 20th IEEE International Conference on Software Maintenance, ICSM 2004 - Chicago, IL, United States
Duration: Sep 11 2004Sep 14 2004


OtherProceedings - 20th IEEE International Conference on Software Maintenance, ICSM 2004
CountryUnited States
CityChicago, IL



  • C
  • C++
  • CAN
  • Columbus
  • Compiler wrapping
  • Fact extraction
  • Fault-proneness detection
  • Metrics
  • Mozilla
  • Open source
  • Reverse engineering
  • Schema

ASJC Scopus subject areas

  • Software

Cite this

Ferenc, R., Siket, I., & Gyimóthy, T. (2004). Extracting facts from open source software. 60-69. Paper presented at Proceedings - 20th IEEE International Conference on Software Maintenance, ICSM 2004, Chicago, IL, United States. https://doi.org/10.1109/ICSM.2004.1357790