International Conference on Statistical Analysis of Textual Data
7-10 Jun 2016 Nice (France)
CoFiH: A heuristic for concept discovery in computer-assisted conceptual analysis
Louis Chartrand * , Jean-Guy Meunier  1@  , Davide Pulizzotto  1@  , José López González  1@  , Jean-François Chartier  1@  , Julian Trujillo Amaya  2@  , Tan Ngoc Le  1, *@  
1 : Laboratoire d'analyse cognitive de l'information  (LANCI)  -  Website
2 : Université de Valle
* : Corresponding author

While conceptual analysis can be facilitated by computer assistance, the absence of proper models for concepts in text has curtailed the development of such tools. The most common heuristic, which consists in identifying keywords as canonical expression of a concept, poses problems of ambiguity and fails to retrieve most of the relevant textual data. In this paper, we present CoFiH, an algorithm that exploits topics in order to retrieve segments relevant to a given concept. It is then applied to C.S. Peirce's Collected Papers to facilitate the analysis of Peirce's concept of LAW. Compared to the baseline, CoFiH produces better recall and enables a meaningful analysis along several topics.


Online user: 1