ALPaGE

Marie Candito - Research/Publications

Université Paris Diderot - INRIA

Université Paris 7

I work at the Alpage laboratory, in the area of natural language processing.
More precisely, my current topics are syntactic parsing, the syntax-semantics interface, and semantic and syntactic resources.

Resources: Here are some resources for statistical French dependency parsing, including the Sequoia Treebank.

PhD students:

2016 Michalon O., Ribeyre C., Candito M. and Nasr A. 2016,
Deeper syntax for better semantic parsing, Proceedings of the 26th International Conference on Computational Linguistics (Coling), Osaka, Japan, 2016.
pdf
Djemaa, M., Candito, M., Muller P. and Vieu L. 2016,
Corpus annotation within the French Framenet: methodology and results, Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC), Portorož, Slovenia, 2016.
pdf
Vieu L., Muller P., Candito M. and Djemaa M. 2016,
A General Framework for the Annotation of Causality Based on FrameNet, Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC), Portorož, Slovenia, 2016.
pdf
Seddah D. and Candito M. 2016,
Hard Time Parsing Questions: Building a QuestionBank for French, Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC), Portorož, Slovenia, 2016.
pdf
2014 Candito M. and Constant M., 2014,
Strategies for Multiword Expression Analysis and Dependency Parsing, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL'14), Baltimore, USA.
pdf
Ribeyre C., Candito M. and Seddah D., 2014,
Semi-Automatic Deep Syntactic Annotations of the French Treebank. Proceedings of the 13th International Workshop on Treebanks and Linguistic Theories (TLT13), Tübingen, Germany.
pdf
Marie Candito, Guy Perrier, Bruno Guillaume, Corentin Ribeyre, Karën Fort, Djamé Seddah and Eric de la Clergerie, 2014,
Deep Syntax Annotation of the Sequoia French Treebank, Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC), Reykjavik, Iceland, 2014.
pdf
Online Annotation guide
Download from corpus site
Candito, M., Amsili, P., Barque, L., Benamara, F., Chalendar, G., Djemaa, M., Haas, P., Huyghe, R., Mathieu, Y., Muller, P., Sagot, B. & Vieu, L., 2014,
Developing a French FrameNet: methodology and first results, Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC), Reykjavik, Iceland, 2014.
pdf
Seddah D., Candito M. and Henestroza Anguiano E., 2014,
A word clustering approach to domain adaptation: Robust parsing of source and target domains, In Journal of Logic and Computation (2013) 24(2): 395-411.

2013 Constant M., Candito M. and Seddah D., 2013,
The LIGM-Alpage architecture for the SPMRL 2013 Shared Task: Multiword Expression Analysis and Dependency Parsing, Proceedings of the Fourth SPMRL Workshop, Seattle, USA.
pdf
Djamé Seddah; Reut Tsarfaty; Sandra Kübler; Marie Candito; Jinho D. Choi; Richárd Farkas; Jennifer Foster; Iakes Goenaga; Koldo Gojenola Galletebeitia; Yoav Goldberg; Spence Green; Nizar Habash; Marco Kuhlmann; Wolfgang Maier; Yuval Marton; Joakim Nivre; Adam Przepiórkowski; Ryan Roth; Wolfgang Seeker; Yannick Versley; Veronika Vincze; Marcin Woliński; Alina Wróblewska, 2013,
Overview of the SPMRL 2013 Shared Task: A Cross-Framework Evaluation of Parsing Morphologically Rich Languages, Proceedings of the Fourth SPMRL Workshop, Seattle, USA.
pdf
2012 Candito M. and Seddah D., 2012,
Effectively long-distance dependencies in French : annotation and parsing evaluation, Proceedings of TLT'11, Lisbon, Portugal.
pdf
Seddah D., Sagot B. and Candito M., 2012,
The Alpage Architecture at the SANCL 2012 Shared Task: Robust Pre-Processing and Lexical Bridging for User-Generated Content Parsing, in Notes of the first workshop of Syntactic Analysis of Non Canonical Languages (SANCL'2012), co-located with NAACL'2012, Montreal, Canada.

Seddah D., Candito M., Crabbé B. and Henestroza Anguiano E., 2012,
Ubiquitous Usage of a Broad Coverage French Corpus: Processing the Est Republicain corpus, Proceedings of LREC 2012, Istanbul, Turkey.
pdf
Candito M.-H. and Seddah D., 2012,
Le corpus Sequoia : annotation syntaxique et exploitation pour l’adaptation d’analyseur par pont lexical, Proceedings of TALN'2012, Grenoble, France
Download Sequoia Treebank
pdf
2011 Candito M.-H., Henestroza Anguiano E. and Seddah D., 2011,
A Word Clustering Approach to Domain Adaptation: Effective Parsing of Biomedical Texts, Proceedings of the 12th International Conference on Parsing Technologies (IWPT'2011) - short paper, Dublin, Ireland
pdf
Henestroza Anguiano E. and Candito M.-H., 2011,
Resolving Difficult Syntactic Attachments with Parse Correction, Proceedings of EMNLP'2011 (poster session), Edinburgh, Scotland
pdf
2010 Candito M.-H., Nivre J., Denis P. and Henestroza Anguiano E., 2010,
Benchmarking of Statistical Dependency Parsers for French, Proceedings of COLING'2010 (poster session), Beijing, China
pdf
Candito M.-H. and Seddah D., 2010,
Parsing word clusters, Proceedings of the NAACL/HLT First Workshop on Statistical Parsing of Morphologically Rich Languages (SPMRL 2010), Los Angeles, USA
pdf
Tsarfaty R., Seddah D., Goldberg Y., Kuebler S., Versley Y., Candito M., Foster J., Rehbein I. and Tounsi L., 2010,
Statistical Parsing of Morphologically Rich Languages (SPMRL) What, How and Whither, Proceedings of the NAACL/HLT First Workshop on Statistical Parsing of Morphologically Rich Languages (SPMRL 2010), Los Angeles, USA
pdf
Seddah D., Chrupała G., Cetinoglu O., van Genabith J. and Candito M.-H., 2010,
Lemmatization and Statistical Lexicalized Parsing of Morphologically-Rich Languages. Proceedings of the NAACL/HLT Workshop on Statistical Parsing of Morphologically Rich Languages (SPMRL 2010), Los Angeles, USA
pdf
Candito M.-H., Crabbé B., and Denis P., 2010,
Statistical French dependency parsing: treebank conversion and first results, Proceedings of LREC'2010, Valletta, Malta
pdf
2009 Seddah, D., Candito M.-H. and Crabbé B., 2009,
Crossparser evaluation and tagset variation: a French treebank study. Proceedings of IWPT'09, Paris, France
pdf
Candito M.-H. and Crabbé B., 2009,
Improving generative statistical parsing with semi-supervised word clustering. Proceedings of IWPT'09 - short paper, Paris, France
pdf
Candito M.-H., Crabbé B., Denis P. and Guérin F., 2009,
Analyse syntaxique du français : des constituants aux dépendances. Proceedings of TALN 2009, Senlis, France
pdf
Candito M.-H., Crabbé B. and Seddah D., 2009,
On statistical parsing of French with supervised and semi-supervised strategies. Proceedings of the EACL 2009 workshop: Grammatical Inference for computational linguistics, Athens, Greece
pdf
2008 Crabbé B. and Candito M.-H., 2008,
Expériences d'analyse syntaxique statistique du français. Proceedings of TALN 2008, Avignon, France
pdf
some time ago... Candito, M.-H., 1999,
Organisation modulaire et paramétrable de grammaires électroniques lexicalisées. Application au français et à l'italien. PhD thesis, Université Paris 7.
pdf
Candito M.-H. and Kahane S. 1998,
Can the derivation tree represent a semantic graph? An answer in the light of Meaning-Text Theory. Proceedings of TAG+4. Philadelphia, USA
pdf