Éric Charton scite author profile

Éric Charton

5 Publications

29 Citation Statements Received

42 Citation Statements Given

Affiliations

National Bank of Canada, Polytechnique Montréal, Laboratoire Informatique d'Avignon

Publications

Unsupervised knowledge acquisition for Extracting Named Entities from speech

Béchet

Charton

2010

This paper presents a Named Entity Recognition (NER) method dedicated to processing speech transcriptions. The main principle behind this method is to collect, in an unsupervised way, lexical knowledge for all entries in the ASR lexicon. This knowledge is gathered with two methods: by automatically extracting NEs from a very large set of textual corpora and by directly exploiting the structure contained in the Wikipedia resource. This lexical knowledge is used to update the statistical models of our NER module, which is based on a mixed approach combining generative models (Hidden Markov Models, HMM) and discriminative models (Conditional Random Fields, CRF). This approach was evaluated within the French ESTER 2 evaluation program and obtained the best results on the NER task on ASR transcripts.

Automatic Semantic Web Annotation of Named Entities

Charton

Gagnon

Ozell

2011

Using Collaborative Tagging for Text Classification: From Text Classification to Opinion Mining

et al. 2013

Numerous initiatives have allowed users to share knowledge or opinions using collaborative platforms. In most cases, the users provide a textual description of their knowledge, following very limited or no constraints. Here, we tackle the classification of documents written in such an environment. As a use case, our study is made in the context of text mining evaluation campaign material, related to the classification of cooking recipes tagged by users from a collaborative website. This context makes some of the corpus specificities difficult to model for machine-learning-based systems and keyword or lexical-based systems. In particular, different authors might have different opinions on how to classify a given document. The systems presented hereafter were submitted to the DÉfi Fouille de Textes 2013 evaluation campaign, where they obtained the best overall results, ranking first on task 1 and second on task 2. In this paper, we explain our approach for building relevant and effective systems dealing with such a corpus.

Improving Access to Justice with Legal Chatbots

Queudot

Charton

Meurs

2020

On average, one in three Canadians will be affected by a legal problem over a three-year period. Unfortunately, whether it is legal representation or legal advice, the very high cost of these services excludes disadvantaged and most vulnerable people, forcing them to represent themselves. For these people, accessing legal information is therefore critical. In this work, we attempt to tackle this problem by embedding legal data in a conversational interface. We introduce two dialog systems (chatbots) created to provide legal information. The first one, based on data from the Government of Canada, deals with immigration issues, while the second one informs bank employees about legal issues related to their job tasks. Both chatbots rely on various representations and classification algorithms, from mature techniques to novel advances in the field. The chatbot dedicated to immigration issues is shared with the research community as an open resource project.

Quick Starting Dialog Systems with Paraphrase Generation

Marceau

Belbahar

Queudot

et al. 2022

Preprint

Acquiring training data to improve the robustness of dialog systems can be a painstakingly long process. In this work, we propose a method to reduce the cost and effort of creating new conversational agents by artificially generating more data from existing examples, using paraphrase generation. Our proposed approach can kick-start a dialog system with little human effort, and brings its performance to a level satisfactory enough for allowing actual interactions with real end-users. We experimented with two neural paraphrasing approaches, namely Neural Machine Translation and a Transformer-based seq2seq model. We present the results obtained with two datasets in English and in French: a crowd-sourced public intent classification dataset and our own corporate dialog system dataset. We show that our proposed approach increased the generalization capabilities of the intent classification model on both datasets, reducing the effort required to initialize a new dialog system and helping to deploy this technology at scale within an organization.
