publication . Article . 2003

A corpus-based approach to generalising a chatbot system

Abu Shawar, Bayan; Atwell, Eric;
Open Access English
  • Published: 01 Sep 2003
  • Publisher: Sociedad Española para el Procesamiento del Lenguaje Natural
  • Country: Spain
International research in NLP is dominated by work on English. NLP techniques and systems can be ported to other natural languages, but this is generally a labour-intensive task requiring scarce computational and linguistic expertise; hence minority languages are poorly represented in NLP technology. We present an automated approach to porting one NLP technology, the AIML-based chatbot, to new languages by using a corpus in the target language to retrain the chatbot. We have successfully automated the production of chatbots that converse in French and Afrikaans, and are developing further demonstrators in Spanish and Arabic.
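The abstract does not spell out how a corpus retrains the chatbot. As a minimal sketch, assuming the corpus is a list of dialogue turns that alternate between speakers, each turn can become an AIML pattern and the turn that follows it the template. The function names `normalise` and `corpus_to_aiml` are hypothetical, and the choice to keep only the first reply per pattern is an assumption, not the authors' documented method.

```python
import re
from xml.sax.saxutils import escape


def normalise(utterance: str) -> str:
    """Upper-case the text and replace punctuation with spaces.

    AIML patterns are conventionally upper-case and free of punctuation.
    In Python 3, \\w is Unicode-aware, so accented letters are kept.
    The underscore is removed too, because it is an AIML wildcard.
    """
    cleaned = re.sub(r"[^\w\s]|_", " ", utterance)
    return " ".join(cleaned.upper().split())


def corpus_to_aiml(turns):
    """Build an AIML document from a list of alternating dialogue turns.

    Assumption (not from the source): each turn is the pattern and the
    next turn is its template; only the first reply per pattern is kept.
    """
    categories = {}
    for prompt, reply in zip(turns, turns[1:]):
        pattern = normalise(prompt)
        if pattern and pattern not in categories:
            categories[pattern] = reply.strip()

    lines = ['<aiml version="1.0.1">']
    for pattern, template in categories.items():
        lines.append(
            f"  <category><pattern>{escape(pattern)}</pattern>"
            f"<template>{escape(template)}</template></category>"
        )
    lines.append("</aiml>")
    return "\n".join(lines)


if __name__ == "__main__":
    # Tiny made-up French dialogue, for illustration only.
    sample_turns = ["Bonjour, ça va ?", "Très bien, merci.", "Et toi ?"]
    print(corpus_to_aiml(sample_turns))
```

Because the only language-specific input is the corpus itself, the same script could in principle be pointed at a French, Afrikaans, Spanish or Arabic transcript without change, which is the appeal of a corpus-driven approach.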
ACM Computing Classification System: Information Systems (Information Storage and Retrieval); Computing Methodologies (Artificial Intelligence); Computing Methodologies (Pattern Recognition); Computing Methodologies (Document and Text Processing)
free text keywords: Chatbot, Dialogue, Corpus, Machine learning, English language, French language, Afrikaans language, Arabic language