First workshop on Automated Knowledge Base Construction
May 17-19, 2010
CALL FOR PAPERS
4-8 page submissions due March 5, 2010 for non-Pascal members
4-8 page submissions due March 15, 2010 for Pascal members
Good decision-making is dependent on comprehensive, accurate knowledge. But the information relevant to many important decisions in areas such as business, government, medicine and scientific research is massive, and growing at an accelerating pace. Relevant raw data is widely available on the web and other data sources, but usually in order to be useful it must be gathered, organized, and normalized into a knowledge base.
Hand-built knowledge bases such as Wikipedia have made us all better decision-makers. However more than human editing will be necessary to create a wide variety of domain-specific, deeply comprehensive, more highly structured knowledge bases.
A variety of automated methods have begun to reach levels of accuracy and scalability that make them applicable to automatically constructing useful knowledge bases. These capabilities have been enabled by research in areas including information extraction, information integration, databases, search and machine learning.
There are substantial scientific and engineering challenges in advancing and integrating such relevant methodologies.
This workshop will gather researchers in a variety of fields that contribute to the automated construction of knowledge bases.
There has recently been is a tremendous amount of new work in this area, some of it in traditionally disconnected communities. In this workshop the organizers aim to bring these communities together.
Topics of interest include:
* information extraction; open information extraction, named entity extraction; entity resolution, relation extraction.
* information integration; schema alignment; ontology alignment; ontology construction.
* alignment between knowledge bases and text.
* joint inference between text interpretation and knowledge base pattern analysis, reading the web, learning by reading.
* databases; distributed information systems; probabilistic databases.
* scalable computation; distributed computation.
* information retrieval; search on mixtures of structured and unstructured data; querying under uncertainty.
* machine learning; unsupervised, lightly-supervised and distantly-supervised learning; learning from naturally-available data.
* human-computer collaboration in knowledge base construction; automated population of wikis.
* inference; scalable approximate inference.
* languages, toolkits and systems for automated knowledge base construction.
* demonstrations of existing automatically-built knowledge bases.
Speakers / Participants include:
Philip Bohannon, Yahoo! Research
Michael Cafarella, University of Michigan
AnHai Doan, University of Wisconsin
Patrick Gallinari, LIP6
Alon Halevy, Google
Zachary Ives, University of Pennsylvania
Ron Kaplan, Powerset / Microsoft
Andrew McCallum, University of Massachusetts
Zaiqing Nie, Microsoft Research
Fernando Pereira, Google & University of Pennsylvania
Sunita Sarawagi, Indian Institute of Technology
Gerhard Weikum, Max Planck Institute
* Paper submission deadline: Friday 5 March 2010
* Notification of acceptance: Friday 2 April 2010
* Camera-ready due from authors: Monday 3 May 2010
* Workshop: Mon-Wed 17-19 May 2010 (just after AISTATS)
Program Chair: Andrew McCallum
Publication Chair: Guillaume Bouchard
Funding Chair: Cedric Archambeau
Local Arrangements: Onno Zoeter
Host Arrangements: Jean-Marc Andreoli
The workshop will take place in the chateau at Xerox Research Centre Europe, near Grenoble, France. In addition to technical talks, talks about submitted papers and a poster session, we will have plenty of time for informal discussions and community building: a reception dinner the evening before we begin, lunches provided on-site with time to talk, a banquet dinner on the first evening, and on the second day a half-day hike in the local Alps, including a raclette dinner at a refuge.