Natural Language Data Management and Interfaces
Yunyao Li, Davood Rafiei ISBN: 9781681734088 | PDF ISBN: 9781681734095 Hardcover ISBN: 9781681734101 Copyright © 2018 | 156 Pages | Publication Date: August, 2018 |
The volume of natural language text data has been rapidly increasing over the past two decades, due to factors such as the growth of the Web, the low cost associated with publishing, and the progress on the digitization of printed texts. This growth combined with the proliferation of natural language systems for search and retrieving information provides tremendous opportunities for studying some of the areas where database systems and natural language processing systems overlap.
This book explores two interrelated and important areas of overlap: (1) managing natural language data and (2) developing natural language interfaces to databases. It presents relevant concepts and research questions, state-of-the-art methods, related systems, and research opportunities and challenges covering both areas. Relevant topics discussed on natural language data management include data models, data sources, queries, storage and indexing, and transforming natural language text. Under natural language interfaces, it presents the anatomy of these interfaces to databases, the challenges related to query understanding and query translation, and relevant aspects of user interactions. Each of the challenges is covered in a systematic way: first starting with a quick overview of the topics, followed by a comprehensive view of recent techniques that have been proposed to address the challenge along with illustrative examples. It also reviews some notable systems in details in terms of how they address different challenges and their contributions. Finally, it discusses open challenges and opportunities for natural language management and interfaces.
The goal of this book is to provide an introduction to the methods, problems, and solutions that are used in managing natural language data and building natural language interfaces to databases. It serves as a starting point for readers who are interested in pursuing additional work on these exciting topics in both academic and industrial environments.
Table of Contents
Preface
Acknowledgments
Introduction
Background
Natural Language Data Management
Natural Language Interfaces to Databases
Open Challenges and Opportunities
Conclusions
Bibliography
Authors’ Biographies
Index
About the Author(s)
Yunyao Li, IBM Reseach – Almaden
Yunyao Li is a Senior Research Manager and Research Staff Member with IBM Research – Almaden. She is also a Master Inventor and a member of IBM Academy of Technology. Her expertise is in the interdisciplinary areas of databases, natural language processing, human-computer interaction, machine learning, and information retrieval. Her contributions in these areas have led to over 50 research publications, more than 20 patents granted or filed, multiple graduate-level courses (including 2 Massive Open Online Courses), and billions of revenue generated from technology transfer. She is widely recognized in these areas both within IBM and by the external research community, and regularly serves on prestigious program committees, editorial boards, and review panels. Her current research interest focuses on taming unstructured and semi-structured content to enable the building of new generations of AI applications for the enterprise. Yunyao is also passionate about improving the diversity for the STEM field. She has been actively mentoring women and under-represented minorities for over ten years. She received her Ph.D. and master’s degrees from the University of Michigan, Ann Arbor and undergraduate degrees from Tsinghua University, Beijing, China.
Davood Rafiei, University of Alberta
Davood Rafiei did his undergrad work at the Sharif University of Technology, his M.Sc. at the University of Waterloo, and his Ph.D. at the University of Toronto before joining the University of Alberta, where he is now Associate Professor of Computer Science and member of the Database Systems Research Group. His areas of expertise, which span over databases and the Web, is on managing large complex data. In particular, his research interest includes natural language data management and integration, spatial analysis and mining of web content, similarity-based queries and indexing, and web information retrieval. Davood regularly serves in the program committees of major database and data mining conferences (such as SIGMOD, VLDB, KDD, ICDM, CIKM) and Web conferences (such as WWW). Davood has spent time, as a visiting scientist, at Google (Mountain View), Kyoto University, and the University of Paris Descartes.