September 25, 2015 Forthcoming by Chapman and Hall/CRC
Reference – 270 Pages – 122 B/W Illustrations
ISBN 9781482239133 – CAT# K23259
Series: Chapman & Hall/CRC Data Mining and Knowledge Discovery Series
Provides you with an awareness of the key technologies and benefits of accelerated discovery in science
Describes a general methodology for accelerating scientific discovery through unstructured analytics
Presents tools, techniques, and best practices for implementing text mining in practice
Includes ten real-world problems and examples that illustrate applications of accelerated discovery in molecular biology, pharmaceutical research, microbe analysis, and other areas
Unstructured Mining Approaches to Solve Complex Scientific Problems
As the volume of scientific data and literature increases exponentially, scientists need more powerful tools and methods to process and synthesize information and to formulate new hypotheses that are most likely to be both true and important. Accelerating Discovery: Mining Unstructured Information for Hypothesis Generation describes a novel approach to scientific research that uses unstructured data analysis as a generative tool for new hypotheses.
The author develops a systematic process for leveraging heterogeneous structured and unstructured data sources, data mining, and computational architectures to make the discovery process faster and more effective. This process accelerates human creativity by allowing scientists and inventors to more readily analyze and comprehend the space of possibilities, compare alternatives, and discover entirely new approaches.
Encompassing systematic and practical perspectives, the book provides the necessary motivation and strategies as well as a heterogeneous set of comprehensive, illustrative examples. It reveals the importance of heterogeneous data analytics in aiding scientific discoveries and furthers data science as a discipline.
Scott Spangler is a principal data scientist, distinguished engineer, and master inventor in the Watson Innovations Group at the IBM Almaden Research Center. He has been involved with knowledge base and data mining research for the past 25 years. His recent work has applied Watson technology to help accelerate cancer research. He holds 45 patents and is the author of over 30 publications. He received a BS in mathematics from MIT and an MS in computer science from the University of Texas.