Repurposing Legacy Data: Innovative Case Studies
Repurposing Legacy Data: Innovative Case Studies
by Jules Berman
Release Date: 17 Mar 2015
Print Book ISBN :9780128028827
eBook ISBN :9780128029152
Pages: 176Dimensions: 229 X 152
This book takes a look at how data scientists have re-purposed legacy data, whether their own, or legacy data that has been donated to the public domain. The case studies in the book, from such diverse fields as cosmology, quantum physics, high-energy physics, microbiology, psychiatry, medicine and hospital administration, all serve to demonstrate how innovative people draw value from legacy data. By following the case examples, readers will learn how legacy data is restored, merged and analyzed for purposes that were never imagined by the original data creators. The book is a valuable resource for data scientists, big data curators, statisticians and researchers, as well as graduate-level students in computer science, statistics and information sciences.
- Discusses how combining existing data with other data sets of the same kind can produce an aggregate data set that serves to answer questions that could not be answered with any of the original data
- Presents a method for re-analyzing original data sets using alternate or improved methods that can provide outcomes more precise and reliable than those produced in the original analysis
- Explains how to integrate heterogeneous data sets for the purpose of answering questions or developing concepts that span several different scientific fields
Repurposing Legacy Data: Innovative Case Studies takes a look at how data scientists have re-purposed legacy data, whether their own, or legacy data that has been donated to the public domain.
Most of the data stored worldwide is legacy data-data created some time in the past, for a particular purpose, and left in obsolete formats. As with keepsakes in an attic, we retain this information thinking it may have value in the future, though we have no current use for it.
The case studies in this book, from such diverse fields as cosmology, quantum physics, high-energy physics, microbiology, psychiatry, medicine, and hospital administration, all serve to demonstrate how innovative people draw value from legacy data. By following the case examples, readers will learn how legacy data is restored, merged, and analyzed for purposes that were never imagined by the original data creators.
Primary Market Data scientists, Big Data curators, Statisticians, Researchers; Secondary Market: graduate level students in computer science, statistics, information sciences
Jules Berman holds two bachelor of science degrees from MIT (Mathematics, and Earth and Planetary Sciences), a PhD from Temple University, and an MD, from the University of Miami. He was a graduate researcher in the Fels Cancer Research Institute, at Temple University, and at the American Health Foundation in Valhalla, New York. His post-doctoral studies were completed at the U.S. National Institutes of Health, and his residency was completed at the George Washington University Medical Center in Washington, D.C. Dr. Berman served as Chief of Anatomic Pathology, Surgical Pathology and Cytopathology at the Veterans Administration Medical Center in Baltimore, Maryland, where he held joint appointments at the University of Maryland Medical Center and at the Johns Hopkins Medical Institutions. In 1998, he became the Program Director for Pathology Informatics in the Cancer Diagnosis Program at the U.S. National Cancer Institute, where he worked and consulted on Big Data projects. In 2006, Dr. Berman was President of the Association for Pathology Informatics. In 2011 he received the Lifetime Achievement Award from the Association for Pathology Informatics. He is a co-author on hundreds of scientific publications. Today Dr. Berman is a free-lance author, writing extensively in his three areas of expertise: informatics, computer programming, and cancer biology. A complete list of his publications is available at http://www.julesberman.info/pubs.htm As a Program Director at the National Cancer Institute, Dr. Berman directed a multi-institutional Big Data project and actively organized and participated in high-level conferences and meetings where Big Data efforts were planned. He made a number of contributions to the field, particularly in the areas of identification, de-identification, data exchange protocols, standards development, regulatory/legal issues, and metadata annotation. Aside from his personal experiences, he is a serious scholar of the subject and has studied the works of many other authors who have dealt with the many pitfalls in Big Data creation and analysis. He aims to provide readers with a balanced perspective of Big Data, that represents the views held by leaders in this multi-disciplined field.
Ph.D., M.D., freelance author with expertise in informatics, computer programming, and cancer biology, Columbia, MD, USA
Interview with Jules J. Berman
On Managing Big Data. Interview with Jules J. Berman, ODBMS Industry Watch, July 23, 2014
Download Chapter 9. Data Quality Management (.PDF): Multi-Domain Master Data Management_Ch9