{"id":4911,"date":"2019-02-04T09:11:54","date_gmt":"2019-02-04T09:11:54","guid":{"rendered":"http:\/\/www.odbms.org\/blog\/?p=4911"},"modified":"2019-02-04T16:28:04","modified_gmt":"2019-02-04T16:28:04","slug":"on-gaining-knowledge-of-diabetes-using-graphs-interview-with-alexander-jarasch","status":"publish","type":"post","link":"https:\/\/www.odbms.org\/blog\/2019\/02\/on-gaining-knowledge-of-diabetes-using-graphs-interview-with-alexander-jarasch\/","title":{"rendered":"On gaining Knowledge of Diabetes using Graphs. Interview with Alexander Jarasch"},"content":{"rendered":"<blockquote><p><strong>&#8220;The challenge is that we have to combine lots of different types of data, simultaneously, depending on genetics,\u00a0epigenetics, different subject matter areas such as\u00a0lipidomics,\u00a0metabolomics, the lifestyle and behaviour of the patient and looking at people in different cultures and environments. \u00a0The variety of data we need to analyse is a major challenge, which is why from a data perspective we use graph. It is here we can make the links to answer biomedical queries.&#8221; <\/strong><strong>&#8211;Alexander Jarasch.<\/strong><\/p><\/blockquote>\n<p>I have interviewed<strong> Alexander Jarasch, <\/strong>head of data and knowledge management at the<strong>\u00a0<a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/www.dzd-ev.de\/en\/index.html');\"  href=\"https:\/\/www.dzd-ev.de\/en\/index.html\" target=\"_blank\">German Center for Diabetes Research<\/a>\u00a0<\/strong>(DZD)<strong>. <\/strong>We discussed what are the main challenges in trying to understand more about diabetes,<strong>\u00a0<\/strong>and<strong>\u00a0<\/strong>how diabetes researchers are using graph database\u00a0technology in order to create\u00a0knowledge graphs\u00a0and\u00a0find hidden connections in medical data.<\/p>\n<p>RVZ<\/p>\n<p><strong>Q1. You are the head of data and knowledge management at the German Center for Diabetes Research (DZD). What are your main tasks?<\/strong><\/p>\n<p><strong>Alexander Jarasch:\u00a0<\/strong>There are several responsibilities that my team fulfils within DZD, \u2013 these include IT infrastructure which can encompass databases, data transfer services, data management and knowledge management as a second part of our remit.<\/p>\n<p><strong>Q2. Diabetes is one of the most widespread diseases worldwide. What are the main challenges in trying to understand more about diabetes?<\/strong><\/p>\n<p><strong>Alexander Jarasch:\u00a0<\/strong><a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/en.wikipedia.org\/wiki\/Diabetes_mellitus');\"  href=\"https:\/\/en.wikipedia.org\/wiki\/Diabetes_mellitus\" target=\"_blank\">Diabetes<\/a> is a metabolic disease, and a complex area to understand. It is not yet obvious what causes type 2 diabetes, but it is clearly linked to obesity. Here, we try to understand the molecular mechanisms, where diabetes starts and how we can try and prevent it. The challenge is that we have to combine lots of different types of data, simultaneously, depending on genetics, <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/en.wikipedia.org\/wiki\/Epigenetics');\"  href=\"https:\/\/en.wikipedia.org\/wiki\/Epigenetics\" target=\"_blank\">epigenetics<\/a>, different subject matter areas such as <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/en.wikipedia.org\/wiki\/Lipidomics');\"  href=\"https:\/\/en.wikipedia.org\/wiki\/Lipidomics\" target=\"_blank\">lipidomics<\/a>, <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/en.wikipedia.org\/wiki\/Metabolomics');\"  href=\"https:\/\/en.wikipedia.org\/wiki\/Metabolomics\" target=\"_blank\">metabolomics<\/a>, the lifestyle and behaviour of the patient and looking at people in different cultures and environments.<\/p>\n<p>All these dependencies are connected to each other. <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/en.wikipedia.org\/wiki\/Metabolism');\"  href=\"https:\/\/en.wikipedia.org\/wiki\/Metabolism\" target=\"_blank\">Metabolism<\/a> is connected to the environment, genetics, epigenetics and so forth. The big challenge is to see this not just from one perspective, but from as many perspectives at the same time as we can get.<\/p>\n<p>From a data management point of view it is not easy bringing all this patient-related data together with basic research data, and then to combine it with publicly available data, all held in disparate data stores and souces. We need to bring this heterogenous data together and connect it in a very clear way.<\/p>\n<p><strong>Q3. How is the status of research in treating and preventing the disease?<\/strong><\/p>\n<p><strong>Alexander Jarasch:\u00a0<\/strong>Diabetes is not currently curable. We have to distinguish between <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/en.wikipedia.org\/wiki\/Diabetes_mellitus_type_1');\"  href=\"https:\/\/en.wikipedia.org\/wiki\/Diabetes_mellitus_type_1\" target=\"_blank\">type 1<\/a> and <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/en.wikipedia.org\/wiki\/Diabetes_mellitus_type_2');\"  href=\"https:\/\/en.wikipedia.org\/wiki\/Diabetes_mellitus_type_2\" target=\"_blank\">type 2 diabetes<\/a>. Preventing type 1 is not relevant, as it is genetic and one inherits it. Preventing type 2 is very complicated. Obviously it is suggested that patients lead a healthier life, play more sport and drink less alcohol. But some patients don\u2019t respond to lifestyle interventions.<\/p>\n<p>The research itself is very complex and diverse. You can look at it from the patient side, the basic research side or the animal model side. Preventing diabetes is a complicated field and the research is ongoing. There is no clear outcome for the patient at present.<\/p>\n<p><strong>Q4. How do you gain knowledge of diabetes from the datasets and the databases you already have?<\/strong><\/p>\n<p><strong>Alexander Jarasch:\u00a0<\/strong>We have different types of data \u2013 patient data from clinical trials, animal models, basic research \u2013 basically all the data from the various <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/en.wikipedia.org\/wiki\/Omics');\"  href=\"https:\/\/en.wikipedia.org\/wiki\/Omics\" target=\"_blank\">omics<\/a>. We analyse this to gain more knowledge by connecting this data and viewing it all simultaneously. We also look to gain knowledge from the large data sets by applying <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/en.wikipedia.org\/wiki\/Machine_learning');\"  href=\"https:\/\/en.wikipedia.org\/wiki\/Machine_learning\" target=\"_blank\">machine learning<\/a>. On the database side we have introduce <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/en.wikipedia.org\/wiki\/Graph_database');\"  href=\"https:\/\/en.wikipedia.org\/wiki\/Graph_database\" target=\"_blank\">graph databases<\/a> in the form of <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/neo4j.com\/');\"  href=\"https:\/\/neo4j.com\/\" target=\"_blank\">Neo4j<\/a> in order to create <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/en.wikipedia.org\/wiki\/Knowledge_Graph');\"  href=\"https:\/\/en.wikipedia.org\/wiki\/Knowledge_Graph\" target=\"_blank\">knowledge graphs<\/a>.<\/p>\n<p><strong>Q5. If you look at the characteristics of Big Data: Volume, Variety, Velocity, Veracity; which ones are relevant for you?<\/strong><\/p>\n<p><strong>Alexander Jarasch:\u00a0<\/strong>I would not highlight anyone of these as they all have the same level of importance.\u00a0If we don\u2019t have enough data we don\u2019t have the statistical significance , if we don\u2019t have a variety of data we can\u2019t distinguish between its different states. If we don\u2019t have high quality data we can\u2019t keep up with the velocity necessary to answer the questions. The variety of data we need to analyse is a major challenge, which is why from a data perspective we use graph. It is here we can make the links to answer biomedical queries.<\/p>\n<p><strong>Q6. What are the main benefits when you start connecting patients&#8217; data?<\/strong><\/p>\n<p><strong>Alexander Jarasch:\u00a0<\/strong>The main benefit of connecting a patient\u2019s data, which could also incidentally be an animal model, is that you can see the data from a number of perspectives. The more parameters you have the more complete the puzzle can be. The benefit here is being able to see the patient from many different sides. One discipline is not sufficient to answer the biomedical questions or help in the prevention of diabetes.<\/p>\n<p>We can also connect between different centers. Diabetes, for example, has co-complications with other diseases. These include cancer, cardiovascular disease and Alzheimer\u2019s. We can now connect and look at these different types of data and better understand how symptoms and causes interconnect.<\/p>\n<p><strong>Q7. In one of your <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/neo4j.com\/graphconnect-2018\/session\/graphs-in-medicine-fighting-diabetes');\"  href=\"https:\/\/neo4j.com\/graphconnect-2018\/session\/graphs-in-medicine-fighting-diabetes\" target=\"_blank\">use case you have studied pre diabetes, using graphs<\/a> to connect data from animal models, genetics, metabolomics and literature to deduce causes of prediabetes in human. What results did you obtain so far?<\/strong><\/p>\n<p><strong>Alexander Jarasch:\u00a0<\/strong>We have connected different types of public data and our own data. One result is the hypothesis of seven metabolites that overlap between human genomic data and that seen in a prediabetes pig model. This is now under further investigation and we will dig deeper. The question is which pathways do these metabolites follow and how are they regulated in the body? It is in itself a very complex question.<\/p>\n<p><strong>Q8. What is your experience so far in using graph technology and specifically Neo4j?<\/strong><\/p>\n<p><strong>Alexander Jarasch:\u00a0<\/strong>We are now at a point with graph databases where we can easily connect different types of data \u2013 where the drawings and brainstorming sessions with researchers come very close to the data model. This makes it much easier to query data, even for non-computer scientists to answer questions. When it comes to Neo4j it is easy to install and implement. The query language, <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/neo4j.com\/developer\/cypher-query-language\/');\"  href=\"https:\/\/neo4j.com\/developer\/cypher-query-language\/\" target=\"_blank\">Cypher<\/a>, is easy to understand and the visualisation software is again very promising for non-computer scientists. Essentially, it makes it far easier for us to combine different types of data.<\/p>\n<p><strong>Q9. What are the main benefits in using graph technology in your area of work?<\/strong><\/p>\n<p><strong>Alexander Jarasch:\u00a0<\/strong>The main benefit of graph technology is its ability to connect heterogenous data across different locations and species. This is possible with relational databases, but it is very complicated. We do still use relational databases as they are connected to different devices and recurring processes and are fit for purpose in these roles. It is in combining and connecting heterogenous data, where graph technology has the greatest impact. This is a situation where relational databases are rather limited.<\/p>\n<p><strong>Q10. Do you think that connecting data and applying modern machine learning techniques will help scientists getting closer to understand this complex disease and hopefully help to care for patients in the future?<\/strong><\/p>\n<p><strong>Alexander Jarasch:\u00a0<\/strong>Yes, I would definitely agree with this. Connecting different types of data is key to modern data analysis and especially in life science \/ health care industry. Of course this makes the process much more complex and far bigger. Applying machine learning techniques can help to cope this and to gain the knowledge from many data sources. This provides us with a better understanding of diseases in general I would say. We are applying ML techniques on our big data sets. One example would be to <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/en.wikipedia.org\/wiki\/Cluster_analysis');\"  href=\"https:\/\/en.wikipedia.org\/wiki\/Cluster_analysis\" target=\"_blank\">cluster<\/a> patient groups in order to identify different subtypes of diabetes.<\/p>\n<p>The question is how can we distinguish between patient (groups) and treat people individually when they get diabetes. Some people, for example, don\u2019t react to lifestyle intervention when it comes to diabetes. We have tall, lean people who have diabetes, obese people with diabetes but also obese people who don\u2019t have diabetes. Obviously, the mechanisms behind that must be quite different from each other, and thus a single therapy or prevention for all people is most likely not working. That\u2019s why we connecting data sources and try to cluster our patients into subgroups to come up with individual treatments or suggested interventions. Graph technology provides us with a way of connecting relevant data sources.<\/p>\n<p>&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8211;<\/p>\n<p><a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/www.odbms.org\/blog\/wp-content\/uploads\/2019\/01\/Jarasch_Alexander_2.jpeg');\"  href=\"http:\/\/www.odbms.org\/blog\/wp-content\/uploads\/2019\/01\/Jarasch_Alexander_2.jpeg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone  wp-image-4932\" src=\"http:\/\/www.odbms.org\/blog\/wp-content\/uploads\/2019\/01\/Jarasch_Alexander_2-214x300.jpeg\" alt=\"Jarasch_Alexander_2\" width=\"111\" height=\"156\" srcset=\"https:\/\/www.odbms.org\/blog\/wp-content\/uploads\/2019\/01\/Jarasch_Alexander_2-214x300.jpeg 214w, https:\/\/www.odbms.org\/blog\/wp-content\/uploads\/2019\/01\/Jarasch_Alexander_2-732x1024.jpeg 732w, https:\/\/www.odbms.org\/blog\/wp-content\/uploads\/2019\/01\/Jarasch_Alexander_2.jpeg 915w\" sizes=\"(max-width: 111px) 100vw, 111px\" \/><\/a><\/p>\n<p><strong>Alexander Jarasch <\/strong><em>currently works at the German Center for diabetes research and is responsible for data and knowledge management. Before he worked at the &#8220;Pharma Research and Early Development (pRED)&#8221; at Roche. Alexander does research in Computing in Mathematics, Natural Science, Engineering and Medicine, Databases and Data Mining.<\/em><\/p>\n<p><strong>Resources<\/strong><\/p>\n<p><span id=\"eow-title\" class=\"watch-title\" dir=\"ltr\" title=\"Graphs to Fight Diabetes \u2013 Dr. Alexander Jarasch, DZD\">&#8211; <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/www.youtube.com\/watch?v=1QqOzHR1Mj0');\"  href=\"https:\/\/www.youtube.com\/watch?v=1QqOzHR1Mj0\" target=\"_blank\">Graphs to Fight Diabetes \u2013 Dr. Alexander Jarasch, DZD (link to YouTube Video)<\/a>,\u00a0GraphConnect-2018, New York.<\/span><\/p>\n<p>&#8211;<a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/neo4j.com\/graphconnect-2018\/session\/graphs-in-medicine-fighting-diabetes');\"  href=\"https:\/\/neo4j.com\/graphconnect-2018\/session\/graphs-in-medicine-fighting-diabetes\" target=\"_blank\">\u00a0Graphs to Fight Diabetes<\/a><a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/www.youtube.com\/watch?v=1QqOzHR1Mj0');\"  href=\"https:\/\/www.youtube.com\/watch?v=1QqOzHR1Mj0\" target=\"_blank\">\u2013<\/a><a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/neo4j.com\/graphconnect-2018\/session\/graphs-in-medicine-fighting-diabetes');\"  href=\"https:\/\/neo4j.com\/graphconnect-2018\/session\/graphs-in-medicine-fighting-diabetes\" target=\"_blank\">\u00a0Dr. Alexander Jarasch (link to Slides),<\/a>\u00a0GraphConnect-2018, New York.<\/p>\n<p>&#8211;\u00a0<a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/earlyadopter.com\/2018\/10\/25\/neo4jjarasch\/');\"  href=\"https:\/\/earlyadopter.com\/2018\/10\/25\/neo4jjarasch\/\" target=\"_blank\">Using Graphs to Fight Diabetes: A Podcast with Alexander Jarasch<\/a>,\u00a02018\/10\/25.<\/p>\n<p>&#8211;\u00a0<a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC5851211\/');\"  href=\"https:\/\/www.ncbi.nlm.nih.gov\/pmc\/articles\/PMC5851211\/\" target=\"_blank\">Artificial Intelligence Methodologies and Their Application to Diabetes<\/a>,\u00a0<span class=\"cit\">J Diabetes Sci Technol. 2018 Mar; 12(2): 303\u2013310,\u00a0<\/span><span class=\"fm-vol-iss-date\">Published online 2017 May 25. <\/span><span class=\"doi\">doi:\u00a0<a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/dx.doi.org\/10.1177%2F1932296817710475');\"  href=\"https:\/\/dx.doi.org\/10.1177%2F1932296817710475\" target=\"pmc_ext\">10.1177\/1932296817710475<\/a><\/span><\/p>\n<p>&#8211;\u00a0<a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/www.phusewiki.org\/wiki\/images\/images\/e\/ef\/Slater08.pdf');\"  href=\"http:\/\/www.phusewiki.org\/wiki\/images\/images\/e\/ef\/Slater08.pdf\" target=\"_blank\">Beyond data integration<\/a>,\u00a0Drug Discovery Today, February 2008 R<\/p>\n<p><strong>Related Posts<\/strong><\/p>\n<p>&#8211;\u00a0<a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/www.odbms.org\/blog\/2018\/09\/on-using-ai-and-data-analytics-in-pharmaceutical-research-interview-with-bryn-roberts\/');\"  href=\"http:\/\/www.odbms.org\/blog\/2018\/09\/on-using-ai-and-data-analytics-in-pharmaceutical-research-interview-with-bryn-roberts\/\" target=\"_blank\" rel=\"nofollow\">On using AI and Data Analytics in Pharmaceutical Research. Interview with Bryn Roberts\u00a0<\/a>,<span class=\"feed-source\">\u00a0ODBMS Industry Watch,<\/span><span class=\"feed-date\">\u00a02018-09-10<\/span><\/p>\n<p>&#8211;\u00a0<a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/www.odbms.org\/2017\/08\/beyond-the-molecule-and-beyond-the-device-machine-learning-and-the-future-of-healthcare\/');\"  href=\"http:\/\/www.odbms.org\/2017\/08\/beyond-the-molecule-and-beyond-the-device-machine-learning-and-the-future-of-healthcare\/\" target=\"_blank\">Beyond the Molecule and Beyond the Device: Machine Learning and the Future of Healthcare<\/a>, ODBMS.org, EXPERT ARTICLES,\u00a024 AUG, 2017<\/p>\n<p><strong>Follow us on Twitter: <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/twitter.com\/odbmsorg');\"  href=\"https:\/\/twitter.com\/odbmsorg\" target=\"_blank\">@odbmsorg<\/a><\/strong><\/p>\n<p>##<\/p>\n<!-- AddThis Advanced Settings generic via filter on the_content --><!-- AddThis Share Buttons generic via filter on the_content -->","protected":false},"excerpt":{"rendered":"<p>&#8220;The challenge is that we have to combine lots of different types of data, simultaneously, depending on genetics,\u00a0epigenetics, different subject matter areas such as\u00a0lipidomics,\u00a0metabolomics, the lifestyle and behaviour of the patient and looking at people in different cultures and environments. \u00a0The variety of data we need to analyse is a major challenge, which is why [&hellip;]<!-- AddThis Advanced Settings generic via filter on get_the_excerpt --><!-- AddThis Share Buttons generic via filter on get_the_excerpt --><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[1],"tags":[1378,35,66,1019,1377,1382,1381,1379,231,233,715,1376,1380,1383,350,1385,1384,402],"_links":{"self":[{"href":"https:\/\/www.odbms.org\/blog\/wp-json\/wp\/v2\/posts\/4911"}],"collection":[{"href":"https:\/\/www.odbms.org\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.odbms.org\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.odbms.org\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.odbms.org\/blog\/wp-json\/wp\/v2\/comments?post=4911"}],"version-history":[{"count":22,"href":"https:\/\/www.odbms.org\/blog\/wp-json\/wp\/v2\/posts\/4911\/revisions"}],"predecessor-version":[{"id":4940,"href":"https:\/\/www.odbms.org\/blog\/wp-json\/wp\/v2\/posts\/4911\/revisions\/4940"}],"wp:attachment":[{"href":"https:\/\/www.odbms.org\/blog\/wp-json\/wp\/v2\/media?parent=4911"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.odbms.org\/blog\/wp-json\/wp\/v2\/categories?post=4911"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.odbms.org\/blog\/wp-json\/wp\/v2\/tags?post=4911"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}