Skip navigation

The Right Approach to Extracting Actionable Insights from Your Data

Christopher Waller

VP, Chief Scientist, EPAM
Blog
  • Life Sciences & Healthcare

Data Lakes: A Solution to Siloed Legacy Corporate Databases?

Everybody else has one – I want one, too! Of course, I’m talking about data lakes. Their still-growing popularity has created massive repositories of all sorts of data – structured, unstructured, numerical, textual – within many life sciences and healthcare organizations whose digital research labs are consuming and analyzing data at an unprecedented pace.

We often hear of the five Vs when we talk about data lakes – volume, variety, velocity, veracity and – wait for it – value! The argument that most companies make to justify the investment is that, by throwing all of a corporation’s data together and applying analytics through Machine Learning (ML) or Artificial Intelligence (AI), the fifth V will most certainly appear. But what is right approach to extracting this value from your data and making it an actionable asset?

Whether we’re dealing with enterprise data lakes or our siloed legacy corporate databases, there’s still an inherent flaw in the approach: we’re often only looking for answers in the places we expect to find them, throwing new technologies at old problems expecting something magical to happen. In many cases, the answers exist in the interstitial spaces between the galaxies of data in our databases, and it takes a more thoughtful, tempered approach to find them.

Following the Scientific Method from Corporate Data Silos to Data Lakes

The life sciences industry is loaded with vast amounts of chemical and biological data collected by scientists to advance our knowledge of various diseases and empower us with the insights we need to create life-altering medicines, as well as discover the nature of our chemical and biological universe.

We are always searching for new technologies to make this process faster. ML algorithms and AI technology have been used in the life sciences and healthcare arena for quite some time now, from using multi-parameter regression to uncover relationships between chemical structures and biological activities to the application of more complex natural language processing to better define clinical trial cohorts and everything in between. The result is that data is siloed by nature.

Enterprise data lakes pose a significant challenge and possess significant potential to alter and even transform our industry by providing us with a technology that can, in conjunction with some ontology or other linking mechanism, break down the legacy data silos. Once data is combined, is it possible to use AI technology to troll through all the data?

Extracting Actionable Insights from Data Lakes: The Role of the Data Scientist

The analytics continuum in life sciences involves a wide variety of technologies that allow us to extract actionable insights from the information contained in our data collection. How we apply these insights is the responsibility of data scientists in a life sciences organization.

The industry is at a tipping point where there is a strong desire to become more ‘digital.’ This represents a paradigm shift from the legacy ‘experiment first’ culture where data are collected to disprove (or more often prove) a hypothesis and generally discarded (or at best stored and never used again) to a ‘data first’ culture where data scientists play a key role in the transformation and are tasked with extracting actionable insights from data that, in some cases, had been long forgotten.

Companies are all hiring data scientists as quickly as they are minted from data science programs at universities across the world. These data scientists are intended to drive the transformation of life sciences to become more data-driven through the application of analytics, including ML techniques. These data scientists represent the Natural Intelligence (NI) investments that life sciences companies are making.

A Final Word on AI

The life sciences industry must first become comfortable with this new approach being catalyzed through investments in NI. AI will naturally start to replace NI resources as the industry learns the value of the data and trusts the intelligence (natural or artificial) that is driving change. Whether you’re using a data lake or siloed data approach, there’s still a need for having the right people in place to take advantage of these technologies.

In closing, while I strongly believe that AI will play a major role in life sciences and healthcare, we must temper our enthusiasm and resist the urge to apply AI wholesale to our existing databases and emerging data lakes with the expectation that answers to our most challenging problems will be delivered. We must develop an even greater emphasis on data quality, metadata, interoperability and domain applicability if we truly expect to extract value in the form of actionable insights from the investments being in made.

In my final installment of this blog series, we’ll explore cloud collaboration as an enabler of industry transformation with specific use cases related to the digital research labs of the future and the associated security-based challenges and opportunities.

Hello. How Can We Help You?


Our Offices

  • Canada

    • Ottawa

      343 Preston Street,
      ON K1S 1N4, Ottawa
      Canada

      Map
    • Toronto

      5 Park Home Avenue,
      Suite 400,
      ON M2N 6L4, North York,
      Toronto
      Canada

      Map
      F: +1-416-595-1551
  • Mexico

    • Guadalajara

      Periférico Sur #8110,
      Col. El Mante
      45609 Tlaquepaque, Jalisco
      Mexico

      Map
  • United States

    • Newtown, PA

      41 University Drive,
      Suite 202,
      Newtown, PA 18940
      USA

      Map
      F: +1-267-759-8989
    • Bellevue, WA

      110 110th Ave. NE,
      Suite 310
      Bellevue, WA 98004
      USA

      Map
    • Boston, MA

      21 Drydock Avenue,
      Suite 410 W,
      Boston, MA 02210
      USA

      Map
    • Conshohocken, PA

      101 East 8th Ave,
      Suite 201,
      Conshohocken, PA 19428
      USA

      Map
    • Los Angeles, CA

      11601 Wilshire Blvd,
      Suite 350,
      Los Angeles, CA 90025
      USA

      Map
    • New York, NY

      24 West 25th Street,
      5th Floor,
      New York, NY 10010
      USA

      Map
      F: +1-267-759-8989
    • Philadelphia, PA

      30 South 15th Street,
      9th Floor,
      Philadelphia, PA 19102
      USA

      Map
    • San Francisco, CA

      222 Kearny Street,
      Suite 308,
      San Francisco, CA 94108
      USA

      Map
    • Washington D.C.

      7901 Jones Branch Drive,
      Suite 400,
      McLean, VA 22102
      USA

      Map
  • Australia

  • China

    • Guangzhou

      Unit B01, 23/F,
      Yuexiuxinduhui Building,
      No. 236, 6th Zhongshan Road,
      Yuexiu District, Guangzhou,
      China 510180

      Map
    • 广州

      中国广州市越秀区
      中山六路236号
      越秀新都会大厦中座 23楼 B01室
      邮编510180

      地图
    • Shanghai

      Room B509, 5th Floor,
      48 Weihai Road,
      Huangpu District, Shanghai,
      China 200000

      Map
    • 上海

      上海市黄浦区
      威海路48号
      5楼B509室
      邮编200000

      地图
    • Shenzhen

      3/F, Block 5, Vision Shenzhen Business Park,
      9th Gaoxin South Road, 
      Shenzhen Hi-tech Industrial Park,
      Nanshan District, Shenzhen,
      Guangdong, China 518057

      Map
    • 深圳

      中国广东省深圳市
      南山区高新南九道
      威新软件园5号楼3楼
      邮编518057

      地图
    • Suzhou

      Building 12, Creative Industrial Park,
      328 Xinghu Street,
      Suzhou Industrial Park,
      Suzhou, China 215123

      Map
    • 苏州

      中国江苏省苏州市
      苏州工业园区星湖街328号
      创意产业园内12号楼
      邮编215123

      地图
  • Hong Kong

    • Hong Kong

      26F&17F, The Wellington Tower,
      198 Wellington Street,
      Central, HK

      Map
  • India

    • Bangalore

      Smartworks,  
      Global Technology Park,
      Block C, Outer Ring Rd,
      Adarsh Palm Retreat, Bellandur,
      Bengaluru, Karnataka 560103
      India

      Map
    • Hyderabad

      10, 11 & 12th Floors,
      Salarpuria Sattva Knowledge City,
      Plot No. 2, Phase - 1,
      Survey No. 83/1,
      Raidurgam Village,
      Serilingampally Mandal,
      Hyderabad, Telangana - 500081
      India

      Map
    • Pune

      SmartWork Business Center Pvt Ltd,
      Suite 8, Level 1,
      West Wing, Nyati Unitree,
      Samrat Ashok Road,
      Yerwada, Pune - 411006,
      Maharashtra
      India

      Map
  • Japan

    • Tokyo

      Floor 1-10-11
      Shibadaimon Centre Building 10th
      Shibadaimon Minato-ku
      Tokyo 105-0012
      Japan

      Map
      F: +81-03-6880-9201
  • Singapore

    • Singapore

      5 Shenton Way
      UIC Building, #10-01,
      Singapore (068808)

      Map
  • United Arab Emirates

    • Dubai

      EPAM Systems FZ-LLC Dubai Branch
      2307 Arenco Tower, Dubai Media City
      PO Box 501929 Dubai
      United Arab Emirates

      Map