When Stanislav Dusko Ehrlich – a world expert in microbiology and a pioneer of metagenomics – and his team set out to create their next-generation biotech research platform, they needed a technology solution to support their stringent capacity and performance requirements for big data analytics.
Scientific research in the life sciences is often akin to searching for needles in haystacks. Finding the one protein, chemical, or genome that behaves or responds in the way the scientist is looking for is the key to the discovery process. For decades, high-performance computing (HPC) systems have accelerated this process, often by helping to identify and eliminate infeasible targets sooner.
“Datameer is all about providing a self-service, end-to-end experience for big data analytics on Hadoop. From data integration to analytics to visualization, we are wizard-led, point-and-click. Most recently we announced our Smart Analytics module, which allows business users to use data mining algorithms through a drag and drop UI. These new capabilities complement what data scientists are doing and enable business analysts to take advantage of advanced algorithms without involving IT.”
Leading researchers in data science and genomics are recommending strategies to help genomic scientists better manage, share, analyze and archive massive research and clinical data sets in an effort to ensure that the big data explosion results in better health outcomes and faster research discoveries.