Posts Tagged ‘data science’

The Next Stage for Strata Hadoop World and Big Data

David Dietrich

Director of Technical Marketing in Big Data Solutions at EMC
David Dietrich is the Director of Technical Marketing for the Big Data Solutions group at EMC – a group that provides technology and services solutions to customers with needs related to Big Data and data science. Prior to this role, Mr. Dietrich was head of data science education at EMC, where he developed three courses in data science for practitioners, heads of teams and executives.


Strata + Hadoop World New York is an event I look forward to attending each year. I like the sessions and speakers, and I enjoy seeing how the attendees and vendors have evolved over time. When I first began attending Strata + Hadoop World 4 or 5 years ago, attendance was only about 200 people; this year in NYC it topped 6,300! As a result, the conference has had to move from the New York Hilton to the Javits Center.

As with Big Data itself, focusing on the sheer volume of conference attendees doesn’t tell the whole story.  Here are a few of my observations from the conference.

  1. Early majority now joining the fray. As I mentioned, I’ve attended Strata for 4-5 years, and knowing that others also return every year, I expected heavier attendance at the sessions on emerging technologies that are gaining adoption, such as Spark or Kafka. Although this did occur, I underestimated how heavily attended the sessions aimed at first-time Strata attendees would be. The ‘beginner’ sessions were packed: for every person who’s been using Hadoop for years and now wants to experiment with Spark, there are still scores more trying to learn R or learn how to deploy Hadoop in a feasible way.

(more…)

What Are The Real Effects Of Climate Change?

Mona Patel

Senior Manager, Big Data Solutions Marketing at EMC
Mona Patel is a Senior Manager for Big Data Marketing at EMC Corporation. With over 15 years of working with data at The Department of Water and Power, Air Touch Communications, Oracle, and MicroStrategy, Mona decided to grow her career at EMC, a leader in Big Data.

Contributing to social good is now literally at everyone’s fingertips. That is why EMC and Earthwatch Institute have teamed up to encourage citizens to become data collectors, or citizen scientists. Through the collection of more data sources, data scientists can better uncover how climate change is affecting plants and animals by altering the timing of key natural events.

This collaboration is called the Whenology project, and its first study is underway to investigate how climate change is affecting raptor migrations at Acadia National Park. To create awareness and encourage more participation, EMC launched a microsite that provides educational materials, tracks progress, and reports insights.


I spoke with EMC Distinguished Engineer John Cardente about the Whenology project and its potential to provide a powerful citizen science platform for collaboratively tackling virtually any large-scale, high-impact societal issue.

1.  What is the Whenology project and what are your major objectives?

(more…)

Hadoop Summit 2015 Reflections

Chris Harrold

CTO Big Data Solutions at EMC
Chris is responsible for the development of large-scale analytics solutions for EMC customers around emerging analytics platform technologies. Currently, he is focused on EMC Business Data Lake Solutions and delivering this solution to key EMC customer accounts.



Before the ink has even really dried on Hadoop Summit 2015 in San Jose, I am sitting down in a rare moment of peace to write out some reflections on my experience and on what I have seen from the sessions, keynotes, partners, and users here at the show.

Hadoop Gets Real

The most lasting impression I got from the overall theme of the show and the people in attendance was that Hadoop is not an “emerging tool” anymore. The momentum, the use cases, and indeed the buzz among attendees all indicated massive adoption in the marketplace. Behind this wave of early adoption is a lot of pent-up demand that is waiting for things to stabilize and become more enterprise-ready. Once the tooling around the Hadoop ecosystem is more robust, and the platforms that it runs on are more operational, there is no limit to the demand that this ecosystem can produce.

As a counterpoint, another theme ran through the show: Hadoop is not “all things to all people”, and so there is a lot of discussion around the emergence of a logical successor to Hadoop as the analytics tool of record. Certainly the buzz around Spark indicates that this is the way of the future, and it ties into the second theme of the show that I observed in numerous conversations and sessions.

(more…)

Is It All About The Data Scientist?

Mona Patel


The answer is no. It is a holistic team effort that involves expanding the mindset and skill set of executives, business users, IT implementers, data scientists, and application developers so that they all work collectively to define a strategy and derive new insights from big data.

And that is why EMC is so heavily focused on breaking down organizational silos and training professionals to become data scientists, or at least to think like data scientists, transforming these individuals into data-savvy professionals working towards the same goal: competitive advantage.

I spoke to Louis Frolio, Advisory Technical Ed Consultant for EMC Big Data Solutions, about how he, as part of a team in EMC Education Services, is creating a massive professional transformation through a MOOC (Massive Open Online Course). The Data Lakes for Big Data MOOC gives you an opportunity to become a data-savvy professional and take on a big data or data science role in your organization at absolutely no cost.

The course kicked off May 11, but you still have plenty of time to enroll and complete the course to earn a certificate before June 8. The top 500 students (based on cumulative grade for the MOOC) will receive an electronic copy of the Data Science book just released by EMC Education Services.

1.  What is a MOOC and what is the goal of this education format? Why was it used for this course?

(more…)
