Archive for the ‘Data Science’ Category

Is It All About The Data Scientist?

Mona Patel

Senior Manager, Big Data Solutions Marketing at EMC
Mona Patel is a Senior Manager for Big Data Marketing at EMC Corporation. With over 15 years of working with data at The Department of Water and Power, Air Touch Communications, Oracle, and MicroStrategy, Mona decided to grow her career at EMC, a leader in Big Data.

The answer is no. It is a holistic, team effort that involves expanding the mind and skill set of executives, business users, IT implementers, data scientists, and application developers to all work collectively to define a strategy and derive newer insight from big data.

And that is why EMC is so heavily focused on breaking down organizational silos and training professionals to become data scientists or at least think like data scientists, transforming these individuals into data savvy professionals working towards the same goal – competitive advantage.

I spoke to Louis Frolio, Advisory Technical Ed Consultant for EMC Big Data Solutions, how as part of a team in EMC Education Services is creating a massive professional transformation through a MOOC – Massive Open Online Course. Data Lakes for Big Data MOOC gives you an opportunity to become a data savvy professional and take on a big data or data science role in your organization at absolutely no cost.

The course kicked off May 11, but you still have plenty of time to enroll and complete the course to earn a certificate before June 8. The top 500 students (based on cumulative grade for the MOOC) will receive an electronic copy of the Data Science book just released by EMC Education Services.

1.  What is a MOOC and what is the goal of this education format? Why was it used for this course?

(more…)

Want To Build A Data Science Team?

Mona Patel

Senior Manager, Big Data Solutions Marketing at EMC
Mona Patel is a Senior Manager for Big Data Marketing at EMC Corporation. With over 15 years of working with data at The Department of Water and Power, Air Touch Communications, Oracle, and MicroStrategy, Mona decided to grow her career at EMC, a leader in Big Data.

EMC Offers a Holistic Approach to data science. Many of our customers invest in big data solutions to target their sales prospects better, explore advanced medical research, and make their internal processes more efficient. The biggest obstacle to getting these initiatives out of the gate is the shortage of big data skills within their own firms and across the industry.

To address this skills gap, EMC has developed a thorough data science and big data analytics curriculum for our customers. EMC was one of the first companies to offer data science education with rigorous, live instruction using free and open source tools. As of today, more than 10,000 customers, partners, and college students have attended the training.

data_science_book_top_banner_image_973x300

I spoke with EMC’s David Dietrich, who leads this unique program to discuss his approach to data science education, which differs from more traditional product-oriented education. What I found most interesting is that in addition to David’s work at EMC, he has also helped design big data analytics curricula for Babson College and other universities.  More recently,  David has published a book, Data Science and Big Data Analytics, to help further develop data science skills and expertise in the industry.

1.  Why is EMC pushing so hard to educate and develop data scientists?

As an information company, we’re extremely attuned to the value of big data, which is exploding in both the sheer amount and how organizations in virtually every field and industry are using it to solve critical problems. When EMC acquired our first big data company, Greenplum, several years ago, we quickly became aware that there was a shortage of people who had the data science and business skills to help companies utilize big data.

2.  How is EMC taking a holistic approach to data science education?

We recognize that learning how to use big data technology alone does not ensure success. Senior management must make sure that appropriate people and processes are in place to drive the change and innovation necessary for valuable big data results to occur. To help companies on their journey, we offer courses for data scientists, who execute big data projects, and business executives who sponsor, run and manage them.

Our goal is to educate all levels of an organization so that data scientists and business people understand one another. That way, the organization is able to roll out big data projects with greater adoption and success. In addition to offering courses to our customers, we also work closely with universities and educational institutions to help them develop their own curriculum and programs.

3.  Please describe some of the important skills for aspiring data scientists.

Working in strategy and analytics for the past 20 years, I’ve always been drawn to experimenting with data to solve problems, which is exactly is the mindset you need to tackle big data. Companies often ask me how to go about using massive amounts of structured and unstructured data to solve business problems. How do they know what to choose and ignore? How do they know what algorithms to apply? Our courses encourage a culture of experimentation that leads to answering these questions. We teach our students how to test an idea with data, measure it quantitatively, learn from it and iterate. This test and learn mindset is critical to becoming a talented data scientist and data-driven organization.

4.  What are some of the challenges with evolving into a data-driven organization?

There can be a substantial divide between data scientists and business people who manage and work with them on big data projects. Many business people lack the technical background to understand how the algorithms apply to the problem and how to test ideas with data. And some data scientists may not understand the business context. We’re trying to educate each side so they can get a clearer picture and drive toward common goals. Once you bridge that gap, you can start driving real change, and solving old problems with big data or new information sources that were once unusable.

5.  What should companies expect after they have successfully made the leap to big data?

We’re educating them in how to train and staff a big data team, as well as build processes to be effective and successful. With this approach, companies can more effectively define the business problem, acquire the right data sets, experiment, communicate the results, and finally, operationalize the new processes.

OpenChorus Project: The Dawn of The Data Science Movement

Mona Patel

Senior Manager, Big Data Solutions Marketing at EMC
Mona Patel is a Senior Manager for Big Data Marketing at EMC Corporation. With over 15 years of working with data at The Department of Water and Power, Air Touch Communications, Oracle, and MicroStrategy, Mona decided to grow her career at EMC, a leader in Big Data.

OpenChorus Project is the first real attempt to help companies succeed with Big Data. How? We all know that the barrier to success has been a lack of available data science talent and the tools needed to address Big Data analytic challenges. Open sourcing Greenplum Chorus is an attempt to rapidly grow the data science community by giving them a rich analytic platform to easily gain insight, grow and share their skills, and ultimately deliver value with Big Data projects.

Partners, startups, and even individual developers can download the source code and deliver new Chorus-integrated Big Data applications and tools needed for the diverse requirements across industries and business functions. For example, the release of Greenplum Chorus 2.2 at the end of this quarter will include valuable contributions from partners Gnip, Tableau, and Kaggle, enabling Data Scientists to correlate Twitter data into their analysis, leverage advanced Tableau visualizations, and gain access to Kaggle expert Data Scientists.
Check out the interview with Logan Lee, Director of Product Management at Greenplum, about the company’s reasons for releasing the Chorus code and the types of contributions that are expected to create a much needed Data Science movement.

Want To Become A Data Scientist? EMC Can Train You in 5 Days.

Mona Patel

Senior Manager, Big Data Solutions Marketing at EMC
Mona Patel is a Senior Manager for Big Data Marketing at EMC Corporation. With over 15 years of working with data at The Department of Water and Power, Air Touch Communications, Oracle, and MicroStrategy, Mona decided to grow her career at EMC, a leader in Big Data.

Everyone agrees that there is a shortage of Data Scientists. If not addressed soon, Big Data breakthroughs in areas such as healthcare, renewable energy, public sector, etc will decelerate.  I am proud to say that EMC is doing its part to solve the problem by fostering Data Science development with training and certification, hands on expertiseweb events, internships, and more.  For example, EMC Education Services offers a 5-day Data Science and Big Data Analytics  training and certification,  designed to enable immediate and effective participation in big data and other analytics projects.

Data Scientist course outline

As a Big Data citizen, I want to motivate those thinking about moving into the world of Data Science, to take action and get trained. I met with Barry Heller, a developer for EMC’s Data Science curriculum, who leverages his extensive education and past experience as an EMC Data Scientist for curriculum development.  If Barry’s story resonates and you relate in some way, I hope it inspires you to start a career in Data Science.

1) How many people have completed the EMC Data Science and Big Data Analytics training since its creation early this year?

(more…)

Follow Dell EMC

Dell EMC Big Data Portfolio

See how the Dell EMC Big Data Portfolio can make a difference for your analytics journey

Subscribe to Blog via Email

Enter your email address to subscribe to this blog and receive notifications of new posts by email.

Dell EMC Community Network

Participate in the Everything Big Data technical community

Follow us on Twitter