EMC World 2016 – Big Data and Analytics Sessions

Erin K. Banks

Portfolio Marketing Director at EMC
Erin K. Banks has been in the IT industry for almost 20 years. She is the Portfolio Marketing Director for Big Data at EMC. Previously she worked at Juniper Networks in Technical Marketing for the Security Business Unit. She has also worked at VMware and EMC as an SE in the Federal Division, focused on Virtualization and Security. She holds both CISSP and CISA accreditations. Erin has a BS in Electrical Engineering and is an author, blogger, and avid runner. You can find her on social media at @Banksek

EMC World 2016 is only a few weeks away and I am really excited this year because this is the first time I have had an opportunity to pick sessions and work with business units on the message and I have no booth duty. I can’t tell you the last time I got to attend a conference and didn’t have booth duty. This is also exciting for me because I led the Modern Analytics and Big Data IT Leadership track and I feel that we have a great selection of sessions. We also have some great Data Lake Technology tracks with focus on the infrastructure that is available from EMC to assist you in getting faster and richer insights from your data. There are 10 IT Leadership tracks and 35 Technology tracks focused on Big Data and the analytics required to make your data valuable. 
EMC World 2016

 

All the Modern Analytics and Big Data IT Leadership tracks will be in Venetian C and here is the schedule of the 10 sessions:

 

Monday May 2, 2016

8:30 – 9:30 AM:          EMC Big Data Portfolio and How It Impacts Analytics

12:00 – 1:00 PM:         Think Like a Data Scientist

1:30 – 2:30 PM:           Big Data Patterns For The Internet of Things

4:30 – 5:30 PM:           NoSQL and Modern Analytics

 

Tuesday May 3, 2016

8:30 – 9:30 AM:          Future Smart Cities Today

12:00 – 1:00 PM:         Modern Analytics for Security: Security the Smart Grid and IoT

1:30 – 2:30 PM:           Big Data, Big Deal: How to Revolutionize Product and Service Experiences to Unlock Customer Value

 

Wednesday May 4, 2016

8:30 – 9:30 AM:          IoT@EMC: Using the Internet of Things as a Digital Transformation Catalyst

12:00 – 1:00 PM:         Modern Data Analytics

1:30 – 2:30 PM:           The Connected Consumer: Real-Time Customer 360

If you are interested in getting more information on any of these session, check out the session catalog that includes all the session at EMC World 2016. We look forward to see you there!

Hadoop Summit 2016 – Dublin, Ireland

Erin K. Banks

Portfolio Marketing Director at EMC
Erin K. Banks has been in the IT industry for almost 20 years. She is the Portfolio Marketing Director for Big Data at EMC. Previously she worked at Juniper Networks in Technical Marketing for the Security Business Unit. She has also worked at VMware and EMC as an SE in the Federal Division, focused on Virtualization and Security. She holds both CISSP and CISA accreditations. Erin has a BS in Electrical Engineering and is an author, blogger, and avid runner. You can find her on social media at @Banksek

The conference season is upon us and this time we will be traveling to beautiful Dublin, Ireland for Hadoop Summit 2016 on April 13th and 14th, 2016. Of course we are thrilled to be a Diamond Sponsor and we hope that you will stop by Booth #301 to talk to us about your Big Data and Analytics journey and how our Big Data Portfolio can help you along the way. We have great information to share with you about a Data Lake from EMC and why we feel that it is essential to the foundation for your analytics ecosystem. Our capabilities don’t end at the Data Lake though, we continue the Big Data Journey by utilizing Data Lake Extensions, Big Data Systems, and Big Data Solutions. Stop by to find out more about the capabilities in our portfolio as well as the Global Services we can provide to assist you with your Big Data use cases. Hadoop summit booth

We wanted to provide you with an overview of the sessions we will be speaking at over the two days

General Session: April 13th @ 0945 – 0955

Speaker: Carey James, Director Business Development, EMC Big Data Solutions

Abstract: Gaining Richer Insights and Business Outcomes with the EMC Big Data Portfolio

A recent study by IDC suggests that world data growth is expanding at a 5x rate, and will lead to as much as 40,000 exabytes by 2020. With data generation at such enormous rates, it is becoming essential that our interaction and understanding of these sets must bring out the value in a way that reveals rich insights and truly enhances business outcomes.  However, faced with obstacles from ingesting and indexing the right data from multiple sources to not being able to retain data long enough, and waiting on IT to spin up sufficient resources, businesses can struggle in their Big Data initiatives early on. EMC knows these challenges first-hand as we have been through them ourselves. From this experience, we have learned two things: the power of data is a game changer and the power of infrastructure is essential to discovering actionable insights. In this session, we will review the EMC Big Data Portfolio and related services that will help get your own initiatives off the ground– no matter where you are on your Big Data journey.

Breakout Session: April 13th @ 1220 – 1300 in Liffey Hall 1

Title: Tame that Beast: How to bring Operations, Governance, and Reliability to Hadoop

Speaker: Dr. Stefan Radtke, CTO EMEA, Emerging Technologies Division

Abstract: Many companies have created extremely powerful Hadoop use cases with highly valuable outcomes. The diverse adoption and application of Hadoop is producing an extremely robust ecosystem. However, teams often create silos around their Hadoop, forgetting some of the hard-learned lessons IT has gained over the years. One often overlooked feature is governance.

Does your company have good KPIs and measurements around what gets loaded into Hadoop? Do you have a good taxonomy and metadata tool? As your business grows, are you able to support 99.99% operations that your Hadoop instance can support? If your primary data center goes down, can you replicate models and data into another facility? As the prevalence of Hadoop usage grows, these questions are becoming increasingly common—and urgent.

Breakout Session: April 14th @ 1500 – 1540 in Wicklow Hall 2B

Title: Hadoop Everywhere: Geo-Distributed Storage for Big Data

Speakers: Nikhil Joshi, Consultant Product Manager and Vishrut Shah, Director of Engineering

Abstract: Traditionally, HDFS provides robust protection against disk failures, node failures and rack failures. The mechanisms to protect data against entire datacenter failures and outages leave much to be desired. Neither the storage substrate (HDFS), nor the applications on top (MapReduce, Hive, HBase…etc) are capable of running across geographies/data-centers. With Hadoop’s increased enterprise adoption, there is greater need to protect business critical datasets in Hadoop clusters. This is motivated in large part by compliance, regulation, data protection and business continuity planning. ‘distcp’, which has been the foundation for most Hadoop vendor backup and recovery solutions, just doesn’t cut it when strong consistency is required or when there are more than 2 sites. Cloud-native applications (especially in IoT scenarios) generate humongous amounts of data all across the globe. There is a need for global storage infrastructure to reason over this corpus of data. It’s time for Hadoop storage to break out of its single datacenter confines. In this talk, we will discuss the challenges, approaches and architectures to take Hadoop storage global! *Topics Covered*: Hadoop Compatible Filesystems (HCFS), Geo-distribution of data, Disaster Recovery, Storage Overhead, Strong Consistency, Multi-protocol Data Access, Shared Storage Architectures

Follow us on @EMCbigdata and get social with us on twitter by using #EMC #HS16Dublin

Strata+Hadoop San Jose 2016

Erin K. Banks

Portfolio Marketing Director at EMC
Erin K. Banks has been in the IT industry for almost 20 years. She is the Portfolio Marketing Director for Big Data at EMC. Previously she worked at Juniper Networks in Technical Marketing for the Security Business Unit. She has also worked at VMware and EMC as an SE in the Federal Division, focused on Virtualization and Security. She holds both CISSP and CISA accreditations. Erin has a BS in Electrical Engineering and is an author, blogger, and avid runner. You can find her on social media at @Banksek

We are happy to announce that we will be at Strata+Hadoop San Jose 2016 next week (March 29 – 31) at the San Jose Convention Center. We will be in booth #1431elephant phone holder and have theater presentations covering our entire Big Data Portfolio and discussing how we can help you on your Big Data analytics journey. These adorable EMC elephant phone holder booth giveaways will be there too. Make sure you stop by, say hi, and attend one of our presentations or just talk to us about your analytics environments and how you want to get the most value out of your data because we can help you with that.

We also have two sessions during the conference:

Tame That Beast: How to Bring Operations, Governance, and Reliability to Hadoop from Keith Manthey (@KeithManthey). This session is on Wednesday at 1:50 – 2:30 PM in 210 B/F

and

Developing a Big Data Business Strategy from Bill Schmarzo (@schmarzo). This session is Wednesday at 11:00 – 11:40 AM in LL21 E/F

We look forward to seeing you there!Strata Rendering

By the way… here is a picture of our booth so you know what to look for when you visit.

 

Big Data Conversation with Steve Jones from Capgemini

Erin K. Banks

Portfolio Marketing Director at EMC
Erin K. Banks has been in the IT industry for almost 20 years. She is the Portfolio Marketing Director for Big Data at EMC. Previously she worked at Juniper Networks in Technical Marketing for the Security Business Unit. She has also worked at VMware and EMC as an SE in the Federal Division, focused on Virtualization and Security. She holds both CISSP and CISA accreditations. Erin has a BS in Electrical Engineering and is an author, blogger, and avid runner. You can find her on social media at @Banksek

Last year at a San Jose Earthquakes event, I met Steve Jones, Global VP for Big Data and Analytics at Capgemini:   “Is it football, is it soccer, will that discussion ever end?”Big Data Concept

In any case, we had a very lively discussion regarding the sport and our individual visions of the world. Steve is honest, fair, and a great conversationalist. I knew that I wanted to pick his brain further regarding his thoughts around Big Data and share them with you.

Below is the Big Data conversation that I had with Steve. I hoped his answers would provide greater insight into Big Data and how he sees its impact on business. And I was not disappointed. I hope you will enjoy his thoughts just as much as I do.

 

EKB: How does your company define Big Data?

SJ: It’s not really about “Big” or “Fast” it’s really about a shift away from single schema data approaches towards ‘schema on need’ and the integration of insight to the point of action.  Volumes are one challenge, but the real challenge is the mental shift away from data warehouses toward more flexible insight-driven solutions.

EKB: Why did you make the decision to focus on Big Data?

SJ: Capgemini was one of the first big systems integrators to look at what Big Data really meant operationally.  Back in 2011 we were already talking about how companies need to change the way they looked at governing data in a Big Data world.  We made the focus back then to really concentrate on driving the transformation towards this new world as we saw it as crucial to helping our clients deliver long term value.

EKB: What impact is the change to Big Data having for your clients?

SJ: The first impact is on how we deliver information projects, being able to use multiple different types of analytical engines on the same sets of data means we are able to solve more problems without requiring a new technology stack and another data silo.  The other is in terms of being able to deliver specific pieces of insight much faster.  What new technologies are doing is enabling the sorts of agile and DevOps practices that have become the normal way of working for application development to be applied to information projects.

EKB: How do you see the Big Data market changing in the future?

SJ: The big shift we are seeing now is the rise of fast data with technologies like Spark, Storm and grid databases such as Gemfire, the combination of Big and Fast helps companies to actually react as something is happening and even anticipate it in advance.  This means that insights are being integrated directly back into operational processes and having to work at application, rather than traditional BI, speeds.

EKB: What is the biggest myth about Big Data?

SJ: That there isn’t any governance and that it’s about replacing a data warehouse.  The reality is that today very little information decisions are driven from a data warehouse, Excel, systems information and local data marts drive most decisions.  The reality is that taking proper control of your data requires you to recognize the full landscape.

EKB: What business questions has Big Data helped you answer?

SJ: Big Data has helped us answer a huge range of questions for clients, some that couldn’t have been done before due to technology or cost.  But really its helped us answer a much more important general question “How can I get new insights faster?”, specific insights are great, but a new way of working that delivers continual value is better.

EKB: What is advice you would give to someone embarking on a Big Data project?

SJ:

1)Start with the view that you are going to replace the substrate on your entire data landscape.

2) Think about governance not as about quality but as about enabling collaboration

3) Minimize your number of different technologies. ­The lesson of Google, Facebook and Amazon is that you don’t need a huge variety of technologies that do the same thing, you need a few things that each do one thing well.

You can follow more of what Steve Jones is thinking about Big Data by following him on twitter @mosesjones.

Follow EMC

Subscribe to Blog via Email

Enter your email address to subscribe to this blog and receive notifications of new posts by email.

EMC Community Network

Participate in the Everything Big Data technical community

Follow @emcbigdata

RT @CHILLEVENTS2: Join us at booth 703 we are just monkeying around spreading the word about Hadoop Analytics #EMC #HS16SJ https://t.co/BIW… about 19 hours ago