Archive for the ‘Hadoop’ Category

EMC at Hadoop Summit San Jose 2016

Erin K. Banks

Portfolio Marketing Director at Dell EMC
Erin K. Banks has been in the IT industry for almost 20 years. She is the Portfolio Marketing Director for Big Data and Data Analytics at Dell EMC. Previously she worked at Juniper Networks in Technical Marketing for the Security Business Unit. She has also worked at VMware and EMC as an SE in the Federal Division, focused on Virtualization and Security. She holds both CISSP and CISA accreditations. Erin has a BS in Electrical Engineering and is an author, blogger, and avid runner. You can find her on social media at @banksek

Once again another Hadoop Summit conference is upon us and this time it is in San Jose and of course we will be attending. We will be in Booth # 703 and we have several sessions, Including a keynote that we hope you will be able to attend. Here is a quick snapshot of the sessions

Hadoop Summit San Jose

Navigating the World of User Data Management and Data Discovery

on June 29th at 11:30 AM

 

Building a Data Analytics PaaS for Smart Cities

on June 29th at 4:10 PM

 

Gaining Richer Insights and Business Outcomes with the EMC Big Data Portfolio (Keynote) on June 30th at 10:15 AM

A recent study by IDC suggests that world data growth is expanding at a 5x rate, and will lead to as much as 40,000 exabytes by 2020. With data generation at such enormous rates, it is becoming essential that our interaction and understanding of these sets must bring out the value in a way that reveals rich insights and truly enhances business outcomes.  However, faced with obstacles from ingesting and indexing the right data from multiple sources, to not being able to retain data long enough, and waiting on IT to spin up sufficient resources, businesses can struggle in their Big Data initiatives early on. EMC knows these challenges first-hand as we have been through them ourselves. From this experience, we have learned two things: the power of data is a game changer and the power of infrastructure is essential to discovering actionable insights. In this session, we will review the EMC Big Data Portfolio and related services that will help get your own initiatives off the ground– no matter where you are on your Big Data journey.

 

We also have a 40-minute technical session on June 28th at 2:50 PM in Liffy Hall 1

Increasing Hadoop Resiliency and Performance with EMC Isilon

Isilon is a core platform for EMC Big Data Solutions that improves Hadoop Resiliency, Operations and Performance.  There has been limited analysis on the impact failures (disk, compute, namenode, datanode) have on Hadoop Performance and data driven operations.  Even if you are not an EMC customer today, this lecture will provide valuable insights on why such failures affect Hadoop’s performance and operations and how EMC Isilon better addresses these issues to limit impact of various failure scenarios.

We look forward to seeing you there and don’t forget to tweet with hashtag #HS16SJ

Strata+Hadoop London 2016

Erin K. Banks

Portfolio Marketing Director at Dell EMC
Erin K. Banks has been in the IT industry for almost 20 years. She is the Portfolio Marketing Director for Big Data and Data Analytics at Dell EMC. Previously she worked at Juniper Networks in Technical Marketing for the Security Business Unit. She has also worked at VMware and EMC as an SE in the Federal Division, focused on Virtualization and Security. She holds both CISSP and CISA accreditations. Erin has a BS in Electrical Engineering and is an author, blogger, and avid runner. You can find her on social media at @banksek

London here we come… starting June 1 through June 3, we will be at the SOLD OUT Strata+Hadoop World at ExCel LondonSHW london banner

We will be in stand #408 over the three days and on Tuesday, June 2 at 1455, Sebastian Darrington, CTO for Big Data and Analytics in EMEA will be presenting “Developing a Successful Big Data Strategy”. This session will take place in Capital Suite 7 and here is an abstract for the session:

Many businesses have undertaken Big Data projects, but for every successful project, there are dozens that have failed or have stagnated. Customers are realizing they need to understand both the business outcomes and technology in order to target their goals more accurately. They need to tackle the challenges of Hadoop at scale, the maturity of the organization, and how it affects their strategy. Whether you are an innovator, a late adopter, or somewhere in between, join us as we explore the reasons why such projects hit obstacles, what typical challenges are experienced, and how to overcome them along your own Big Data journey.SHW London booth

If you want more information on this conference, check out our ECN page for this conference

 

 

Hadoop Summit 2016 – Dublin, Ireland

Erin K. Banks

Portfolio Marketing Director at Dell EMC
Erin K. Banks has been in the IT industry for almost 20 years. She is the Portfolio Marketing Director for Big Data and Data Analytics at Dell EMC. Previously she worked at Juniper Networks in Technical Marketing for the Security Business Unit. She has also worked at VMware and EMC as an SE in the Federal Division, focused on Virtualization and Security. She holds both CISSP and CISA accreditations. Erin has a BS in Electrical Engineering and is an author, blogger, and avid runner. You can find her on social media at @banksek

The conference season is upon us and this time we will be traveling to beautiful Dublin, Ireland for Hadoop Summit 2016 on April 13th and 14th, 2016. Of course we are thrilled to be a Diamond Sponsor and we hope that you will stop by Booth #301 to talk to us about your Big Data and Analytics journey and how our Big Data Portfolio can help you along the way. We have great information to share with you about a Data Lake from EMC and why we feel that it is essential to the foundation for your analytics ecosystem. Our capabilities don’t end at the Data Lake though, we continue the Big Data Journey by utilizing Data Lake Extensions, Big Data Systems, and Big Data Solutions. Stop by to find out more about the capabilities in our portfolio as well as the Global Services we can provide to assist you with your Big Data use cases. Hadoop summit booth

We wanted to provide you with an overview of the sessions we will be speaking at over the two days

General Session: April 13th @ 0945 – 0955

Speaker: Carey James, Director Business Development, EMC Big Data Solutions

Abstract: Gaining Richer Insights and Business Outcomes with the EMC Big Data Portfolio

A recent study by IDC suggests that world data growth is expanding at a 5x rate, and will lead to as much as 40,000 exabytes by 2020. With data generation at such enormous rates, it is becoming essential that our interaction and understanding of these sets must bring out the value in a way that reveals rich insights and truly enhances business outcomes.  However, faced with obstacles from ingesting and indexing the right data from multiple sources to not being able to retain data long enough, and waiting on IT to spin up sufficient resources, businesses can struggle in their Big Data initiatives early on. EMC knows these challenges first-hand as we have been through them ourselves. From this experience, we have learned two things: the power of data is a game changer and the power of infrastructure is essential to discovering actionable insights. In this session, we will review the EMC Big Data Portfolio and related services that will help get your own initiatives off the ground– no matter where you are on your Big Data journey.

Breakout Session: April 13th @ 1220 – 1300 in Liffey Hall 1

Title: Tame that Beast: How to bring Operations, Governance, and Reliability to Hadoop

Speaker: Dr. Stefan Radtke, CTO EMEA, Emerging Technologies Division

Abstract: Many companies have created extremely powerful Hadoop use cases with highly valuable outcomes. The diverse adoption and application of Hadoop is producing an extremely robust ecosystem. However, teams often create silos around their Hadoop, forgetting some of the hard-learned lessons IT has gained over the years. One often overlooked feature is governance.

Does your company have good KPIs and measurements around what gets loaded into Hadoop? Do you have a good taxonomy and metadata tool? As your business grows, are you able to support 99.99% operations that your Hadoop instance can support? If your primary data center goes down, can you replicate models and data into another facility? As the prevalence of Hadoop usage grows, these questions are becoming increasingly common—and urgent.

Breakout Session: April 14th @ 1500 – 1540 in Wicklow Hall 2B

Title: Hadoop Everywhere: Geo-Distributed Storage for Big Data

Speakers: Nikhil Joshi, Consultant Product Manager and Vishrut Shah, Director of Engineering

Abstract: Traditionally, HDFS provides robust protection against disk failures, node failures and rack failures. The mechanisms to protect data against entire datacenter failures and outages leave much to be desired. Neither the storage substrate (HDFS), nor the applications on top (MapReduce, Hive, HBase…etc) are capable of running across geographies/data-centers. With Hadoop’s increased enterprise adoption, there is greater need to protect business critical datasets in Hadoop clusters. This is motivated in large part by compliance, regulation, data protection and business continuity planning. ‘distcp’, which has been the foundation for most Hadoop vendor backup and recovery solutions, just doesn’t cut it when strong consistency is required or when there are more than 2 sites. Cloud-native applications (especially in IoT scenarios) generate humongous amounts of data all across the globe. There is a need for global storage infrastructure to reason over this corpus of data. It’s time for Hadoop storage to break out of its single datacenter confines. In this talk, we will discuss the challenges, approaches and architectures to take Hadoop storage global! *Topics Covered*: Hadoop Compatible Filesystems (HCFS), Geo-distribution of data, Disaster Recovery, Storage Overhead, Strong Consistency, Multi-protocol Data Access, Shared Storage Architectures

Follow us on @EMCbigdata and get social with us on twitter by using #EMC #HS16Dublin

Strata+Hadoop San Jose 2016

Erin K. Banks

Portfolio Marketing Director at Dell EMC
Erin K. Banks has been in the IT industry for almost 20 years. She is the Portfolio Marketing Director for Big Data and Data Analytics at Dell EMC. Previously she worked at Juniper Networks in Technical Marketing for the Security Business Unit. She has also worked at VMware and EMC as an SE in the Federal Division, focused on Virtualization and Security. She holds both CISSP and CISA accreditations. Erin has a BS in Electrical Engineering and is an author, blogger, and avid runner. You can find her on social media at @banksek

We are happy to announce that we will be at Strata+Hadoop San Jose 2016 next week (March 29 – 31) at the San Jose Convention Center. We will be in booth #1431elephant phone holder and have theater presentations covering our entire Big Data Portfolio and discussing how we can help you on your Big Data analytics journey. These adorable EMC elephant phone holder booth giveaways will be there too. Make sure you stop by, say hi, and attend one of our presentations or just talk to us about your analytics environments and how you want to get the most value out of your data because we can help you with that.

We also have two sessions during the conference:

Tame That Beast: How to Bring Operations, Governance, and Reliability to Hadoop from Keith Manthey (@KeithManthey). This session is on Wednesday at 1:50 – 2:30 PM in 210 B/F

and

Developing a Big Data Business Strategy from Bill Schmarzo (@schmarzo). This session is Wednesday at 11:00 – 11:40 AM in LL21 E/F

We look forward to seeing you there!Strata Rendering

By the way… here is a picture of our booth so you know what to look for when you visit.

 

Follow Dell EMC

Dell EMC Big Data Portfolio

See how the Dell EMC Big Data Portfolio can make a difference for your analytics journey

Subscribe to Blog via Email

Enter your email address to subscribe to this blog and receive notifications of new posts by email.

Dell EMC Community Network

Participate in the Everything Big Data technical community

Follow us on Twitter