Archive for the ‘Big Data’ Category

Big Data Conversation with Splunk

Erin K. Banks

Portfolio Marketing Director at EMC
Erin K. Banks has been in the IT industry for almost 20 years. She is the Portfolio Marketing Director for Big Data at EMC. Previously she worked at Juniper Networks in Technical Marketing for the Security Business Unit. She has also worked at VMware and EMC as an SE in the Federal Division, focused on Virtualization and Security. She holds both CISSP and CISA accreditations. Erin has a BS in Electrical Engineering and is an author, blogger, and avid runner. You can find her on social media at @Banksek

I had the opportunity to talk with Jon Rooney, Senior Director, IT Solutions Marketing, from Splunk a couple of weeks ago. It was a great chance for me to learn more about Splunk, and of course I had to ask him for his thoughts on Big Data. He was kind enough to allow our conversation to be a part of my Big Data Conversation series.

A little background about Splunk through Jon’s voice… Splunk helps you make sense of machine data, and machine data is the largest and fastest growing component of Big Data. The most underused data is the massive amount generated by applications, devices, servers, and network endpoints, and it is often underused because of how difficult it can be to capture, store, and analyze using outdated tools. Our Big Data story is about real-time machine data. We keep your systems up and running and we keep you more secure.

 

EB: How does your company define Big Data?

JR: Splunk wouldn’t define it differently than anyone else. We believe that the jumping off point to Big Data is volume, velocity, and variety: all the data that is too unwieldy to put into traditional databases and is difficult to keep up with. The business press would discuss Big Data with Hadoop and state it was all about dumping together all your e-commerce and company transactions, and developing sentiment analysis about what people wrote on Twitter and in product reviews. This is the human generated part of Big Data, but the machine generated part of Big Data is actually the bigger portion and the harder one to manage at scale. If you look at what people were doing with that data, like pattern recognition, you can do that through batch, but we focus on real-time data. Yes, we have that historical piece, but it is much more valuable to do the analysis while it is happening than to do it post-mortem, which is the traditional way of doing it.

 

EB: Do you feel the majority of organizations associate Big Data with Hadoop?

JR: I don’t think our customers do, but the broader business and tech media, over the past 6 to 8 years, have used examples like “how does Amazon know what to recommend to you” and “how does the CDC know that there are flu infections based on what they see on Twitter”. Those are the examples that ground Big Data, versus “how do you look at millions of transactions through an API endpoint to see response time”. These are also Big Data examples, and they are what Splunk does.
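
To make Jon’s API example a little more concrete, here is a minimal Python sketch of watching response times as events arrive instead of after the fact. This is not how Splunk works internally; the `RollingLatency` class, the `slow_endpoints` function, the 500 ms threshold, and the window size are all hypothetical names and values I picked for illustration.

```python
from collections import deque
from statistics import quantiles


class RollingLatency:
    """Keep the most recent response times and report a percentile on demand."""

    def __init__(self, window_size=1000):
        # A deque with maxlen drops the oldest sample automatically.
        self.samples = deque(maxlen=window_size)

    def record(self, response_ms):
        self.samples.append(response_ms)

    def p95(self):
        # Guard against too little data to compute a percentile.
        if len(self.samples) < 2:
            return None
        # n=20 yields 19 cut points; the last one is the 95th percentile.
        # method="inclusive" never extrapolates beyond the observed values.
        return quantiles(self.samples, n=20, method="inclusive")[-1]


def slow_endpoints(events, threshold_ms=500.0, window_size=1000):
    """Yield (endpoint, p95) whenever an endpoint's rolling p95 exceeds the threshold.

    `events` is any iterable of (endpoint, response_ms) pairs (assumed format).
    """
    trackers = {}
    for endpoint, response_ms in events:
        tracker = trackers.setdefault(endpoint, RollingLatency(window_size))
        tracker.record(response_ms)
        p95 = tracker.p95()
        if p95 is not None and p95 > threshold_ms:
            yield endpoint, p95
```

Because `slow_endpoints` is a generator, it can consume an unbounded stream and flag a slow endpoint as soon as the rolling window shows it, which is the "while it is happening" idea rather than a batch job run afterwards.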

 

EB: How do you see Big Data changing in the future?

JR: People over time, as it becomes normalized, will see the scale of “big” change. The goalpost on what “big” means will move. People will remove the requirement that it is Big Data if you can’t cleanly fit it into a relational database. Right now if you have to put it into a NoSQL database, it is Big Data, but that is not necessarily true. Right now there is a tight coupling between NoSQL databases and Big Data, and I think that will change just as architectures change. You need to choose the solution because it fits your architecture better and not because it handles petabytes of data. It now becomes another storage strategy that isn’t solely driven by volume, velocity, and variety. There are other architectural considerations that can help you make a decision.

 

EB: What is the biggest myth about Big Data?

JR: There are a lot. One of them is that not many people have figured it out and that there are only a handful of businesses that are driven by Big Data. There is also the myth where people overestimate the sophistication of the analysis done in Big Data. Everyone thinks that everyone is doing what Amazon is doing, when instead people are doing simple correlations.

EMC World 2016 – Big Data and Analytics Sessions

Erin K. Banks

EMC World 2016 is only a few weeks away, and I am really excited this year: it is the first time I have had the opportunity to pick sessions and work with the business units on the message, and I have no booth duty. I can’t tell you the last time I got to attend a conference without booth duty. It is also exciting for me because I led the Modern Analytics and Big Data IT Leadership track, and I feel that we have a great selection of sessions. We also have some great Data Lake Technology tracks focused on the infrastructure that is available from EMC to help you get faster and richer insights from your data. There are 10 IT Leadership tracks and 35 Technology tracks focused on Big Data and the analytics required to make your data valuable.

 

All the Modern Analytics and Big Data IT Leadership tracks will be in Venetian C and here is the schedule of the 10 sessions:

 

Monday May 2, 2016

8:30 – 9:30 AM:          EMC Big Data Portfolio and How It Impacts Analytics

12:00 – 1:00 PM:         Think Like a Data Scientist

1:30 – 2:30 PM:           Big Data Patterns For The Internet of Things

4:30 – 5:30 PM:           NoSQL and Modern Analytics

 

Tuesday May 3, 2016

8:30 – 9:30 AM:          Future Smart Cities Today

12:00 – 1:00 PM:         Modern Analytics for Security: Securing the Smart Grid and IoT

1:30 – 2:30 PM:           Big Data, Big Deal: How to Revolutionize Product and Service Experiences to Unlock Customer Value

 

Wednesday May 4, 2016

8:30 – 9:30 AM:          IoT@EMC: Using the Internet of Things as a Digital Transformation Catalyst

12:00 – 1:00 PM:         Modern Data Analytics

1:30 – 2:30 PM:           The Connected Consumer: Real-Time Customer 360

If you are interested in getting more information on any of these sessions, check out the session catalog, which includes all the sessions at EMC World 2016. We look forward to seeing you there!

Hadoop Summit 2016 – Dublin, Ireland

Erin K. Banks

The conference season is upon us, and this time we will be traveling to beautiful Dublin, Ireland for Hadoop Summit 2016 on April 13th and 14th, 2016. Of course we are thrilled to be a Diamond Sponsor, and we hope that you will stop by Booth #301 to talk to us about your Big Data and Analytics journey and how our Big Data Portfolio can help you along the way. We have great information to share with you about a Data Lake from EMC and why we feel that it is an essential foundation for your analytics ecosystem. Our capabilities don’t end at the Data Lake, though; we continue the Big Data journey with Data Lake Extensions, Big Data Systems, and Big Data Solutions. Stop by to find out more about the capabilities in our portfolio as well as the Global Services we can provide to assist you with your Big Data use cases.

We wanted to provide you with an overview of the sessions we will be speaking at over the two days:

General Session: April 13th @ 0945 – 0955

Speaker: Carey James, Director Business Development, EMC Big Data Solutions

Abstract: Gaining Richer Insights and Business Outcomes with the EMC Big Data Portfolio

A recent study by IDC suggests that world data growth is expanding at a 5x rate, and will lead to as much as 40,000 exabytes by 2020. With data generation at such enormous rates, it is becoming essential that our interaction and understanding of these sets must bring out the value in a way that reveals rich insights and truly enhances business outcomes. However, faced with obstacles from ingesting and indexing the right data from multiple sources to not being able to retain data long enough, and waiting on IT to spin up sufficient resources, businesses can struggle in their Big Data initiatives early on. EMC knows these challenges first-hand as we have been through them ourselves. From this experience, we have learned two things: the power of data is a game changer and the power of infrastructure is essential to discovering actionable insights. In this session, we will review the EMC Big Data Portfolio and related services that will help get your own initiatives off the ground, no matter where you are on your Big Data journey.

Breakout Session: April 13th @ 1220 – 1300 in Liffey Hall 1

Title: Tame that Beast: How to bring Operations, Governance, and Reliability to Hadoop

Speaker: Dr. Stefan Radtke, CTO EMEA, Emerging Technologies Division

Abstract: Many companies have created extremely powerful Hadoop use cases with highly valuable outcomes. The diverse adoption and application of Hadoop is producing an extremely robust ecosystem. However, teams often create silos around their Hadoop, forgetting some of the hard-learned lessons IT has gained over the years. One often overlooked feature is governance.

Does your company have good KPIs and measurements around what gets loaded into Hadoop? Do you have a good taxonomy and metadata tool? As your business grows, are you able to support 99.99% operations that your Hadoop instance can support? If your primary data center goes down, can you replicate models and data into another facility? As the prevalence of Hadoop usage grows, these questions are becoming increasingly common—and urgent.

Breakout Session: April 14th @ 1500 – 1540 in Wicklow Hall 2B

Title: Hadoop Everywhere: Geo-Distributed Storage for Big Data

Speakers: Nikhil Joshi, Consultant Product Manager and Vishrut Shah, Director of Engineering

Abstract: Traditionally, HDFS provides robust protection against disk failures, node failures and rack failures. The mechanisms to protect data against entire datacenter failures and outages leave much to be desired. Neither the storage substrate (HDFS), nor the applications on top (MapReduce, Hive, HBase…etc) are capable of running across geographies/data-centers. With Hadoop’s increased enterprise adoption, there is greater need to protect business critical datasets in Hadoop clusters. This is motivated in large part by compliance, regulation, data protection and business continuity planning. ‘distcp’, which has been the foundation for most Hadoop vendor backup and recovery solutions, just doesn’t cut it when strong consistency is required or when there are more than 2 sites. Cloud-native applications (especially in IoT scenarios) generate humongous amounts of data all across the globe. There is a need for global storage infrastructure to reason over this corpus of data. It’s time for Hadoop storage to break out of its single datacenter confines. In this talk, we will discuss the challenges, approaches and architectures to take Hadoop storage global! *Topics Covered*: Hadoop Compatible Filesystems (HCFS), Geo-distribution of data, Disaster Recovery, Storage Overhead, Strong Consistency, Multi-protocol Data Access, Shared Storage Architectures

Follow us at @EMCbigdata and get social with us on Twitter using #EMC and #HS16Dublin.

Strata+Hadoop San Jose 2016

Erin K. Banks

We are happy to announce that we will be at Strata+Hadoop San Jose 2016 next week (March 29 – 31) at the San Jose Convention Center. We will be in booth #1431 and have theater presentations covering our entire Big Data Portfolio and discussing how we can help you on your Big Data analytics journey. These adorable EMC elephant phone holder booth giveaways will be there too. Make sure you stop by, say hi, and attend one of our presentations, or just talk to us about your analytics environments and how you want to get the most value out of your data, because we can help you with that.

We also have two sessions during the conference:

Tame That Beast: How to Bring Operations, Governance, and Reliability to Hadoop from Keith Manthey (@KeithManthey). This session is on Wednesday at 1:50 – 2:30 PM in 210 B/F

and

Developing a Big Data Business Strategy from Bill Schmarzo (@schmarzo). This session is Wednesday at 11:00 – 11:40 AM in LL21 E/F

We look forward to seeing you there!

By the way… here is a picture of our booth so you know what to look for when you visit.

 
