Well, if you like SQL, you'll love Hive! Hive is a popular part of the Hadoop "ecosystem." Hive sits on top of the Hadoop Distributed File System (HDFS) and allows you to treat your Big Data files as if they were relational database tables. Typically, Hive "tables" are simply "flat" text files that use commas, tabs, or other character as field delimiters.
The World is Adopting Scrum. Are You?
Scrum is one of the most popular Agile methodologies. It is an adaptive, iterative, fast, flexible, and effective methodology designed to deliver significant value quickly and throughout a project. Scrum ensures transparency in communication and creates an environment of collective accountability and continuous progress. You can become Scrum Master, Scrum Developer, or Scrum Product Owner certified through one of our instructor-led classes currently scheduled!
Hadoop Data Science
Are you an architect, software developer, analyst, or data scientist who wants to understand how to apply data science and machine learning on Hadoop? If so, then you need /training/etc's 3-day HDP Data Science course. It covers data science principles and techniques through lecture and hands-on experience. During the three-day course, students learn the processes and practice of data science, including machine learning and natural language processing. Students also learn the tools and programming languages used by data scientists, including Python, IPython, Mahout, Pig, NumPy, pandas, SciPy, Scikit-learn, the Natural Language Toolkit (NLTK), and Spark MLlib.