7 Most Important Books in “Big data &Machine learning”
Introducing Data Science explains vital data science concepts and teaches you how to accomplish the fundamental tasks that occupy data scientists. You’ll explore data visualization, graph databases, the use of NoSQL, and the data science process. You’ll use the Python language and common Python libraries as you experience firsthand the challenges of dealing with data at scale. Discover how Python allows you to gain insights from data sets so big that they need to be stored on multiple machines, or from data moving so quickly that no single machine can handle it
The book presents a breakdown of each variant of machine learning, how it works and how it is used within certain industries. Also covered are various algorithm types (supervised, unsupervised and so on) during training phases of machine learning. The reader will learn that with the right tools any developer or technology professional can glean information from their existing data. The book outlines the key types of machine learning, providing coded solutions for real world examples. There is a strong focus on data preparation and data cleaning, the core fundamental of machine learning. Each chapter includes how the code works and running examples.
This book provides a comprehensive view on the recent trend toward high performance computing architectures especially as it relates to analytics and data mining. Topics that are covered include: big data (and its characteristics), high performance computing for analytics, massively parallel processing (MPP) databases, algorithms for big data, in-memory databases, implementation of machine learning algorithms for big data platforms and analytics environments. However none gives a historical and comprehensive view of all these separate topics in a single document.
4) Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting DataHardcover – 2015
Data Science & Big Data Analytics educates readers about what Big Data is and how to extract value from it. The book covers methods and technologies required to analyze structured and unstructured datasets, as more individuals and organizations build out their capabilities to analyze Big Data and draw insights from it. Additional focus areas include machine learning, data visualization and presentation skills. The book provides practical foundation level training that enables immediate and effective participation in big data and other analytics projects. It provides grounding in basic and advanced analytic methods and an introduction to big data analytics technology and tools, including MapReduce and Hadoop.
Through a series of recent breakthroughs, deep learning has boosted the entire field of machine learning. Now, even programmers who know close to nothing about this technology can use simple, efficient tools to implement programs capable of learning from data. This practical book shows you how.
By using concrete examples, minimal theory, and two production-ready Python frameworks—scikit-learn and TensorFlow—author Aurélien Géron helps you gain an intuitive understanding of the concepts and tools for building intelligent systems. You’ll learn a range of techniques, starting with simple linear regression and progressing to deep neural networks. With exercises in each chapter to help you apply what you’ve learned, all you need is programming experience to get started.
Explore the machine learning landscape, particularly neural nets
Use scikit-learn to track an example machine-learning project end-to-end
Explore several training models, including support vector machines, decision trees, random forests, and ensemble methods
Use the TensorFlow library to build and train neural nets
Dive into neural net architectures, including convolutional nets, recurrent nets, and deep reinforcement learning
Learn techniques for training and scaling deep neural nets
Apply practical code examples without acquiring excessive machine learning theory or algorithm details
The third edition of a bestseller, Statistical and Machine-Learning Data Mining: Techniques for Better Predictive Modeling and Analysis of Big Data is still the only book, to date, to distinguish between statistical data mining and machine-learning data mining. is a compilation of new and creative data mining techniques, which address the scaling-up of the framework of classical and modern statistical methodology, for predictive modeling and analysis of big data. SM-DM provides proper solutions to common problems facing the newly minted data scientist in the data mining discipline. Its presentation focuses on the needs of the data scientists (commonly known as statisticians, data miners and data analysts), delivering practical yet powerful, simple yet insightful quantitative techniques, most of which use the "old" statistical methodologies improved upon by the new machine learning influence.