Take a look at our Data Visualization books. Shulph carries a great selection of Data Visualization books, and we are always adding more.
Explore the exciting world of machine learning with the fastest growing technology in the world Key Features Understand various machine learning concepts with real-world examples Implement a supervised machine learning pipeline from data ingestion to validation Gain insights into how you can use machine learning in everyday life Book Description Machine learning—the ability of a machine to give right answers based on input data—has revolutionized the way we do business. Applied Supervised Learning with Python provides a rich understanding of how you can apply machine learning techniques in your data science projects using Python. You'll explore Jupyter Notebooks, the technology used commonly in academic and commercial circles with in-line code running support. With the help of fun examples, you'll gain experience working on the Python machine learning toolkit—from performing basic data cleaning and processing to working with a range of regression and classification algorithms. Once you've grasped the basics, you'll learn how to build and train your own models using advanced techniques such as decision trees, ensemble modeling, validation, and error metrics. You'll also learn data visualization techniques using powerful Python libraries such as Matplotlib and Seaborn. This book also covers ensemble modeling and random forest classifiers along with other methods for combining results from multiple models, and concludes by delving into cross-validation to test your algorithm and check how well the model works on unseen data. By the end of this book, you'll be equipped to not only work with machine learning algorithms, but also be able to create some of your own! What you will learn Understand the concept of supervised learning and its applications Implement common supervised learning algorithms using machine learning Python libraries Validate models using the k-fold technique Build your models with decision trees to get results effortlessly Use ensemble modeling techniques to improve the performance of your model Apply a variety of metrics to compare machine learning models Who this book is for Applied Supervised Learning with Python is for you if you want to gain a solid understanding of machine learning using Python. It'll help if you to have some experience in any functional or object-oriented language and a basic understanding of Python libraries and expressions, such as arrays and dictionaries.
Design clever algorithms that can uncover interesting structures and hidden relationships in unstructured, unlabeled data Key Features Learn how to select the most suitable Python library to solve your problem Compare k-Nearest Neighbor (k-NN) and non-parametric methods and decide when to use them Delve into the applications of neural networks using real-world datasets Book Description Unsupervised learning is a useful and practical solution in situations where labeled data is not available. Applied Unsupervised Learning with Python guides you on the best practices for using unsupervised learning techniques in tandem with Python libraries and extracting meaningful information from unstructured data. The course begins by explaining how basic clustering works to find similar data points in a set. Once you are well versed with the k-means algorithm and how it operates, you'll learn what dimensionality reduction is and where to apply it. As you progress, you'll learn various neural network techniques and how they can improve your model. While studying the applications of unsupervised learning, you will also understand how to mine topics that are trending on Twitter and Facebook and build a news recommendation engine for users. You will complete the course by challenging yourself through various interesting activities such as performing a Market Basket Analysis and identifying relationships between different merchandises. By the end of this course, you will have the skills you need to confidently build your own models using Python. What you will learn Understand the basics and importance of clustering Build k-means, hierarchical, and DBSCAN clustering algorithms from scratch with built-in packages Explore dimensionality reduction and its applications Use scikit-learn (sklearn) to implement and analyse principal component analysis (PCA)on the Iris dataset Employ Keras to build autoencoder models for the CIFAR-10 dataset Apply the Apriori algorithm with machine learning extensions (Mlxtend) to study transaction data Who this book is for This course is designed for developers, data scientists, and machine learning enthusiasts who are interested in unsupervised learning. Some familiarity with Python programming along with basic knowledge of mathematical concepts including exponents, square roots, means, and medians will be beneficial.
In the fast moving world of the fourth industrial revolution not everyone needs to be a data scientist but everyone should be data literate, with the ability to read, analyze and communicate with data.It is not enough for a business to have the best data if those using it don't understand the right questions to ask or how to use the information generated to make decisions. Be Data Literate is the essential guide to developing the curiosity, creativity and critical thinking necessary to make anyone data literate, without retraining as a data scientist or statistician. With learnings to show development and real-world examples from industries implementing data literacy skills, this book explains how to confidently read and speak the 'language of data' in the modern business environment and everyday life. Be Data Literate is a practical guide to understanding the four levels of analytics, how to analyze data and the key steps to making smarter, data-informed decisions. Written by a founding pioneer and worldwide leading expert on data literacy, this book empowers professionals with the skills they need to succeed in the digital world.
Get to grips with processing large volumes of data and presenting it as engaging, interactive insights using Spark and Python. Key Features Get a hands-on, fast-paced introduction to the Python data science stack Explore ways to create useful metrics and statistics from large datasets Create detailed analysis reports with real-world data Book Description Processing big data in real time is challenging due to scalability, information inconsistency, and fault tolerance. Big Data Analysis with Python teaches you how to use tools that can control this data avalanche for you. With this book, you'll learn practical techniques to aggregate data into useful dimensions for posterior analysis, extract statistical measurements, and transform datasets into features for other systems. The book begins with an introduction to data manipulation in Python using pandas. You'll then get familiar with statistical analysis and plotting techniques. With multiple hands-on activities in store, you'll be able to analyze data that is distributed on several computers by using Dask. As you progress, you'll study how to aggregate data for plots when the entire data cannot be accommodated in memory. You'll also explore Hadoop (HDFS and YARN), which will help you tackle larger datasets. The book also covers Spark and explains how it interacts with other tools. By the end of this book, you'll be able to bootstrap your own Python environment, process large files, and manipulate data to generate statistics, metrics, and graphs. What you will learn Use Python to read and transform data into different formats Generate basic statistics and metrics using data on disk Work with computing tasks distributed over a cluster Convert data from various sources into storage or querying formats Prepare data for statistical analysis, visualization, and machine learning Present data in the form of effective visuals Who this book is for Big Data Analysis with Python is designed for Python developers, data analysts, and data scientists who want to get hands-on with methods to control data and transform it into impactful insights. Basic knowledge of statistical measurements and relational databases will help you to understand various concepts explained in this book.
No need to spend hours ploughing through endless data – let Spark, one of the fastest big data processing engines available, do the hard work for you. Key Features Get up and running with Apache Spark and Python Integrate Spark with AWS for real-time analytics Apply processed data streams to machine learning APIs of Apache Spark Book Description Processing big data in real time is challenging due to scalability, information consistency, and fault-tolerance. This book teaches you how to use Spark to make your overall analytical workflow faster and more efficient. You'll explore all core concepts and tools within the Spark ecosystem, such as Spark Streaming, the Spark Streaming API, machine learning extension, and structured streaming. You'll begin by learning data processing fundamentals using Resilient Distributed Datasets (RDDs), SQL, Datasets, and Dataframes APIs. After grasping these fundamentals, you'll move on to using Spark Streaming APIs to consume data in real time from TCP sockets, and integrate Amazon Web Services (AWS) for stream consumption. By the end of this book, you'll not only have understood how to use machine learning extensions and structured streams but you'll also be able to apply Spark in your own upcoming big data projects. What you will learn Write your own Python programs that can interact with Spark Implement data stream consumption using Apache Spark Recognize common operations in Spark to process known data streams Integrate Spark streaming with Amazon Web Services (AWS) Create a collaborative filtering model with the movielens dataset Apply processed data streams to Spark machine learning APIs Who this book is for Data Processing with Apache Spark is for you if you are a software engineer, architect, or IT professional who wants to explore distributed systems and big data analytics. Although you don't need any knowledge of Spark, prior experience of working with Python is recommended.
A practical guide to implementing a scalable and fast state-of-the-art analytical data estateKey FeaturesStore and analyze data with enterprise-grade security and auditingPerform batch, streaming, and interactive analytics to optimize your big data solutions with easeDevelop and run parallel data processing programs using real-world enterprise scenariosBook DescriptionAzure Data Lake, the modern data warehouse architecture, and related data services on Azure enable organizations to build their own customized analytical platform to fit any analytical requirements in terms of volume, speed, and quality. This book is your guide to learning all the features and capabilities of Azure data services for storing, processing, and analyzing data (structured, unstructured, and semi-structured) of any size. You will explore key techniques for ingesting and storing data and perform batch, streaming, and interactive analytics. The book also shows you how to overcome various challenges and complexities relating to productivity and scaling. Next, you will be able to develop and run massive data workloads to perform different actions. Using a cloud-based big data-modern data warehouse-analytics setup, you will also be able to build secure, scalable data estates for enterprises. Finally, you will not only learn how to develop a data warehouse but also understand how to create enterprise-grade security and auditing big data programs. By the end of this Azure book, you will have learned how to develop a powerful and efficient analytical platform to meet enterprise needs.What you will learnImplement data governance with Azure servicesUse integrated monitoring in the Azure Portal and integrate Azure Data Lake Storage into the Azure MonitorExplore the serverless feature for ad-hoc data discovery, logical data warehousing, and data wranglingImplement networking with Synapse Analytics and Spark poolsCreate and run Spark jobs with Databricks clustersImplement streaming using Azure Functions, a serverless runtime environment on AzureExplore the predefined ML services in Azure and use them in your appWho this book is forThis book is for data architects, ETL developers, or anyone who wants to get well-versed with Azure data services to implement an analytical data estate for their enterprise. The book will also appeal to data scientists and data analysts who want to explore all the capabilities of Azure data services, which can be used to store, process, and analyze any kind of data. A beginner-level understanding of data analysis and streaming will be required.
This book will help you build interactive graphs that are viewable in any web browser using JavaScript, D3.js, and SVG. You will learn how to make a scatter plot, a bar graph, a pie chart, a force directed graph, and a map. Key Features Takes you through the most common graphs you'll need Add interactivity to your visualizations Easy to follow builds Book Description D3.js is a JavaScript library that allows you to create graphs and data visualizations in the browser with HTML, SVG, and CSS. This book will take you from the basics of D3.js, so that you can create your own interactive visualizations, to creating the most common graphs that you will encounter as a developer, scientist, statistician, or data scientist. The book begins with an overview of SVG, the basis for creating two-dimensional graphics in the browser. Once the reader has a firm understanding of SVG, we will tackle the basics of how to use D3.js to connect data to our SVG elements. We will start with a scatter plot that maps run data to circles on a graph, and expand our scatter plot to make it interactive. You will see how you can easily allow the users of your graph to create, edit, and delete run data by simply dragging and clicking the graph. Next, we will explore creating a bar graph, using external data from a mock API. After that, we will explore animations and motion with a bar graph, and use various physics-based forces to create a force-directed graph. Finally, we will look at how to use GeoJSON data to create a map. What you will learn Build a scatter plot Build a bar graph Build a pie chart Build a force-directed graph Build a map Build interactivity into your graphs Who this book is for This book is for web developers, interactive news developers, data scientists, and anyone interested in representing data through interactive visualizations on the Web with D3. Some basic knowledge of JavaScript is expected, but no prior experience with data visualization or D3 is required to follow this book.
Understand, explore, and effectively present data using the powerful data visualization techniques of Python programming. Key Features Study key visualization tools and techniques with real-world data Explore industry-standard plotting libraries, including Matplotlib and Seaborn Breathe life into your visuals with exciting widgets and animations using Bokeh Book Description Data Visualization with Python reviews the spectrum of data visualization and its importance. Designed for beginners, it'll help you learn about statistics by computing mean, median, and variance for certain numbers. In the first few chapters, you'll be able to take a quick tour of key NumPy and Pandas techniques, which include indexing, slicing, iterating, filtering, and grouping. The book keeps pace with your learning needs, introducing you to various visualization libraries. As you work through chapters on Matplotlib and Seaborn, you'll discover how to create visualizations in an easier way. After a lesson on these concepts, you can then brush up on advanced visualization techniques like geoplots and interactive plots. You'll learn how to make sense of geospatial data, create interactive visualizations that can be integrated into any webpage, and take any dataset to build beautiful visualizations. What's more? You'll study how to plot geospatial data on a map using Choropleth plot and understand the basics of Bokeh, extending plots by adding widgets and animating the display of information. By the end of this book, you'll be able to put your learning into practice with an engaging activity, where you can work with a new dataset to create an insightful capstone visualization. What you will learn Understand and use various plot types with Python Explore and work with different plotting libraries Learn to create effective visualizations Improve your Python data wrangling skills Hone your skill set by using tools like Matplotlib, Seaborn, and Bokeh Reinforce your knowledge of various data formats and representations Who this book is for Data Visualization with Python is designed for developers and scientists, who want to get into data science or want to use data visualizations to enrich their personal and professional projects. You do not need any prior experience in data analytics and visualization, however, it'll help you to have some knowledge of Python and familiarity with high school level mathematics. Even though this is a beginner level course on data visualization, experienced developers will be able to improve their Python skills by working with real-world data.
Simplify your ETL processes with these hands-on data hygiene tips, tricks, and best practices. Key Features Focus on the basics of data wrangling Study various ways to extract the most out of your data in less time Boost your learning curve with bonus topics like random data generation and data integrity checks Book Description For data to be useful and meaningful, it must be curated and refined. Data Wrangling with Python teaches you the core ideas behind these processes and equips you with knowledge of the most popular tools and techniques in the domain. The book starts with the absolute basics of Python, focusing mainly on data structures. It then delves into the fundamental tools of data wrangling like NumPy and Pandas libraries. You'll explore useful insights into why you should stay away from traditional ways of data cleaning, as done in other languages, and take advantage of the specialized pre-built routines in Python. This combination of Python tips and tricks will also demonstrate how to use the same Python backend and extract/transform data from an array of sources including the Internet, large database vaults, and Excel financial tables. To help you prepare for more challenging scenarios, you'll cover how to handle missing or wrong data, and reformat it based on the requirements from the downstream analytics tool. The book will further help you grasp concepts through real-world examples and datasets. By the end of this book, you will be confident in using a diverse array of sources to extract, clean, transform, and format your data efficiently. What you will learn Use and manipulate complex and simple data structures Harness the full potential of DataFrames and numpy.array at run time Perform web scraping with BeautifulSoup4 and html5lib Execute advanced string search and manipulation with RegEX Handle outliers and perform data imputation with Pandas Use descriptive statistics and plotting techniques Practice data wrangling and modeling using data generation techniques Who this book is for Data Wrangling with Python is designed for developers, data analysts, and business analysts who are keen to pursue a career as a full-fledged data scientist or analytics expert. Although, this book is for beginners, prior working knowledge of Python is necessary to easily grasp the concepts covered here. It will also help to have rudimentary knowledge of relational database and SQL.
Build an end-to-end business solution in the cognitive automation lifecycle and explore UiPath Document Understanding, UiPath AI Center, and DruidKey FeaturesExplore out-of-the-box (OOTB) AI Models in UiPathLearn how to deploy, manage, and continuously improve machine learning models using UiPath AI CenterDeploy UiPath-integrated chatbots and master UiPath Document UnderstandingBook DescriptionArtificial intelligence (AI) enables enterprises to optimize business processes that are probabilistic, highly variable, and require cognitive abilities with unstructured data. Many believe there is a steep learning curve with AI, however, the goal of our book is to lower the barrier to using AI. This practical guide to AI with UiPath will help RPA developers and tech-savvy business users learn how to incorporate cognitive abilities into business process optimization. With the hands-on approach of this book, you'll quickly be on your way to implementing cognitive automation to solve everyday business problems. Complete with step-by-step explanations of essential concepts, practical examples, and self-assessment questions, this book will help you understand the power of AI and give you an overview of the relevant out-of-the-box models. You'll learn about cognitive AI in the context of RPA, the basics of machine learning, and how to apply cognitive automation within the development lifecycle. You'll then put your skills to test by building three use cases with UiPath Document Understanding, UiPath AI Center, and Druid. By the end of this AI book, you'll be able to build UiPath automations with the cognitive capabilities of intelligent document processing, machine learning, and chatbots, while understanding the development lifecycle.What you will learnDiscover how to bridge the gap between RPA and cognitive automationUnderstand how to configure, deploy, and maintain ML models in UiPathExplore OOTB models to manage documents, chats, emails, and morePrepare test data and test cases for user acceptance testing (UAT)Build a UiPath automation to act upon Druid responsesFind out how to connect custom models to RPAWho this book is forAI Engineers and RPA developers who want to upskill and deploy out-of-the-box models using UiPath's AI capabilities will find this guide useful. A basic understanding of robotic process automation and machine learning will be beneficial but not mandatory to get started with this UiPath book.