Take a look at our Data Science books. Shulph carries a great selection of Data Science books, and we are always adding more.
Advanced Customer Analytics provides a clear guide to the specific analytical challenges faced by the retail sector. The book covers the nature and scale of data obtained in transactions, relative proximity to the consumer and the need to monitor customer behaviour across multiple channels. The book advocates a category management approach, taking into account the need to understand the consumer mindset through elasticity modelling and discount strategies, as well as targeted marketing and loyalty design. A practical, no-nonsense approach to complex scenarios is taken throughout, breaking down tasks into easily digestible steps. The use of a fictional retail analyst 'Scott' helps to provide accessible examples of practice. Advanced Customer Analytics does not skirt around the complexities of this subject but offers conceptual support to steer retail marketers towards making the right choices for analysing their data. Online resources include a selection of datasets to support specific chapters.
Application of Big Data and Business Analytics uses advanced analytic tools to explore the solutions to problems in society, environment and industry. The chapters within bring together researchers, engineers and practitioners, encompassing a wide and diverse set of topics in almost every field. With the increase in the availability of data, analytics has now become a major element in both the top line and the bottom line of any organization. The primary target audience of this book includes researchers, academicians and data scientists from a variety of disciplines interested in analyzing and applying big data analytics. However, this work will also be of general interest to postgraduates and undergraduates pursuing advanced study in big data.
Become a master of data exploration by creating reproducible data processing pipelines, visualizations, and prediction models for your applications. Key Features: Get up and running with the Jupyter ecosystem and some example datasets; Learn about key machine learning concepts such as SVM, KNN classifiers, and Random Forests; Discover how you can use web scraping to gather and parse your own bespoke datasets. Book Description: Getting started with data science doesn't have to be an uphill battle. Applied Data Science with Python and Jupyter is a step-by-step guide ideal for beginners who know a little Python and are looking for a fast-paced introduction to these concepts. In this book, you'll learn every aspect of the standard data workflow process, including collecting, cleaning, investigating, visualizing, and modeling data. You'll start with the basics of Jupyter, which will be the backbone of the book. After familiarizing yourself with its standard features, you'll look at an example of it in practice with your first analysis. In the next lesson, you'll dive right into predictive analytics, where multiple classification algorithms are implemented. Finally, the book ends by looking at data collection techniques. You'll see how web data can be acquired with scraping techniques and via APIs, and then briefly explore interactive visualizations.
What you will learn: Get up and running with the Jupyter ecosystem; Identify potential areas of investigation and perform exploratory data analysis; Plan a machine learning classification strategy and train classification models; Use validation curves and dimensionality reduction to tune and enhance your models; Scrape tabular data from web pages and transform it into Pandas DataFrames; Create interactive, web-friendly visualizations to clearly communicate your findings. Who this book is for: Applied Data Science with Python and Jupyter is ideal for professionals with a variety of job descriptions across a large range of industries, given the rising popularity and accessibility of data science. You'll need some prior experience with Python, and any prior work with libraries such as Pandas and Matplotlib will give you a useful head start.
Make any team or business data driven with this practical guide to overcoming common challenges and creating a data culture. Businesses are increasingly focusing on their data and analytics strategy, but a data-driven culture grounded in evidence-based decision making can be difficult to achieve. Be Data Driven outlines a step-by-step roadmap to building a data-driven organization or team, beginning with deciding on outcomes and a strategy before moving on to investing in technology and upskilling where necessary. This practical guide explains what it means to be a data-driven organization and explores which technologies are advancing data and analytics. Crucially, it also examines the most common challenges to becoming data driven, from a foundational skills gap to issues with leadership and strategy and the impact of organizational culture. With case studies of businesses that have successfully used data, Be Data Driven shows managers, leaders and data professionals how to address hurdles, encourage a data culture and become truly data driven.
Enhance your data analysis and predictive modeling skills using popular Python tools. Key Features: Cover all the fundamental Python libraries for operating on and manipulating data for analysis; Use real-world datasets to perform predictive analytics with Python; Access modern data analysis techniques and detailed code with scikit-learn and SciPy. Book Description: Python is one of the most common and popular languages preferred by leading data analysts and statisticians for working with massive datasets and complex data visualizations. Become a Python Data Analyst introduces Python's most essential tools and libraries necessary to work with the data analysis process, right from preparing data to performing simple statistical analyses and creating meaningful data visualizations. In this book, we will cover Python libraries such as NumPy, pandas, matplotlib, seaborn, SciPy, and scikit-learn, and apply them in practical data analysis and statistics examples. As you make your way through the chapters, you will learn to efficiently use the Jupyter Notebook to operate and manipulate data using NumPy and the pandas library. In the concluding chapters, you will gain experience in building simple predictive models and carrying out statistical computation and analysis using rich Python tools and proven data analysis techniques. By the end of this book, you will have hands-on experience performing data analysis with Python. What you will learn: Explore important Python libraries and learn to install the Anaconda distribution; Understand the basics of NumPy; Produce informative and useful visualizations for analyzing data; Perform common statistical calculations; Build predictive models and understand the principles of predictive analytics. Who this book is for: Become a Python Data Analyst is for entry-level data analysts, data engineers, and BI professionals who want to make complete use of Python tools for performing efficient data analysis.
Prior knowledge of Python programming is necessary to understand the concepts covered in this book.
Get to grips with processing large volumes of data and presenting it as engaging, interactive insights using Spark and Python. Key Features: Get a hands-on, fast-paced introduction to the Python data science stack; Explore ways to create useful metrics and statistics from large datasets; Create detailed analysis reports with real-world data. Book Description: Processing big data in real time is challenging due to scalability, information inconsistency, and fault tolerance. Big Data Analysis with Python teaches you how to use tools that can control this data avalanche for you. With this book, you'll learn practical techniques to aggregate data into useful dimensions for subsequent analysis, extract statistical measurements, and transform datasets into features for other systems. The book begins with an introduction to data manipulation in Python using pandas. You'll then get familiar with statistical analysis and plotting techniques. With multiple hands-on activities in store, you'll be able to analyze data that is distributed on several computers by using Dask. As you progress, you'll study how to aggregate data for plots when the entire dataset cannot fit in memory. You'll also explore Hadoop (HDFS and YARN), which will help you tackle larger datasets. The book also covers Spark and explains how it interacts with other tools. By the end of this book, you'll be able to bootstrap your own Python environment, process large files, and manipulate data to generate statistics, metrics, and graphs.
What you will learn: Use Python to read and transform data into different formats; Generate basic statistics and metrics using data on disk; Work with computing tasks distributed over a cluster; Convert data from various sources into storage or querying formats; Prepare data for statistical analysis, visualization, and machine learning; Present data in the form of effective visuals. Who this book is for: Big Data Analysis with Python is designed for Python developers, data analysts, and data scientists who want to get hands-on with methods to control data and transform it into impactful insights. Basic knowledge of statistical measurements and relational databases will help you to understand various concepts explained in this book.
For many organizations data is a by-product, but for the smarter ones it is the heartbeat of their business. Most businesses have a wealth of data buried in their systems which, if used effectively, could increase revenue, reduce costs and risk and improve customer satisfaction and employee experience. Beginning with how to choose projects which reflect your organization's goals and how to make the business case for investing in data, this book then takes the reader through the five 'waves' of organizational data maturity. It takes the reader from getting started on the data journey with some quick wins, to how data can help your business become a leading innovator which systematically outperforms competitors. Data and Analytics Strategy for Business outlines how to build consistent, high-quality sources of data which will create business value and explores how automation, AI and machine learning can improve performance and decision making. Filled with real-world examples and case studies, this book is a stage-by-stage guide to designing and implementing a results-driven data strategy. 
Table of contents: Section - PART ONE: How data and analytics can help you grow your business; Chapter - 01: How can this book help you?; Chapter - 02: The business case for data; Chapter - 03: Your data and analytics strategy; Chapter - 04: A team game; Section - PART TWO: Wave 1: Aspire; Chapter - 05: A quick win; Chapter - 06: Repeat and learn; Section - PART THREE: Wave 2: Mature; Chapter - 07: Data governance; Chapter - 08: Data quality; Chapter - 09: A single customer view; Chapter - 10: Reports and dashboards; Chapter - 11: Data risk management and ethics; Section - PART FOUR: Wave 3: Industrialize; Chapter - 12: Automation, automation, automation; Chapter - 13: Scaling up and scaling out; Chapter - 14: Optimizing; Section - PART FIVE: Wave 4: Realize; Chapter - 15: The voice of the customer; Chapter - 16: Maximizing data science; Chapter - 17: Sharing data with suppliers and customers; Section - PART SIX: Wave 5: Differentiate; Chapter - 18: From data-driven to AI-driven; Chapter - 19: Data products; Chapter - 20: Right leadership, right time; Chapter - 21: Epilogue: Data success. About the author: Simon Asplen-Taylor is an experienced and successful data and analytics leader based in London, UK, having served as Chief Data Officer for multiple FTSE firms and led some of the largest data-led transformations in Europe. He specialises in transforming business through the use of data, analytics and artificial intelligence and is currently leading the data transformation at Lloyd's of London. He was included in the dataIQ 100 Most Influential People in Data in both 2020 and 2021.
Leverage the power of the Python data science libraries and advanced machine learning techniques to analyse large unstructured datasets and predict the occurrence of a particular future event. Key Features: Explore the depths of data science, from data collection through to visualization; Learn pandas, scikit-learn, and Matplotlib in detail; Study various data science algorithms using real-world datasets. Book Description: Data Science with Python begins by introducing you to data science and teaches you to install the packages you need to create a data science coding environment. You will learn three major techniques in machine learning: unsupervised learning, supervised learning, and reinforcement learning. You will also explore basic classification and regression techniques, such as support vector machines, decision trees, and logistic regression. As you make your way through chapters, you will study the basic functions, data structures, and syntax of the Python language that are used to handle large datasets with ease. You will learn about NumPy and pandas libraries for matrix calculations and data manipulation, study how to use Matplotlib to create highly customizable visualizations, and apply the boosting algorithm XGBoost to make predictions. In the concluding chapters, you will explore convolutional neural networks (CNNs), deep learning algorithms used to predict what is in an image. You will also understand how to feed human sentences to a neural network, make the model process contextual information, and create human language processing systems to predict the outcome. By the end of this book, you will be able to understand and implement any new data science algorithm and have the confidence to experiment with tools or libraries other than those covered in the book.
What you will learn: Pre-process data to make it ready to use for machine learning; Create data visualizations with Matplotlib; Use scikit-learn to perform dimension reduction using principal component analysis (PCA); Solve classification and regression problems; Get predictions using the XGBoost library; Process images and create machine learning models to decode them; Process human language for prediction and classification; Use TensorBoard to monitor training metrics in real time; Find the best hyperparameters for your model with AutoML. Who this book is for: Data Science with Python is designed for data analysts, data scientists, database engineers, and business analysts who want to move towards using Python and machine learning techniques to analyze data and predict outcomes. Basic knowledge of Python and data analytics will prove beneficial to understand the various concepts explained through this book.
Build a strong foundation in machine learning algorithms in 7 days. Key Features: Use Python and its wide array of machine learning libraries to build predictive models; Learn the basics of the 7 most widely used machine learning algorithms within a week; Know when and where to apply data science algorithms using this guide. Book Description: Machine learning applications are highly automated and self-modifying, and continue to improve over time with minimal human intervention, as they learn from the data they are trained on. To address the complex nature of various real-world data problems, specialized machine learning algorithms have been developed. Through algorithmic and statistical analysis, these models can be leveraged to gain new knowledge from existing data as well. Data Science Algorithms in a Week addresses all problems related to accurate and efficient data classification and prediction. Over the course of seven days, you will be introduced to seven algorithms, along with exercises that will help you understand different aspects of machine learning. You will see how to pre-cluster your data to optimize and classify it for large datasets. This book also guides you in predicting data based on existing trends in your dataset. This book covers algorithms such as k-nearest neighbors, Naive Bayes, decision trees, random forest, k-means, regression, and time-series analysis.
By the end of this book, you will understand how to choose machine learning algorithms for clustering, classification, and regression and know which is best suited for your problem. What you will learn: Understand how to identify a data science problem correctly; Implement well-known machine learning algorithms efficiently using Python; Classify your datasets using Naive Bayes, decision trees, and random forest with accuracy; Devise an appropriate prediction solution using regression; Work with time series data to identify relevant data events and trends; Cluster your data using the k-means algorithm. Who this book is for: This book is for aspiring data science professionals who are familiar with Python and have a little background in statistics. You'll also find this book useful if you're currently working with data science algorithms in some capacity and want to expand your skill set.
Data Science and Analytics explores the solutions to problems in society, the environment and industry. With the increase in the availability of data, analytics has now become a major element in both the top line and the bottom line of any organization. This book explores perspectives on how big data and business analytics are increasingly essential in better decision making. This edited work explores the application of big data and business analytics by academics, researchers, industrial experts, policy makers and practitioners, helping the reader to understand how big data can be efficiently utilized in better managerial applications. Data Science and Analytics brings together researchers, engineers and practitioners to encompass a wide and diverse range of topics across many fields. The book will provide unique insights to researchers, academics and data scientists from a variety of disciplines interested in analyzing and applying big data analytics, as well as data analysts, students and scholars pursuing advanced study in big data.