Take a look at our Computer Vision & Pattern Recognition books. Shulph carries a great selection of Computer Vision & Pattern Recognition books, and we are always adding more.
Create end-to-end systems that can power robots with artificial vision and deep learning techniques Key Features Study ROS, the main development framework for robotics, in detail Learn all about convolutional neural networks, recurrent neural networks, and robotics Create a chatbot to interact with the robot Book Description Artificial Vision and Language Processing for Robotics begins by discussing the theory behind robots. You'll compare different methods used to work with robots and explore computer vision, its algorithms, and limits. You'll then learn how to control the robot with natural language processing commands. You'll study Word2Vec and GloVe embedding techniques, non-numeric data, recurrent neural network (RNNs), and their advanced models. You'll create a simple Word2Vec model with Keras, as well as build a convolutional neural network (CNN) and improve it with data augmentation and transfer learning. You'll study the ROS and build a conversational agent to manage your robot. You'll also integrate your agent with the ROS and convert an image to text and text to speech. You'll learn to build an object recognition system using a video. By the end of this book, you'll have the skills you need to build a functional application that can integrate with a ROS to extract useful information about your environment. What you will learn Explore the ROS and build a basic robotic system Understand the architecture of neural networks Identify conversation intents with NLP techniques Learn and use the embedding with Word2Vec and GloVe Build a basic CNN and improve it using generative models Use deep learning to implement artificial intelligence(AI)and object recognition Develop a simple object recognition system using CNNs Integrate AI with ROS to enable your robot to recognize objects Who this book is for Artificial Vision and Language Processing for Robotics is for robotics engineers who want to learn how to integrate computer vision and deep learning techniques to create complete robotic systems. It will prove beneficial to you if you have working knowledge of Python and a background in deep learning. Knowledge of the ROS is a plus.
Delve into practical computer vision and image processing projects and get up to speed with advanced object detection techniques and machine learning algorithms Key Features Discover best practices for engineering and maintaining OpenCV projects Explore important deep learning tools for image classification Understand basic image matrix formats and filters Book Description OpenCV is one of the best open source libraries available and can help you focus on constructing complete projects on image processing, motion detection, and image segmentation. This Learning Path is your guide to understanding OpenCV concepts and algorithms through real-world examples and activities. Through various projects, you'll also discover how to use complex computer vision and machine learning algorithms and face detection to extract the maximum amount of information from images and videos. In later chapters, you'll learn to enhance your videos and images with optical flow analysis and background subtraction. Sections in the Learning Path will help you get to grips with text segmentation and recognition, in addition to guiding you through the basics of the new and improved deep learning modules. By the end of this Learning Path, you will have mastered commonly used computer vision techniques to build OpenCV projects from scratch. This Learning Path includes content from the following Packt books: Mastering OpenCV 4 - Third Edition by Roy Shilkrot and David Millan Escriva Learn OpenCV 4 By Building Projects - Second Edition by David Millan Escriva, Vinicius G. Mendonca, and Prateek Joshi What you will learn Stay up-to-date with algorithmic design approaches for complex computer vision tasks Work with OpenCV's most up-to-date API through various projects Understand 3D scene reconstruction and Structure from Motion (SfM) Study camera calibration and overlay augmented reality (AR) using the ArUco module Create CMake scripts to compile your C++ application Explore segmentation and feature extraction techniques Remove backgrounds from static scenes to identify moving objects for surveillance Work with new OpenCV functions to detect and recognize text with Tesseract Who this book is for If you are a software developer with a basic understanding of computer vision and image processing and want to develop interesting computer vision applications with OpenCV, this Learning Path is for you. Prior knowledge of C++ and familiarity with mathematical concepts will help you better understand the concepts in this Learning Path.
Build and train scalable neural network models on various platforms by leveraging the power of Caffe2 Key Features Migrate models trained with other deep learning frameworks on Caffe2 Integrate Caffe2 with Android or iOS and implement deep learning models for mobile devices Leverage the distributed capabilities of Caffe2 to build models that scale easily Book Description Caffe2 is a popular deep learning library used for fast and scalable training and inference of deep learning models on various platforms. This book introduces you to the Caffe2 framework and shows how you can leverage its power to build, train, and deploy efficient neural network models at scale. It will cover the topics of installing Caffe2, composing networks using its operators, training models, and deploying models to different architectures. It will also show how to import models from Caffe and from other frameworks using the ONNX interchange format. It covers the topic of deep learning accelerators such as CPU and GPU and shows how to deploy Caffe2 models for inference on accelerators using inference engines. Caffe2 is built for deployment to a diverse set of hardware, using containers on the cloud and resource constrained hardware such as Raspberry Pi, which will be demonstrated. By the end of this book, you will be able to not only compose and train popular neural network models with Caffe2, but also be able to deploy them on accelerators, to the cloud and on resource constrained platforms such as mobile and embedded hardware. What you will learn Build and install Caffe2 Compose neural networks Train neural network on CPU or GPU Import a neural network from Caffe Import deep learning models from other frameworks Deploy models on CPU or GPU accelerators using inference engines Deploy models at the edge and in the cloud Who this book is for Data scientists and machine learning engineers who wish to create fast and scalable deep learning models in Caffe2 will find this book to be very useful. Some understanding of the basic machine learning concepts and prior exposure to programming languages like C++ and Python will be useful.
Gain a working knowledge of advanced machine learning and explore Python's powerful tools for extracting data from images and videos Key Features Implement image classification and object detection using machine learning and deep learning Perform image classification, object detection, image segmentation, and other Computer Vision tasks Crisp content with a practical approach to solving real-world problems in Computer Vision Book Description Python is the ideal programming language for rapidly prototyping and developing production-grade codes for image processing and Computer Vision with its robust syntax and wealth of powerful libraries. This book will help you design and develop production-grade Computer Vision projects tackling real-world problems. With the help of this book, you will learn how to set up Anaconda and Python for the major OSes with cutting-edge third-party libraries for Computer Vision. You'll learn state-of-the-art techniques for classifying images, finding and identifying human postures, and detecting faces within videos. You will use powerful machine learning tools such as OpenCV, Dlib, and TensorFlow to build exciting projects such as classifying handwritten digits, detecting facial features,and much more. The book also covers some advanced projects, such as reading text from license plates from real-world images using Google's Tesseract software, and tracking human body poses using DeeperCut within TensorFlow. By the end of this book, you will have the expertise required to build your own Computer Vision projects using Python and its associated libraries. What you will learn Install and run major Computer Vision packages within Python Apply powerful support vector machines for simple digit classification Understand deep learning with TensorFlow Build a deep learning classifier for general images Use LSTMs for automated image captioning Read text from real-world images Extract human pose data from images Who this book is for Python programmers and machine learning developers who wish to build exciting Computer Vision projects using the power of machine learning and OpenCV will find this book useful. The only prerequisite for this book is that you should have a sound knowledge of Python programming.
Leverage the power of the Python data science libraries and advanced machine learning techniques to analyse large unstructured datasets and predict the occurrence of a particular future event. Key Features Explore the depths of data science, from data collection through to visualization Learn pandas, scikit-learn, and Matplotlib in detail Study various data science algorithms using real-world datasets Book Description Data Science with Python begins by introducing you to data science and teaches you to install the packages you need to create a data science coding environment. You will learn three major techniques in machine learning: unsupervised learning, supervised learning, and reinforcement learning. You will also explore basic classification and regression techniques, such as support vector machines, decision trees, and logistic regression. As you make your way through chapters, you will study the basic functions, data structures, and syntax of the Python language that are used to handle large datasets with ease. You will learn about NumPy and pandas libraries for matrix calculations and data manipulation, study how to use Matplotlib to create highly customizable visualizations, and apply the boosting algorithm XGBoost to make predictions. In the concluding chapters, you will explore convolutional neural networks (CNNs), deep learning algorithms used to predict what is in an image. You will also understand how to feed human sentences to a neural network, make the model process contextual information, and create human language processing systems to predict the outcome. By the end of this book, you will be able to understand and implement any new data science algorithm and have the confidence to experiment with tools or libraries other than those covered in the book. What you will learn Pre-process data to make it ready to use for machine learning Create data visualizations with Matplotlib Use scikit-learn to perform dimension reduction using principal component analysis (PCA) Solve classification and regression problems Get predictions using the XGBoost library Process images and create machine learning models to decode them Process human language for prediction and classification Use TensorBoard to monitor training metrics in real time Find the best hyperparameters for your model with AutoML Who this book is for Data Science with Python is designed for data analysts, data scientists, database engineers, and business analysts who want to move towards using Python and machine learning techniques to analyze data and predict outcomes. Basic knowledge of Python and data analytics will prove beneficial to understand the various concepts explained through this book.
Create powerful, accurate, and real-time Computer Vision applications using a perfect blend of algorithms and filters. Also learn about object tracking and foreground extractions with a variety of new filters and algorithms. Key Features Filter, transform, and manipulate images using MAT class and OpenCV Framework Explore motion detection and object tracking with filters and algorithms Build object detectors using deep learning and machine learning algorithms Book Description An arena that has been positively impacted by the advancements in processing power and performance is the field of computer vision. It's only natural that over time, more and more algorithms are introduced to perform computer vision tasks more efficiently. Hands-On Algorithms for Computer Vision is a starting point for anyone who is interested in the field of computer vision and wants to explore the most practical algorithms used by professional computer vision developers. The book starts with the basics and builds up over the course of the chapters with hands-on examples for each algorithm. Right from the start, you will learn about the required tools for computer vision development, and how to install and configure them. You'll explore the OpenCV framework and its powerful collection of libraries and functions. Starting from the most simple image modifications, filtering, and transformations, you will gradually build up your knowledge of various algorithms until you are able to perform much more sophisticated tasks, such as real-time object detection using deep learning algorithms. What you will learn Get to grips with machine learning and artificial intelligence algorithms Read, write, and process images and videos Perform mathematical, matrix, and other types of image data operations Create and use histograms from back-projection images Detect motion, extract foregrounds, and track objects Extract key points with a collection of feature detector algorithms Develop cascade classifiers and use them, and train and test classifiers Employ TensorFlow object detection to detect multiple objects Who this book is for Hands-On Algorithms for Computer Vision helps those who want to learn algorithms in Computer Vision to create and customize their applications. This book will also help existing Computer Vision developers customize their applications. A basic understanding of computer vision and programming experience is needed.
A practical guide to building high performance systems for object detection, segmentation, video processing, smartphone applications, and more. Key Features Discover how to build, train, and serve your own deep neural networks with TensorFlow 2 and Keras Apply modern solutions to a wide range of applications such as object detection and video analysis Learn how to run your models on mobile devices and webpages and improve their performance Book Description Computer vision solutions are becoming increasingly common, making their way in fields such as health, automobile, social media, and robotics. This book will help you explore TensorFlow 2, the brand new version of Google's open source framework for machine learning. You will understand how to benefit from using convolutional neural networks (CNNs) for visual tasks. Hands-On Computer Vision with TensorFlow 2 starts with the fundamentals of computer vision and deep learning, teaching you how to build a neural network from scratch. You will discover the features that have made TensorFlow the most widely used AI library, along with its intuitive Keras interface, and move on to building, training, and deploying CNNs efficiently. Complete with concrete code examples, the book demonstrates how to classify images with modern solutions, such as Inception and ResNet, and extract specific content using You Only Look Once (YOLO), Mask R-CNN, and U-Net. You will also build Generative Adversarial Networks (GANs) and Variational Auto-Encoders (VAEs) to create and edit images, and LSTMs to analyze videos. In the process, you will acquire advanced insights into transfer learning, data augmentation, domain adaptation, and mobile and web deployment, among other key concepts. By the end of the book, you will have both the theoretical understanding and practical skills to solve advanced computer vision problems with TensorFlow 2.0. What you will learn Create your own neural networks from scratch Classify images with modern architectures including Inception and ResNet Detect and segment objects in images with YOLO, Mask R-CNN, and U-Net Tackle problems in developing self-driving cars and facial emotion recognition systems Boost your application's performance with transfer learning, GANs, and domain adaptation Use recurrent neural networks for video analysis Optimize and deploy your networks on mobile devices and in the browser Who this book is for If you're new to deep learning and have some background in Python programming and image processing, like reading/writing image files and editing pixels, this book is for you. Even if you're an expert curious about the new TensorFlow 2 features, you'll find this book useful. While some theoretical explanations require knowledge in algebra and calculus, the book covers concrete examples for learners focused on practical applications such as visual recognition for self-driving cars and smartphone apps.
Develop generative models for a variety of real-world use-cases and deploy them to production Key Features Discover various GAN architectures using Python and Keras library Understand how GAN models function with the help of theoretical and practical examples Apply your learnings to become an active contributor to open source GAN applications Book Description Generative Adversarial Networks (GANs) have revolutionized the fields of machine learning and deep learning. This book will be your first step towards understanding GAN architectures and tackling the challenges involved in training them. This book opens with an introduction to deep learning and generative models, and their applications in artificial intelligence (AI). You will then learn how to build, evaluate, and improve your first GAN with the help of easy-to-follow examples. The next few chapters will guide you through training a GAN model to produce and improve high-resolution images. You will also learn how to implement conditional GANs that give you the ability to control characteristics of GAN outputs. You will build on your knowledge further by exploring a new training methodology for progressive growing of GANs. Moving on, you'll gain insights into state-of-the-art models in image synthesis, speech enhancement, and natural language generation using GANs. In addition to this, you'll be able to identify GAN samples with TequilaGAN. By the end of this book, you will be well-versed with the latest advancements in the GAN framework using various examples and datasets, and you will have the skills you need to implement GAN architectures for several tasks and domains, including computer vision, natural language processing (NLP), and audio processing. Foreword by Ting-Chun Wang, Senior Research Scientist, NVIDIA What you will learn Learn how GANs work and the advantages and challenges of working with them Control the output of GANs with the help of conditional GANs, using embedding and space manipulation Apply GANs to computer vision, NLP, and audio processing Understand how to implement progressive growing of GANs Use GANs for image synthesis and speech enhancement Explore the future of GANs in visual and sonic arts Implement pix2pixHD to turn semantic label maps into photorealistic images Who this book is for This book is for machine learning practitioners, deep learning researchers, and AI enthusiasts who are looking for a perfect mix of theory and hands-on content in order to implement GANs using Keras. Working knowledge of Python is expected.
Discover how CUDA allows OpenCV to handle complex and rapidly growing image data processing in computer and machine vision by accessing the power of GPU Key Features Explore examples to leverage the GPU processing power with OpenCV and CUDA Enhance the performance of algorithms on embedded hardware platforms Discover C++ and Python libraries for GPU acceleration Book Description Computer vision has been revolutionizing a wide range of industries, and OpenCV is the most widely chosen tool for computer vision with its ability to work in multiple programming languages. Nowadays, in computer vision, there is a need to process large images in real time, which is difficult to handle for OpenCV on its own. This is where CUDA comes into the picture, allowing OpenCV to leverage powerful NVDIA GPUs. This book provides a detailed overview of integrating OpenCV with CUDA for practical applications. To start with, you'll understand GPU programming with CUDA, an essential aspect for computer vision developers who have never worked with GPUs. You'll then move on to exploring OpenCV acceleration with GPUs and CUDA by walking through some practical examples. Once you have got to grips with the core concepts, you'll familiarize yourself with deploying OpenCV applications on NVIDIA Jetson TX1, which is popular for computer vision and deep learning applications. The last chapters of the book explain PyCUDA, a Python library that leverages the power of CUDA and GPUs for accelerations and can be used by computer vision developers who use OpenCV with Python. By the end of this book, you'll have enhanced computer vision applications with the help of this book's hands-on approach. What you will learn Understand how to access GPU device properties and capabilities from CUDA programs Learn how to accelerate searching and sorting algorithms Detect shapes such as lines and circles in images Explore object tracking and detection with algorithms Process videos using different video analysis techniques in Jetson TX1 Access GPU device properties from the PyCUDA program Understand how kernel execution works Who this book is for This book is a go-to guide for you if you are a developer working with OpenCV and want to learn how to process more complex image data by exploiting GPU processing. A thorough understanding of computer vision concepts and programming languages such as C++ or Python is expected.
Leverage the power of Java and deep learning to build production-grade Computer Vision applications Key Features Build real-world Computer Vision applications using the power of neural networks Implement image classification, object detection, and face recognition Know best practices on effectively building and deploying deep learning models in Java Book Description Although machine learning is an exciting world to explore, you may feel confused by all of its theoretical aspects. As a Java developer, you will be used to telling the computer exactly what to do, instead of being shown how data is generated; this causes many developers to struggle to adapt to machine learning. The goal of this book is to walk you through the process of efficiently training machine learning and deep learning models for Computer Vision using the most up-to-date techniques. The book is designed to familiarize you with neural networks, enabling you to train them efficiently, customize existing state-of-the-art architectures, build real-world Java applications, and get great results in a short space of time. You will build real-world Computer Vision applications, ranging from a simple Java handwritten digit recognition model to real-time Java autonomous car driving systems and face recognition models. By the end of this book, you will have mastered the best practices and modern techniques needed to build advanced Computer Vision Java applications and achieve production-grade accuracy. What you will learn Discover neural networks and their applications in Computer Vision Explore the popular Java frameworks and libraries for deep learning Build deep neural networks in Java Implement an end-to-end image classification application in Java Perform real-time video object detection using deep learning Enhance performance and deploy applications for production Who this book is for This book is for data scientists, machine learning developers and deep learning practitioners with Java knowledge who want to implement machine learning and deep neural networks in the computer vision domain. You will need to have a basic knowledge of Java programming.