No catches, no fine print just unadulterated book loving, with your favourite books saved to your own digital bookshelf.
New members get entered into our monthly draw to win £100 to spend in your local bookshop Plus lots lots more…Find out more
See below for a selection of the latest books from Data mining category. Presented with a red border are the Data mining books that have been lovingly read and reviewed by the experts at Lovereading. With expert reading recommendations made by people with a passion for books and some unique features Lovereading will help you find great Data mining books and those from many more genres to read that will keep you inspired and entertained. And it's all free!
With this in-depth book, data professionals, managers, and tech leaders will learn powerful, real-world best practices and get a better understanding for data ethics. Contributors from top companies in technology, finance, and other industries share their experiences and lessons learned on bias, privacy, security, and data governance--the things you need to know for ethically collecting, managing, and using data.
This new edition includes some key topics relating to the latest version of MS Office, including use of the ribbon, current Excel file types, Dashboard, and basic Sharepoint integration. It shows how to automate operations, such as curve fitting, sorting, filtering, and analyzing data from a variety of sources. The book allows users to analyze data and automate the preparation of custom reports and demonstrates how to assign Excel VBA code to the new Ribbon user interface.
Discover a variety of data-mining algorithms that are useful for selecting small sets of important features from among unwieldy masses of candidates, or extracting useful features from measured variables. As a serious data miner you will often be faced with thousands of candidate features for your prediction or classification application, with most of the features being of little or no value. You'll know that many of these features may be useful only in combination with certain other features while being practically worthless alone or in combination with most others. Some features may have enormous predictive power, but only within a small, specialized area of the feature space. The problems that plague modern data miners are endless. This book helps you solve this problem by presenting modern feature selection techniques and the code to implement them. Some of these techniques are: Forward selection component analysis Local feature selection Linking features and a target with a hidden Markov model Improvements on traditional stepwise selection Nominal-to-ordinal conversion All algorithms are intuitively justified and supported by the relevant equations and explanatory material. The author also presents and explains complete, highly commented source code. The example code is in C++ and CUDA C but Python or other code can be substituted; the algorithm is important, not the code that's used to write it. What You Will Learn Combine principal component analysis with forward and backward stepwise selection to identify a compact subset of a large collection of variables that captures the maximum possible variation within the entire set. Identify features that may have predictive power over only a small subset of the feature domain. Such features can be profitably used by modern predictive models but may be missed by other feature selection methods. Find an underlying hidden Markov model that controls the distributions of feature variables and the target simultaneously. The memory inherent in this method is especially valuable in high-noise applications such as prediction of financial markets. Improve traditional stepwise selection in three ways: examine a collection of 'best-so-far' feature sets; test candidate features for inclusion with cross validation to automatically and effectively limit model complexity; and at each step estimate the probability that our results so far could be just the product of random good luck. We also estimate the probability that the improvement obtained by adding a new variable could have been just good luck. Take a potentially valuable nominal variable (a category or class membership) that is unsuitable for input to a prediction model, and assign to each category a sensible numeric value that can be used as a model input. Who This Book Is For Intermediate to advanced data science programmers and analysts. C++ and CUDA C experience is highly recommended. However, this book can be used as a framework using other languages such as Python.
This book thoroughly discusses the varying problems that occur in data mining, including their sources, consequences, detection, and treatment. Specific strategies for data pretreatment and analytical validation that are broadly applicable are described, making them useful in conjunction with most data mining analysis methods. Examples illustrate the performance of the pretreatment and validation methods in a variety of situations. The book, which deals with a wider range of data anomalies than are usually treated, includes a discussion of detecting anomalies through generalized sensitivity analysis (GSA), a process of identifying inconsistencies using systematic and extensive comparisons of results obtained by analysis of exchangeable datasets or subsets. Real data is made extensive use of, both in the form of a detailed analysis of a few real datasets and various published examples. A succinct introduction to functional equations illustrates their utility in describing various forms of qualitative behavior for useful data characterizations.
This book is a comprehensive introduction to the methods and algorithms of modern data analytics. It provides a sound mathematical basis, discusses advantages and drawbacks of different approaches, and enables the reader to design and implement data analytics solutions for real-world applications. This book has been used for more than ten years in the Data Mining course at the Technical University of Munich. Much of the content is based on the results of industrial research and development projects at Siemens.
As data floods into your company, you need to put it to work right away-and SQL is the best tool for the job. With the latest edition of this introductory guide, author Alan Beaulieu helps developers get up to speed with SQL fundamentals for writing database applications, performing administrative tasks, and generating reports. You'll find new chapters on SQL and big data, analytic functions, and working with very large databases. Each chapter presents a self-contained lesson on a key SQL concept or technique using numerous illustrations and annotated examples. Exercises let you practice the skills you learn. Knowledge of SQL is a must for interacting with data. With Learning SQL, you'll quickly discover how to put the power and flexibility of this language to work. Move quickly through SQL basics and several advanced features Use SQL data statements to generate, manipulate, and retrieve data Create database objects, such as tables, indexes, and constraints with SQL schema statements Learn how datasets interact with queries; understand the importance of subqueries Convert and manipulate data with SQL's built-in functions and use conditional logic in data statements
This book constitutes revised selected papers from the 8th and 9th IFIP WG 2.6 International Symposium on Data-Driven Process Discovery and Analysis, SIMPDA 2018, held in Seville, Spain, on December 13-14, 2018, and SIMPDA 2019, held in Bled, Slovenia, on September 8, 2019. From 16 submissions received for SIMPDA 2018 and 9 submissions received for SIMPDA 2019, 3 papers each were carefully reviewed and selected for presentation in this volume. They cover theoretical issues related to process representation, discovery, and analysis or provide practical and operational examples of their application.
As mobile devices continue becoming a larger part of our lives, the development of location acquisition technologies to track moving objects have focused the minds of researchers on issues ranging from longitude and latitude coordinates, speed, direction, and timestamping, as part of parameters needed to calculate the positional information and locations of objects, in terms of time and position in the form of trajectory streams. Recently, recent advances have facilitated various urban applications such as smart transportation and mobile delivery services.Unlike other books on spatial databases, mobile computing, data mining, or computing with spatial trajectories, this book is focused on smart transportation applications.This book is a good reference for advanced undergraduates, graduate students, researchers, and system developers working on transportation systems.
The different facets of the sharing economy offer numerous opportunities for businesses ? particularly those that can be distinguished by their creative ideas and their ability to easily connect buyers and senders of goods and services via digital platforms. At the beginning of the growth of this economy, the advanced digital technologies generated billions of bytes of data that constitute what we call Big Data. This book underlines the facilitating role of Big Data analytics, explaining why and how data analysis algorithms can be integrated operationally, in order to extract value and to improve the practices of the sharing economy. It examines the reasons why these new techniques are necessary for businesses of this economy and proposes a series of useful applications that illustrate the use of data in the sharing ecosystem.