win prizes debuts of the month
Search our site
The Heart of Mars by Paul Magrs Read the opening extract of the brand new Paul Magrs book before its publication on 15/03/2018

Introduction to HPC with MPI for Data Science by Frank Nielsen

Introduction to HPC with MPI for Data Science

Part of the Undergraduate Topics in Computer Science Series


Introduction to HPC with MPI for Data Science by Frank Nielsen

This gentle introduction to High Performance Computing (HPC) for Data Science using the Message Passing Interface (MPI) standard has been designed as a first course for undergraduates on parallel programming on distributed memory models, and requires only basic programming notions.Divided into two parts the first part covers high performance computing using C++ with the Message Passing Interface (MPI) standard followed by a second part providing high-performance data analytics on computer clusters.In the first part, the fundamental notions of blocking versus non-blocking point-to-point communications, global communications (like broadcast or scatter) and collaborative computations (reduce), with Amdalh and Gustafson speed-up laws are described before addressing parallel sorting and parallel linear algebra on computer clusters. The common ring, torus and hypercube topologies of clusters are then explained and global communication procedures on these topologies are studied. This first part closes with the MapReduce (MR) model of computation well-suited to processing big data using the MPI framework.In the second part, the book focuses on high-performance data analytics. Flat and hierarchical clustering algorithms are introduced for data exploration along with how to program these algorithms on computer clusters, followed by machine learning classification, and an introduction to graph analytics. This part closes with a concise introduction to data core-sets that let big data problems be amenable to tiny data problems.Exercises are included at the end of each chapter in order for students to practice the concepts learned, and a final section contains an overall exam which allows them to evaluate how well they have assimilated the material covered in the book.

About the Author

Frank Nielsen is a Professor at Ecole Polytechnique in France where he teaches graduate (vision/graphics) and undergraduate (Java/algorithms),and a senior researcher at Sony Computer Science Laboratories Inc. His research includes Computational information geometry for imaging and learning and he is the author of 3 textbooks and 3 edited books. He is also on the Editorial Board for the Springer Journal of Mathematical Imaging and Vision.

More books by this author
Author 'Like for Like' recommendations

Loading other formats...

Book Info

Publication date

4th February 2016


Frank Nielsen

More books by Frank Nielsen
Author 'Like for Like'


Springer International Publishing AG


282 pages





I read new, exciting writers and established authors before publication and there is a great website full of brilliant books and opinions.


It has opened my eyes to different authors and genres. Just log on and try, you will be surprised and not want for reading material again.

Jocelyn Garvey

Love Reading - it not only does what it says on the tin, it does it with Jam on!!

Maz Tucker

The selections are so varied and the reader reviews are so helpful. A fantastic website for those who love books.

Ann Quintilla

Lovereading always comes up with great suggestions and has introduced me to enjoyable books and new authors to discover.

Gaynor Passmore

Lovereading has all the new books and also suggestions for 'similar' authors whilst waiting for your favourites new books.

Carol Peace

I love the honest opinions, recommendations for every genre and every reader, wish lists and Like for Like.

Amrita Dasgupta

I recommend Lovereading because you get honest reviews on a whole range of genres-there's something for everyone. It's the only site I need.

Sian Spinney