Tous nos rayons

Déjà client ? Identifiez-vous

Mot de passe oublié ?

Nouveau client ?

CRÉER VOTRE COMPTE
Scaling Machine Learning with Spark: Distributed ML with MLlib, TensorFlow, and PyTorch
Ajouter à une liste

Librairie Eyrolles - Paris 5e
Indisponible

Scaling Machine Learning with Spark: Distributed ML with MLlib, TensorFlow, and PyTorch

Scaling Machine Learning with Spark: Distributed ML with MLlib, TensorFlow, and PyTorch

Adi Polak

400 pages, parution le 27/02/2023

Résumé

Get up to speed on Apache Spark, the popular engine for large-scale data processing, including machine learning and analytics. If you're looking to expand your skill set or advance your career in scalable machine learning with MLlib, distributed PyTorch, and distributed TensorFlow, this practical guide is for you.Get up to speed on Apache Spark, the popular engine for large-scale data processing, including machine learning and analytics. If you're looking to expand your skill set or advance your career in scalable machine learning with MLlib, distributed PyTorch, and distributed TensorFlow, this practical guide is for you. Using Spark as your main data processing platform, you'll discover several open source technologies designed and built for enriching Spark's ML capabilities. Scaling Machine Learning with Spark examines various technologies for building end-to-end distributed ML workflows based on the Apache Spark ecosystem with Spark MLlib, MLFlow, TensorFlow, PyTorch, and Petastorm. This book shows you when to use each technology and why. If you're a data scientist working with machine learning, you'll learn how to: Build practical distributed machine learning workflows, including feature engineering and data formats Extend deep learning functionalities beyond Spark by bridging into distributed TensorFlow and PyTorch Manage your machine learning experiment lifecycle with MLFlow Use Petastorm as a storage layer for bridging data from Spark into TensorFlow and PyTorch Use machine learning terminology to understand distribution strategiesAs Vice President of Developer Experience at Treeverse, Adi Polak shapes the future of data & ML technologies for hands-on builders. She also contributes to the lakeFS open-source, a git-like interface for object stores. In her work, Adi brings her vast industry research and engineering experience to bear in educating and helping teams design, architect, and build cost-effective data systems and machine learning pipelines that emphasize scalability, expertise, and business goals. Adi is a frequent worldwide presenter and the author of O'Reilly's upcoming book, "Machine Learning With Apache Spark." She is continually an invited member of multiple program committees and advisor for conferences like Data & AI Summit, Scale by the Bay, and others. Previously, Adi was a senior manager for Azure at Microsoft, where she focused on building advanced analytics systems and modern architectures. When Adi isn't building data pipelines or thinking up new software architecture, you can find her on the local cultural scene or at the beach.

Caractéristiques techniques

  PAPIER
Éditeur(s) O'Reilly
Auteur(s) Adi Polak
Parution 27/02/2023
Nb. de pages 400
EAN13 9781098106829

Avantages Eyrolles.com

Livraison à partir de 0,01 en France métropolitaine
Paiement en ligne SÉCURISÉ
Livraison dans le monde
Retour sous 15 jours
+ d'un million et demi de livres disponibles
satisfait ou remboursé
Satisfait ou remboursé
Paiement sécurisé
modes de paiement
Paiement à l'expédition
partout dans le monde
Livraison partout dans le monde
Service clients sav@commande.eyrolles.com
librairie française
Librairie française depuis 1925
Recevez nos newsletters
Vous serez régulièrement informé(e) de toutes nos actualités.
Inscription