Multi-Modal Querying

From Embeddings to Production

Virtual Voice Sample

Failed to add items

Sorry, we are unable to add the item because your shopping cart is already at capacity.

Add to Cart failed.

Please try again later

Add to Wish List failed.

Please try again later

Remove from wishlist failed.

Please try again later

Adding to library failed

Please try again

Follow podcast failed

Please try again

Unfollow podcast failed

Please try again

Audible Standard 30-day free trial

Try Standard free

Select 1 audiobook a month from our entire collection of titles.

Yours as long as you’re a member.

Get unlimited access to bingeable podcasts.

Standard auto renews for $8.99 a month after 30 days. Cancel anytime.

Multi-Modal Querying

By: Ajit Singh

Narrated by: Virtual Voice

Try Standard free

$8.99 a month after 30 days. Cancel anytime.

Buy for $6.50

This title uses virtual voice narration

Virtual voice is computer-generated narration for audiobooks.

"Multi-Modal Querying : From Embeddings to Production" is a comprehensive, practical, and forward-looking guide designed to demystify the exciting world of multi-modal artificial intelligence. It serves as both a textbook for students and a handbook for practitioners, providing a structured pathway from foundational concepts to the deployment of real-world, industrial-strength applications.

Philosophy:

The core philosophy of this book is empowerment. My goal is to empower you to move beyond being a mere user of AI models and become a creator of intelligent systems. I believe that the ability to query and reason across different data types (text, images, audio, etc.) is a fundamental skill for the next generation of software and AI engineers. The title itself, "From Embeddings to Production," encapsulates my philosophy: I covered the full lifecycle, from the atomic unit of multi-modal understanding (the embedding) to the complexities of deploying a robust, scalable service.

Key Features

1. Foundational Clarity: Chapter 1 establishes a rock-solid foundation, defining all key terms, architectures, and components, making the book accessible even to beginners.
2. Hands-On Code and Examples: Rich with practical, executable Python code using popular and industry-standard libraries like PyTorch, Hugging Face, Faiss, and more.
3. Vector Database Deep Dive: Dedicated chapters on the critical infrastructure of multi-modal systems—vector databases—exploring their architecture, use cases, and leading open-source and managed solutions.
4. Production and Deployment Focus: Goes beyond model training to cover crucial "day two" problems: creating APIs, containerization with Docker, scaling, monitoring, and CI/CD for AI systems.

To Whom This Book Is For

This book is written for a diverse audience with a shared passion for building the future of technology:

1. B.Tech/M.Tech Computer Science Students: Serves as a primary textbook for courses on AI, Machine Learning, Deep Learning, or specialized electives on multi-modal systems. It is fully compliant with modern, skill-oriented syllabi.
2. AI/ML Practitioners and Data Scientists: A perfect resource for professionals looking to expand their skill set from unimodal to multi-modal applications and understand the engineering challenges involved.
3. Software Engineers and Architects: Provides a clear guide for developers who need to integrate multi-modal search capabilities into their applications and design robust, scalable backend systems.
4. Researchers and Academics: Offers a structured and practical overview of the field, serving as a valuable reference for the implementation and engineering aspects of multi-modal research.

Ajit SinghVirtual VoiceIndependently Published

Computer Science

Technology

Programming

Software

Computer Science Technology Programming Software

No reviews yet