Machine Learning & AI
Conference, 45 min
INTERMEDIATE

RamaLama: Making working with AI Models Boring

RamaLama is an open-source tool that simplifies AI model deployment using containers, supporting multiple registries and runtimes. It offers easy commands for chatbots and APIs, generates deployment files for Podman and Kubernetes, and streamlines setup for both local and production environments, making deployment reliable and accessible for developers and administrators.

Cedric Clyburn, Red Hat

When and where

Tuesday, February 10, 09:25-10:10
Room C
Description
Managing and deploying AI models often requires extensive system configuration and complex software dependencies. RamaLama, a new open-source tool, aims to make working with AI models straightforward by using container technology to make the process "boring": predictable, reliable, and easy to manage. RamaLama integrates with container engines like Podman and Docker to deploy AI models within containers, eliminating the need for manual configuration and providing a suitable setup for both CPU and GPU systems.

This talk will introduce RamaLama’s key features, including support for multiple AI model registries (Ollama, Hugging Face, and OCI), simplified commands for running models as chatbots or REST API services, and compatibility with alternative AI runtimes like llama.cpp and vLLM. We’ll explore RamaLama’s distinctive capabilities, such as generating Podman Quadlet files for edge deployments and Kubernetes YAML for scalable deployments, and demonstrate how it lets developers move from local experimentation to production. Join us to learn how RamaLama enables containerized AI model deployment with minimal friction for developers and system administrators alike.
open-source
deployment
container
ramalama
Speakers
Cedric Clyburn

Red Hat

United States of America

Cedric Clyburn (@cedricclyburn), Senior Developer Advocate at Red Hat, is an enthusiastic software technologist with a background in Kubernetes, DevOps, and container tools. He has experience speaking at and organizing conferences and events, including Devoxx, WeAreDevelopers, The Linux Foundation, KCD NYC, and more. Cedric loves all things open source and works to make developers' lives easier. He is based in New York.
