1 option

What's new in AI : multimodal AI with Purvanshi Mehta.

O'Reilly Online Learning: Academic/Public Library Edition Available online

Format:: Video
Contributor:: Anadiotis, George, host.; Mehta, Purvanshi, presenter.; O'Reilly (Firm), publisher.
Language:: English
Subjects (All):: Artificial intelligence.
Physical Description:: 1 online resource (1 video file (48 min.)) : sound, color.
Edition:: [First edition].
Place of Publication:: [Sebastopol, California] : O'Reilly Media, Inc., [2024]
Summary:: Join host George Anadiotis and guest Purvanshi Mehta, cofounder of Lica World, for a discussion about multimodal AI and its applications. Trained on various types of data from text to images to audio and video, multimodal AI models are expanding the possibilities for the kinds of AI applications we can build. New large AI models such as GPT-4, Gemini, and Claude 3 are all general-purpose multimodal foundational models. More specialized multimodal AI models, such as OpenAI’s yet-to-be-released Sora, which generates video from text, or Suno AI, which generates songs from text, are fueling the imagination with ways we might leverage AI to automate and augment tasks in robotics, entertainment, healthcare, manufacturing, and other industries. George and Purvanshi discuss where this technology stands and share their thoughts on where the field is headed.
Notes:: OCLC-licensed vendor bibliographic record.
OCLC:: 1439049340
Publisher Number:: 0642572033507

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

1 option

What's new in AI : multimodal AI with Purvanshi Mehta.

My Account

Guides