9725 Hands-on with Gemini: Interacting with multimodal AI
페이지 정보
본문
Gemini is our natively multimodal AI model capable of reasoning across text, images, audio, video and code. This video highlights some of our favorite interactions with Gemini. Learn more and try the model: https://deepmind.google/gemini
Explore our prompting approaches here: https://goo.gle/how-its-made-gemini
For the purposes of this demo, latency has been reduced and Gemini outputs have been shortened for brevity.
Subscribe to our Channel: https://www.youtube.com/google
Tweet with us on Twitter: https://twitter.com/google
Follow us on Instagram: https://www.instagram.com/google
Join us on Facebook: https://www.facebook.com/Google
0:00 Intro
0:19 Multimodal Dialogue
1:32 Multilinguality
2:04 Game Creation
2:31 Visual Puzzles
3:17 Making Connections
3:39 Image & Text Generation
4:06 Logic & Spatial Reasoning
4:55 Translating Visuals
5:27 Cultural Understanding
Explore our prompting approaches here: https://goo.gle/how-its-made-gemini
For the purposes of this demo, latency has been reduced and Gemini outputs have been shortened for brevity.
Subscribe to our Channel: https://www.youtube.com/google
Tweet with us on Twitter: https://twitter.com/google
Follow us on Instagram: https://www.instagram.com/google
Join us on Facebook: https://www.facebook.com/Google
0:00 Intro
0:19 Multimodal Dialogue
1:32 Multilinguality
2:04 Game Creation
2:31 Visual Puzzles
3:17 Making Connections
3:39 Image & Text Generation
4:06 Logic & Spatial Reasoning
4:55 Translating Visuals
5:27 Cultural Understanding
추천
0
비추천
0
관련링크
댓글목록
등록된 댓글이 없습니다.