SPECIAL: GPT-X, Diffusion, and our Multimodal Future Part II (DALL-E in the future)
What could an AI creative tool like DALL-E 2 or Midjourney look like in the next few years? This video explores the full set of future possibilities upon us!
GPT-X, Diffusion, and our Multimodal Future (Part II) BOOK:
https://docs.google.com/document/d/1TeRMqOsGX8kiK_mNiCt4yxO26Yjk6BdM/
Timestamps:
00:00 - Intro
05:02 - Multimodal Model Characteristics
11:26 - Creating a new image
15:56 - Prompt writing capabilities - Prompt Formatting
16:50 - Prompt autocompletion, intelligent suggestions
19:57 - Built-in prompt unbundling
21:57 - Configure Recombinants
23:45 - Recombinant blending modes
25:07 - Configure Recombinant Unbundling Details
26:12 - Recombinant Layers & Node based
28:05 - Recombinant Import Bench
29:48 - Natural Language Image editing - refresh individual objects
31:44 - Edit mode - Natural Language Edit Prompts
32:37 - Edit prompt realtime feedback
33:05 - Natural language hex color editing support
34:35 - Advanced Natural Language Edit Prompt Changes
35:53 - Edit prompt intensity (UI controls)
36:59 - Multimodal Object Transformations
37:16 - Lighting & Camera Controls
38:00 - Advanced Editing Tools (Fix AI Weirdness and more)
39:27 - Edit images via markup (markup prompt)
40:43 - Logical Variations
43:57 - Realtime Collaboration
44:28 - Recombinant Collaborators
46:06 - Built-in Music Player
48:45 - Creative Hyperparameters
49:20 - Text Capabilities - Magic Text
50:52 - Text Capabilities - Magic Text Fill
52:41 - Latent Variation Scrubbing
53:34 - Explore Alternatives
56:34 - Productivity and commercialization
58:06 - Analysis & Feedback
59:16 - Advanced Multimodal Capabilities
1:03:45 - Offline Capabilities
1:11:11 - Advanced Workflow Support
1:11:58 - Closing thoughts
GPT-X, DALL-E, and our Multimodal Future (Part I)
https://www.youtube.com/playlist?list=PLza3gaByGSXjUCtIuv2x9fwkx3K_3CDmw
Links:
DALL-E 2 unbundling
https://bakztfuture.substack.com/p/dall-e-2-unbundling
DALL-E 2: Recombinant Art & Design
https://bakztfuture.substack.com/p/dall-e-2-recombinant-art-and-design
DALL-E 2 - Unofficial Natural Language Image Editing, Art Critique Survey
https://bakztfuture.substack.com/p/dall-e-2-unofficial-natural-language-b14
Please note I do not represent, have any affiliation with, nor do I speak on behalf of OpenAI.
—
► Remember to Like, Comment, and Subscribe!
—
Connect with me:
Substack Newsletter - https://bakztfuture.substack.com
Twitter - https://www.twitter.com/bakztfuture
Instagram - https://www.instagram.com/bakztfuture
Github - https://www.github.com/bakztfuture
Subscribe to the Multimodal Podcast!
Spotify - https://open.spotify.com/show/7qrWSE7ZxFXYe8uoH8NIFV
Apple Podcasts - https://podcasts.apple.com/us/podcast/multimodal-by-bakz-t-future/id1564576820
Google Podcasts - https://podcasts.google.com/feed/aHR0cHM6Ly9mZWVkLnBvZGJlYW4uY29tL2Jha3p0ZnV0dXJlL2ZlZWQueG1s
Stitcher - https://www.stitcher.com/show/multimodal-by-bakz-t-future
Other Podcast Apps (RSS Link) - https://feed.podbean.com/bakztfuture/feed.xml
Feel free to send me an email or just say hello:
bakztfuture@gmail.com
Video: SPECIAL: GPT-X, Diffusion, and our Multimodal Future Part II (DALL-E in the future), from the Bakz T. Future channel