2023-11-22 cs: "Revolutionary Tech Unleashed: Monocular Video Perception to Multi-Modal AI - Discover the Future of Visual Understanding and Language Models!"
Arxiv Podcast GPT Computer Science
November 22, 2023
Audio is streamed directly from the publisher (arxivpodcastgpt.github.io) as published in their RSS feed. Play Podcasts does not host this file. Rights-holders can request removal through the copyright & takedown page.
Show Notes
ChatGPT generated podcast using model=gpt-4-1106-preview for https://arxiv.org/abs/2311.12796 Title Physics guided Shape from Template Monocular Video Perception through Neural Surrogate Models
ChatGPT generated podcast using model=gpt-4-1106-preview for https://arxiv.org/abs/2311.12793 Title ShareGPT4V Improving Large Multi Modal Models with Better Captions
ChatGPT generated podcast using model=gpt-4-1106-preview for https://arxiv.org/abs/2311.12792 Title Intrinsic Image Decomposition via Ordinal Shading
ChatGPT generated podcast using model=gpt-4-1106-preview for https://arxiv.org/abs/2311.12786 Title Mechanistically analyzing the effects of fine tuning on procedurally defined tasks
ChatGPT generated podcast using model=gpt-4-1106-preview for https://arxiv.org/abs/2311.12785 Title Prompting Frameworks for Large Language Models A Survey