[QA] Are Reasoning Models More Prone to Hallucination?
This paper investigates hallucination in large reasoning models, analyzing post-training effects, cognitive behaviors, and model uncertainty, revealing insights into their impact on factual accuracy.https://arxiv.org/abs//2505.23646YouTube: https://www.youtube.com/@ArxivPapersTikTok: https://www.tiktok.com/@arxiv_papersApple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
--------
7:52
Are Reasoning Models More Prone to Hallucination?
This paper investigates hallucination in large reasoning models, analyzing post-training effects, cognitive behaviors, and model uncertainty, revealing insights into their impact on factual accuracy.https://arxiv.org/abs//2505.23646YouTube: https://www.youtube.com/@ArxivPapersTikTok: https://www.tiktok.com/@arxiv_papersApple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
--------
20:24
[QA] How does Transformer Learn Implicit Reasoning?
This paper explores implicit multi-hop reasoning in large language models, revealing a developmental trajectory and introducing diagnostic tools to enhance interpretability and understanding of reasoning processes.https://arxiv.org/abs//2505.23653YouTube: https://www.youtube.com/@ArxivPapersTikTok: https://www.tiktok.com/@arxiv_papersApple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
--------
8:56
How does Transformer Learn Implicit Reasoning?
This paper explores implicit multi-hop reasoning in large language models, revealing a developmental trajectory and introducing diagnostic tools to enhance interpretability and understanding of reasoning processes.https://arxiv.org/abs//2505.23653YouTube: https://www.youtube.com/@ArxivPapersTikTok: https://www.tiktok.com/@arxiv_papersApple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
--------
23:21
[QA] Let Me Think! A Long Chain-of-Thought Can Be Worth Exponentially Many Short Ones
This paper explores optimal inference-time computation for large language models, revealing scenarios where sequential scaling significantly outperforms parallel scaling, particularly in graph connectivity problems.https://arxiv.org/abs//2505.21825YouTube: https://www.youtube.com/@ArxivPapersTikTok: https://www.tiktok.com/@arxiv_papersApple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
Running out of time to catch up with new arXiv papers? We take the most impactful papers and present them as convenient podcasts. If you're a visual learner, we offer these papers in an engaging video format. Our service fills the gap between overly brief paper summaries and time-consuming full paper reads. You gain academic insights in a time-efficient, digestible format. Code behind this work: https://github.com/imelnyk/ArxivPapers