#reasoning
共 8 个条目
讲座 (3)
论文 (5)
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models Self-Consistency Improves Chain of Thought Reasoning in Language Models Scaling LLM Test-Time Compute Optimally Can be More Effective than Scaling Model Parameters DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning ReAct: Synergizing Reasoning and Acting in Language Models