Go Beyond The Obvious
📙Paper: “Go Beyond The Obvious: Probing the gap of INFORMAL reasoning ability between Humanity and LLMs by Detective Reasoning Puzzle Benchmark” 🔑Public: ✅ ⚲ Area: Commonsense, Dataset 📅 D...
📙Paper: “Go Beyond The Obvious: Probing the gap of INFORMAL reasoning ability between Humanity and LLMs by Detective Reasoning Puzzle Benchmark” 🔑Public: ✅ ⚲ Area: Commonsense, Dataset 📅 D...
📙Paper: “Solving and Generating NPR Sunday Puzzles with Large Language Models” 🔑Public: ✅ ⚲ Area: Dataset, Riddles 📅 Date: 2023-06-21 🔎 Paper Section: dataset / rule-less / riddles 📝 R...
📙Paper: “Are Large Language Models Really Good Logical Reasoners? A Comprehensive Evaluation and Beyond” 🔑Public: ✅ ⚲ Area: Logical reasoning, Reasoning evaluation 📅 Date: 2023-06-16 🔎 P...
📙Paper: “ChessGPT: Bridging Policy Learning and Language Modeling” 🔑Public: ✅ ⚲ Area: Dataset, Deterministic 📅 Date: 2023-06-15 🔎 Paper Section: dataset / rule-based / deterministic 📝 ...
📙Paper: “BoardgameQA: A Dataset for Natural Language Reasoning with Contradictory Information” 🔑Public: ✅ ⚲ Area: Dataset, Deterministic 📅 Date: 2023-06-13 🔎 Paper Section: dataset / rul...
📙Paper: “Testing the General Deductive Reasoning Capacity of Large Language Models Using OOD Examples” 🔑Public: ✅ ⚲ Area: Logical reasoning, Reasoning evaluation 📅 Date: 2023-05-24 🔎 Pap...
📙Paper: “Logical Reasoning over Natural Language as Knowledge Representation: A Survey” 🔑Public: ✅ ⚲ Area: Logical reasoning 📅 Date: 2023-05-21 🔎 Paper Section: Introduction 📝 Referenc...
📙Paper: “Logic-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning” 🔑Public: ✅ ⚲ Area: Neurosymbolic 📅 Date: 2023-05-20 🔎 Paper Section: methods / p...
📙Paper: “Tree of Thoughts: Deliberate Problem Solving with Large Language Models” 🔑Public: ✅ ⚲ Area: Prompting, Tree-of-thought 📅 Date: 2023-05-17 🔎 Paper Section: methods / advanced / t...
📙Paper: “Large Language Model Guided Tree-of-Thought” 🔑Public: ✅ ⚲ Area: Prompting, Tree-of-thought 📅 Date: 2023-05-15 🔎 Paper Section: methods / advanced / tot 📝 References: 38