Archives
- 03 Jan PokerGPT
- 11 Dec Mathematical Language Models
- 15 Nov Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs
- 13 Nov LLMs cannot find reasoning errors, but can correct them
- 12 Nov Assessing Logical Puzzle Solving in Large Language Models
- 09 Nov Language Models can be Logical Solvers
- 06 Nov Everything of Thoughts
- 22 Oct Unleashing the potential of prompt engineering in Large Language Models
- 22 Oct LLM-Based Agent Society Investigation
- 22 Oct LINC
- 17 Oct Eliminating Reasoning via Inferring with Planning
- 14 Oct ACES
- 12 Oct GLoRE
- 12 Oct A Systematic Evaluation of Large Language Models on Out-of-Distribution Logical Reasoning Tasks
- 07 Oct Towards Better Chain-of-Thought Prompting Strategies
- 02 Oct Large Language Models Cannot Self-Correct Reasoning Yet
- 01 Oct Towards LogiGLUE
- 01 Oct Avalons Game of Thoughts
- 30 Sep BRAINTEASER
- 26 Sep A Survey of Chain of Thought Reasoning
- 12 Sep PaLM 2 Technical Report
- 08 Sep Exploring Large Language Models for Communication Games
- 22 Aug Are ChatGPT and GPT-4 Good Poker Players
- 20 Aug LatEval
- 17 Aug Graph of Thoughts
- 15 Aug Boosting Logical Reasoning in Large Language Models through a New Framework
- 05 Aug Automatically Correcting Large Language Models
- 28 Jul Language Models Can Teach Themselves to Program Better
- 14 Jul Leveraging Large Language Models to Generate Answer Set Programs
- 10 Jul Go Beyond The Obvious
- 20 Jun Solving and Generating NPR Sunday Puzzles with Large Language Models
- 15 Jun Are Large Language Models Really Good Logical Reasoners
- 14 Jun ChessGPT
- 12 Jun BoardgameQA
- 23 May Testing the General Deductive Reasoning Capacity of Large Language Models Using OOD Examples
- 20 May Logical Reasoning over Natural Language as Knowledge Representation
- 19 May Logic-LM
- 16 May Tree of Thoughts
- 14 May Large Language Model Guided Tree-of-Thought
- 14 May GPT-4 Technical Report
- 05 May Plan-and-Solve Prompting
- 06 Apr Evaluating the Logical Reasoning Ability of ChatGPT and GPT-4
- 29 Mar Self-Refine
- 25 Mar Natural Language Reasoning, A Survey
- 08 Mar Large Language Models
- 07 Feb A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity
- 30 Dec A Survey on In-context Learning
- 19 Dec True Detective
- 19 Dec Towards Reasoning in Large Language Models
- 18 Dec Reasoning with Language Model Prompting
- 18 Dec Large Language Models are Better Reasoners with Self-Verification
- 21 Nov Program of Thoughts Prompting
- 17 Nov PAL
- 02 Nov Large Language Models Are Human-Level Prompt Engineers
- 06 Oct Automatic Chain of Thought Prompting in Large Language Models
- 02 Oct Complexity-Based Prompting for Multi-Step Reasoning
- 08 Jul Jigsaw puzzle solving techniques and applications
- 27 Jun CC-Riddle
- 23 May Large Language Models are Zero-Shot Reasoners
- 20 May Self-Consistency Improves Chain of Thought Reasoning in Language Models
- 19 May Down and Across
- 18 May Selection-Inference
- 27 Jan Chain of Thought Prompting Elicits Reasoning in Large Language Models
- 09 Dec A Puzzle-Based Dataset for Natural Language Inference
- 22 Sep BiRdQA
- 06 Sep Puzzle Solving without Search or Human Knowledge
- 27 Jul Pre-train, Prompt, and Predict
- 06 Jul Evaluating Large Language Models Trained on Code
- 09 Jun Programming Puzzles
- 16 Apr Decrypting Cryptic Crosswords
- 28 Feb Cryptonite
- 05 Jan Did Aristotle Use a Laptop
- 01 Jan RiddleSense
- 25 Nov PIQA
- 28 Oct BART
- 22 Oct Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
- 25 Sep ALBERT
- 25 Jul RoBERTa
- 09 Jan Language Models are Unsupervised Multitask Learners
- 10 Oct BERT
- 01 Oct CommonsenseQA
- 31 Dec Template
- 31 Dec Getting Started