Archives

2024

03 Jan PokerGPT

2023

11 Dec Mathematical Language Models
15 Nov Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs
13 Nov LLMs cannot find reasoning errors, but can correct them
12 Nov Assessing Logical Puzzle Solving in Large Language Models
09 Nov Language Models can be Logical Solvers
06 Nov Everything of Thoughts
22 Oct Unleashing the potential of prompt engineering in Large Language Models
22 Oct LLM-Based Agent Society Investigation
22 Oct LINC
17 Oct Eliminating Reasoning via Inferring with Planning
14 Oct ACES
12 Oct GLoRE
12 Oct A Systematic Evaluation of Large Language Models on Out-of-Distribution Logical Reasoning Tasks
07 Oct Towards Better Chain-of-Thought Prompting Strategies
02 Oct Large Language Models Cannot Self-Correct Reasoning Yet
01 Oct Towards LogiGLUE
01 Oct Avalons Game of Thoughts
30 Sep BRAINTEASER
26 Sep A Survey of Chain of Thought Reasoning
12 Sep PaLM 2 Technical Report
08 Sep Exploring Large Language Models for Communication Games
22 Aug Are ChatGPT and GPT-4 Good Poker Players
20 Aug LatEval
17 Aug Graph of Thoughts
15 Aug Boosting Logical Reasoning in Large Language Models through a New Framework
05 Aug Automatically Correcting Large Language Models
28 Jul Language Models Can Teach Themselves to Program Better
14 Jul Leveraging Large Language Models to Generate Answer Set Programs
10 Jul Go Beyond The Obvious
20 Jun Solving and Generating NPR Sunday Puzzles with Large Language Models
15 Jun Are Large Language Models Really Good Logical Reasoners
14 Jun ChessGPT
12 Jun BoardgameQA
23 May Testing the General Deductive Reasoning Capacity of Large Language Models Using OOD Examples
20 May Logical Reasoning over Natural Language as Knowledge Representation
19 May Logic-LM
16 May Tree of Thoughts
14 May Large Language Model Guided Tree-of-Thought
14 May GPT-4 Technical Report
05 May Plan-and-Solve Prompting
06 Apr Evaluating the Logical Reasoning Ability of ChatGPT and GPT-4
29 Mar Self-Refine
25 Mar Natural Language Reasoning, A Survey
08 Mar Large Language Models
07 Feb A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity

2022

30 Dec A Survey on In-context Learning
19 Dec True Detective
19 Dec Towards Reasoning in Large Language Models
18 Dec Reasoning with Language Model Prompting
18 Dec Large Language Models are Better Reasoners with Self-Verification
21 Nov Program of Thoughts Prompting
17 Nov PAL
02 Nov Large Language Models Are Human-Level Prompt Engineers
06 Oct Automatic Chain of Thought Prompting in Large Language Models
02 Oct Complexity-Based Prompting for Multi-Step Reasoning
08 Jul Jigsaw puzzle solving techniques and applications
27 Jun CC-Riddle
23 May Large Language Models are Zero-Shot Reasoners
20 May Self-Consistency Improves Chain of Thought Reasoning in Language Models
19 May Down and Across
18 May Selection-Inference
27 Jan Chain of Thought Prompting Elicits Reasoning in Large Language Models

2021

09 Dec A Puzzle-Based Dataset for Natural Language Inference
22 Sep BiRdQA
06 Sep Puzzle Solving without Search or Human Knowledge
27 Jul Pre-train, Prompt, and Predict
06 Jul Evaluating Large Language Models Trained on Code
09 Jun Programming Puzzles
16 Apr Decrypting Cryptic Crosswords
28 Feb Cryptonite
05 Jan Did Aristotle Use a Laptop
01 Jan RiddleSense

2020

27 May Language Models are Few-Shot Learners

2019

25 Nov PIQA
28 Oct BART
22 Oct Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
25 Sep ALBERT
25 Jul RoBERTa
09 Jan Language Models are Unsupervised Multitask Learners

2018

10 Oct BERT
01 Oct CommonsenseQA

2000

31 Mar Abductive and inductive reasoning

1999

31 Dec Template
31 Dec Getting Started

Trending Tags

dataset prompting fine-tuning rule-less rule-based model survey logical reasoning commonsense stochastic