PuzzleLLMs

A Survey on Puzzle Solving using Reasoning of Large Language Models

HOME
CATEGORIES
TAGS
ARCHIVES
ABOUT

Home Tags reasoning evaluation

Tag

reasoning evaluation 5

A Systematic Evaluation of Large Language Models on Out-of-Distribution Logical Reasoning Tasks Oct 12, 2023
Are Large Language Models Really Good Logical Reasoners Jun 15, 2023
Testing the General Deductive Reasoning Capacity of Large Language Models Using OOD Examples May 23, 2023
Evaluating the Logical Reasoning Ability of ChatGPT and GPT-4 Apr 6, 2023
A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity Feb 7, 2023

Recently Updated

Getting Started
Template
LINC
Abductive and inductive reasoning
LLM-Based Agent Society Investigation

Trending Tags

dataset prompting fine-tuning rule-less rule-based model survey logical reasoning commonsense stochastic

© 2024 . Some rights reserved.

Using the Chirpy theme for Jekyll.

Trending Tags

dataset prompting fine-tuning rule-less rule-based model survey logical reasoning commonsense stochastic

A new version of content is available.