Content-Length: 299011 | pFad | http://github.com/topics/gpt-4-5

A5 gpt-4-5 · GitHub Topics · GitHub
Skip to content
#

gpt-4-5

Here are 8 public repositories matching this topic...

Language: All
Filter by language

Multi-Agent Step Race Benchmark: Assessing LLM Collaboration and Deception Under Pressure. A multi-player “step-race” that challenges LLMs to engage in public conversation before secretly picking a move (1, 3, or 5 steps). Whenever two or more players choose the same number, all colliding players fail to advance.

  • Updated Aug 29, 2025

Public Goods Game (PGG) Benchmark: Contribute & Punish is a multi-agent benchmark that tests cooperative and self-interested strategies among Large Language Models (LLMs) in a resource-sharing economic scenario. Our experiment extends the classic PGG with a punishment phase, allowing players to penalize free-riders or retaliate against others.

  • Updated Apr 10, 2025

Improve this page

Add a description, image, and links to the gpt-4-5 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the gpt-4-5 topic, visit your repo's landing page and select "manage topics."

Learn more









ApplySandwichStrip

pFad - (p)hone/(F)rame/(a)nonymizer/(d)eclutterfier!      Saves Data!


--- a PPN by Garber Painting Akron. With Image Size Reduction included!

Fetched URL: http://github.com/topics/gpt-4-5

Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy