Can ChatGPT Play 'Doom'? Yes—But It's Terrible

Engineers, researchers, and intrepid hobbyists of all sorts have proven that the classic first-person shooter Doom can be played on almost anything, including a lawnmower and even gut bacteria. On Wednesday, Adrian de Wynter, a principal applied scientist at Microsoft, proved that the popular AI chatbot ChatGPT can play Doom—it’s just not very good at it.

Seeing what devices and other contraptions can run Doom has become an increasingly popular pastime for hackers, researchers, and tech enthusiasts. To make Doom work with ChatGPT, de Wynter paired it with OpenAI’s multimodal GPT-4V (Vision) to get the chatbot to play the game.

The results of the Doom/ChatGPT experiment showed that despite the advances in GPT-4 and its vision-enhanced variant, the AI model could not independently run Doom due to limitations in input and image rendering.

“For example, if the model fell into an acid pool, and then got stuck on a wall, it would ‘forget’ that it is taking damage because of the acid,” de Wynter said, “and then get stuck and die.”

Another issue facing de Wynter was the AI model’s habit of hallucinating and making up explanations for its actions, or lying that it completed an action. That left Doom’s Space Marine at the mercy of rampaging monsters.

GPT-4, de Wynter explained, managed to get to the last room in the game… but only once. Doom’s simplicity, he said, makes it easy to work with due to its portability, and its open-source nature allows for better benchmarks by which to measure intelligent agents because Doom requires heavy reasoning capabilities—like planning in the heat of the moment.

“It’s interesting!” de Wynter told Decrypt’s GG. “It did originate mostly as a meme (‘Can my toaster run Doom?’) due to its portability and open-source code. That’s mostly why it stays as the game of choice.”

De Wynter emphasized that the project was done solely in his capacity as a researcher at the University of York, and is not related at all to his work with Microsoft.

“Debugging took a lot of time. I normally dumped the frames and just went over them to make sure nothing was breaking,” he said, noting constant issues, including the model trying to get out of the map through the window. “Eventually I gave up and turned the frames into GIFs.”

De Wynter’s project is just the latest in a series of experiments that aim to play Doom in unusual places.

Last year, after the launch of the Ordinals protocol, a stripped-down version of Doom was inscribed on the Bitcoin blockchain as Inscription 466. Earlier this year, a similar project added a full-fledged version of Doom to the Dogecoin blockchain.

While this AI attempt at playing Doom may be a one-off, de Wynter said he has ideas for future gaming experiments using large language models (LLMs).

“My main research interest is related to LLM reasoning and planning capabilities, so games, in general, are an excellent testbench for this,” he said. “Strategy games are a bit off the table at the moment, but I’m wondering whether simpler games (or other models) could yield better results.”

Edited by Andrew Hayward

Stay on top of crypto news, get daily updates in your inbox.

Source link

Tags: Bitcoin

#	Name	Price	24H %
1	Bitcoin(BTC)	$0.00	4.61%
2	Ethereum(ETH)	$0.00	7.62%
4	Binance Coin(BNB)	$0.00	8.31%
6	XRP(XRP)	$0.00	4.85%
12	Dogecoin(DOGE)	$0.00	3.53%
7	Cardano(ADA)	$0.00	5.51%
25	Bitcoin Cash(BCH)	$0.00	5.08%
20	Litecoin(LTC)	$0.00	5.87%
22	Chainlink(LINK)	$0.00	7.51%
31	Stellar(XLM)	$0.00	4.31%

Stay on top of crypto news, get daily updates in your inbox.

Tornado Cash Dev Wants Charges Dropped Because Sanctions Deemed Illegal

Metaplanet Stock Tanks After It Converts Bitcoin Treasury to New Business Line

Binance Faces Cease-and-Desist Over Its Listing of PNUT Meme Coin

OpenAI Whistleblower Found Dead in San Francisco Apartment in Apparent Suicide

Donald Trump Considering a16z Crypto Policy Head for CFTC Role: Bloomberg

Altcoins XRP, Tron and Cardano Cool Off as Market Reacts to Bitcoin Plunge

You may have missed

Bullish pattern points to a Stacks (STX) recovery as this memecoin steals the show

Enron Returns With Countdown Teasing Token Launch… Or Is It?

Combining Real Utility With Profit Opportunities

Ripple’s Legal Chief Urges Congress to Regulate Crypto Practices, Not Technology

Sitemap

Legal Information

Pin It on Pinterest

Stay on top of crypto news, get daily updates in your inbox.

More Stories

Leave a Reply Cancel reply

You may have missed

Sitemap

Legal Information

Categories

Pin It on Pinterest