Data & AIData & AI
Quickie15min
INTERMEDIATE

An LLM Walks into General Relativity

This talk presents an experiment where an LLM generates a full presentation on General Relativity. The output is fluent but scientifically flawed, revealing how AI excels in structure yet fails in physics reasoning. Using this case, the talk explores validation methods to ensure reliability in AI‑generated technical content.

Tasos Nikolaou
Tasos NikolaouUp Hellas

talkDetail.whenAndWhere

Friday, April 24, 12:45-13:00
MC 2
talks.roomOccupancytalks.noOccupancyInfo
talks.description
Large Language Models are increasingly used to generate technical content: documentation, reports, and even conference presentations. The results are often fluent, confident, and well-structure, which makes their mistakes harder to spot.
In this talk, we run a simple experiment, an LLM is asked to generate an entire presentation on General Relativity, covering gravitational time dilation, gravitational waves, and black holes, using real scientific sources. The output looks convincing. It has equations, misconceptions, and citations. And yet, several explanations are subtly but fundamentally wrong.
General Relativity is an unforgiving domain. Concepts that sound intuitive, like “light slows down in gravity”, “gravitational waves are ripples in space”, “black holes suck everything in”, fail as soon as you frame them in terms of measurements, observables, and invariants. This makes physics an ideal stress test for AI-generated explanations.
Using the generated slides as a case study, we show:
  • where LLMs consistently succeed (structure, narrative, pedagogy),
  • where they fail (measurement-based reasoning and physical constraints),
  • and how to design agent pipelines that combine AI generation with deterministic validation and human review.
Although physics is the example, the lessons generalize to any technical domain where correctness matters. The goal is not to reject AI-generated content, but to understand how to use it responsibly, and how to catch confident explanations that are wrong.
physics
correctness
validation
ai
talks.speakers
Tasos Nikolaou

Tasos Nikolaou

Up Hellas

Greece

Tasos is a software engineer with a background in physics and machine learning, working on backend systems, software architecture, and microservices, with a strong interest in the practical limits of AI-generated solutions.

talkDetail.rateThisTalk

talkDetail.poortalkDetail.excellent

talkDetail.ratingNotYetAvailable

talkDetail.ratingAvailableWhenStarted

talkDetail.signInRequired

talkDetail.signInToRateDescription

occupancy.title

occupancy.votingNotYetAvailable

occupancy.votingAvailableBeforeStart

talkDetail.signInRequired

occupancy.signInToVoteDescription

comments.title

comments.speakerNotEnabledComments