Can statistical rarity measure whether stories are truly original?
Can we operationalize originality as statistical rarity in narrative feature space? This matters because copyright law requires measuring human creative control, but rarity is relative, context-dependent, and doesn't guarantee quality or authorship.
As AI seeps into writing, the question of what counts as original work shifts from how a story is written to how it is conceived. StoryScope proposes a concrete operationalization: represent each story as a vector of discourse-level narrative features and treat statistical rarity in that space as a proxy for originality. Less common combinations of narrative decisions reflect the broader notion of originality invoked by creativity research (Torrance) and by copyright law, which requires a minimal degree of originality and, per recent U.S. Copyright Office guidance, sufficient human creative control. The empirical hook: human stories are, on average, rarer in narrative feature space, while the five AI models occupy a tight, well-separated cluster.
This is appealing because it converts a contested legal-aesthetic concept into something measurable and model-agnostic. Rarity does not depend on surface style (which survives the humanization edit) and aligns with the intuition that originality is about making uncommon choices, not novel word combinations. It also gives the copyright question an operational handle: a work's position in narrative-decision space could index how much distinctive human conception it carries.
Why it stays a question: rarity-as-originality is a proxy with sharp limits. Rarity is defined relative to a reference distribution, so it drifts as both human and AI writing change — and rare is not the same as good or protectable; an incoherent story can be statistically rare. Conflating "uncommon in feature space" with "originally authored by a human" risks both false positives (idiosyncratic AI output) and false negatives (a human writing in a popular convention). The construct is a useful, falsifiable starting point for measuring conception rather than execution — but whether it should bear legal or evaluative weight is exactly what it leaves open.
Inquiring lines that use this note as a source 6
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- Why does statistical compression destroy literary connotation and meaning?
- How should authorship and originality law attach to discourse structure versus surface style?
- Can rarity in feature space distinguish human authorship from AI output reliably?
- How do changes in human and AI writing distributions shift rarity measures over time?
- Does statistical rarity actually correlate with originality that law should protect?
- What makes creative writing diversity different from code diversity fundamentally?
Related concepts in this collection 4
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
Can AI stories be detected without analyzing writing style?
Explores whether discourse-level narrative structures like character agency and plot organization reveal AI authorship independently of surface stylistic cues, and whether such structural features resist the kind of fine-tuning that defeats style-based detection.
the same narrative feature space that enables detection is repurposed as a measure of originality
-
Do AI stories explain their themes more than human stories do?
Explores whether AI-generated fiction tends to spell out moral meanings rather than leaving them implicit, and whether this reflects deeper differences in how machines construct narrative versus how humans do.
explains *why* human stories land in rarer regions — ambiguity and temporal complexity are less common choices
-
Why do LLMs generate novel ideas from narrow ranges?
LLM research agents produce individually novel ideas but cluster them in homogeneous sets. This explores why high average novelty coexists with poor diversity coverage and what it means for automated ideation.
a parallel measurement problem — distinguishing genuine originality from average novelty in a feature space of ideas
-
Can humans detect AI text if machines can measure it?
AI-generated text shows measurable differences from human writing across multiple linguistic dimensions, yet human judges consistently fail to identify it. Why does the gap between what is measurable and what is perceptible exist?
extends the construct's premise: AI output is statistically separable in feature space even when humans cannot perceive the difference, the same gap that lets narrative-feature rarity index conception
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- StoryScope: Investigating idiosyncrasies in AI fiction
- Has the Creativity of Large-Language Models peaked? —an analysis of inter- and intra-LLM variability —
- TaleStream: Supporting Story Ideation with Trope Knowledge
- NoveltyBench: Evaluating Language Models for Humanlike Diversity
- The Method of Critical AI Studies, A Propaedeutic
- The Alien Space of Science: Sampling Coherent but Cognitively Unavailable Research Directions
- Questioning Representational Optimism in Deep Learning: The Fractured Entangled Representation Hypothesis
- We Are All Creators: Generative AI, Collective Knowledge, and the Path Towards Human-AI Synergy
Original note title
originality can be operationalized as statistical rarity in a feature space of narrative decisions