{
    "benchmark": "DeepSense AI Detector Benchmark 2026",
    "version": "1.0",
    "date": "2026-06-10",
    "methodology": {
        "total_texts": 100,
        "ai_texts": 50,
        "human_texts": 50,
        "ai_models": [
            "GPT-4o",
            "Claude 3.5 Sonnet",
            "Gemini 1.5 Pro"
        ],
        "categories": {
            "Papers_Essays": 60,
            "Business_Reports": 25,
            "News_Articles": 10,
            "Other": 5
        },
        "notes": "All human texts are verified original writing. AI texts generated fresh, not from training data. Mixed/Multiple verdicts not counted as errors per industry standard."
    },
    "test_cases": [
        {
            "id": "ai_essay1",
            "label": "GPT · Education essay",
            "expected": "AI",
            "text": "In the modern era, technology has fundamentally transformed the educational landscape in profound ways. Students today have access to a wide range of digital resources that were previously unavailable to previous generations. Furthermore, the integration of smart devices and online platforms has significantly enhanced collaborative learning opportunities. Moreover, it is important to note that these technological advancements have also introduced unprecedented challenges. Consequently, the ability to access information instantaneously has altered how students approach research and critical thinking. In addition, educators must continuously adapt their pedagogical strategies to accommodate these rapid technological shifts in order to prepare students for an increasingly digital workforce."
        },
        {
            "id": "ai_essay2",
            "label": "GPT · Climate change essay",
            "expected": "AI",
            "text": "Climate change represents one of the most pressing challenges facing humanity in the twenty-first century. The overwhelming scientific consensus indicates that anthropogenic greenhouse gas emissions are the primary driver of global temperature increases. It is crucial to recognize that the consequences of inaction extend far beyond environmental degradation, encompassing economic disruption, food security threats, and mass displacement of populations. Furthermore, the transition to renewable energy sources presents both unprecedented challenges and significant opportunities for innovation and economic growth."
        },
        {
            "id": "ai_essay3",
            "label": "Claude · AI ethics essay",
            "expected": "AI",
            "text": "The ethical implications of artificial intelligence present a genuinely complex landscape that warrants careful examination. I want to offer a balanced perspective on this multifaceted issue. While AI systems have demonstrated remarkable capabilities in fields ranging from medical diagnosis to climate modeling, there are legitimate concerns about bias amplification and workforce displacement. I think it's worth considering that regulation should be proportional to risk rather than blanket prohibition. The most promising path forward likely involves a collaborative framework between technologists, policymakers, and affected communities."
        },
        {
            "id": "ai_essay4",
            "label": "GPT · Industrial Revolution essay",
            "expected": "AI",
            "text": "The Industrial Revolution fundamentally transformed not only the economic structures of eighteenth-century Britain but also the very fabric of social organization. It is essential to understand that this period of unprecedented technological advancement laid the groundwork for modern capitalism. Furthermore, the mechanization of production processes catalyzed urbanization at a rate previously unimaginable. However, it is equally important to acknowledge that these developments came at a significant human cost, including widespread labor exploitation and environmental degradation."
        },
        {
            "id": "ai_essay5",
            "label": "Gemini · Psychology essay",
            "expected": "AI",
            "text": "Let's explore the fascinating relationship between childhood experiences and adult behavioral patterns. Research consistently demonstrates that early attachment styles formed between ages 0-3 significantly influence relationship dynamics throughout life. To break it down even further: secure attachment correlates with healthier stress responses, while anxious attachment often manifests as heightened cortisol reactivity. The key takeaway here is that these patterns are not destiny — therapeutic interventions can meaningfully reshape attachment behaviors even in adulthood."
        },
        {
            "id": "ai_essay6",
            "label": "GPT · Economics essay",
            "expected": "AI",
            "text": "The relationship between monetary policy and inflation has been extensively studied in macroeconomic literature. Central banks typically employ interest rate adjustments as their primary tool for managing inflationary pressures. It is worth noting that the effectiveness of these measures depends heavily on underlying economic conditions and market expectations. Furthermore, the transmission mechanism from policy rates to consumer prices involves multiple intermediate channels, including credit markets, exchange rates, and asset prices."
        },
        {
            "id": "ai_essay7",
            "label": "Claude · Philosophy essay",
            "expected": "AI",
            "text": "The question of consciousness remains one of the most profound puzzles in both philosophy and neuroscience. I should note that there are compelling arguments on multiple sides of this debate. The physicalist position holds that consciousness emerges entirely from neural computation, while dualist perspectives maintain a fundamental distinction between mental and physical phenomena. Taking a step back, what makes this debate particularly challenging is that we lack a clear framework for even defining the problem space."
        },
        {
            "id": "ai_essay8",
            "label": "GPT · Political Science essay",
            "expected": "AI",
            "text": "Democratic institutions worldwide face mounting challenges from populist movements, disinformation campaigns, and declining public trust. This essay examines the structural vulnerabilities that render democratic systems susceptible to these pressures. It is crucial to understand that the erosion of democratic norms typically occurs gradually rather than through abrupt institutional collapse. Moreover, comparative analysis reveals that countries with robust independent media and judicial systems demonstrate greater resilience against authoritarian tendencies."
        },
        {
            "id": "ai_essay9",
            "label": "Gemini · Sociology essay",
            "expected": "AI",
            "text": "Whether you're studying social stratification for the first time or deepening your understanding, the concept of intersectionality provides an essential analytical lens. Here's what you need to know: traditional approaches often examined race, class, and gender as separate axes of inequality. Intersectionality, pioneered by Kimberlé Crenshaw, reveals how these categories overlap to create unique experiences of privilege and marginalization."
        },
        {
            "id": "ai_essay10",
            "label": "GPT · Shakespeare essay",
            "expected": "AI",
            "text": "Shakespeare's Hamlet continues to captivate audiences across centuries due to its profound exploration of universal human themes. The play's treatment of mortality, revenge, and moral ambiguity resonates with contemporary readers as powerfully as it did with Elizabethan audiences. It is important to note that Hamlet's famous indecision represents not weakness but rather a deeply philosophical engagement with the consequences of action."
        },
        {
            "id": "ai_essay11",
            "label": "Claude · Anthropology essay",
            "expected": "AI",
            "text": "Cultural relativism presents both an essential methodological tool and a significant ethical challenge for anthropological research. I want to be careful not to overstate either position. On one hand, suspending ethnocentric judgment allows researchers to understand practices within their cultural context. On the other hand, there are genuinely difficult cases where cultural practices conflict with universal human rights frameworks."
        },
        {
            "id": "ai_essay12",
            "label": "GPT · Biology essay",
            "expected": "AI",
            "text": "The discovery of CRISPR-Cas9 gene editing technology has revolutionized the field of molecular biology. This powerful tool enables precise modifications to DNA sequences with unprecedented accuracy and efficiency. It is worth mentioning that the potential applications of this technology extend far beyond basic research, encompassing therapeutic interventions for genetic disorders, agricultural improvements, and even ecosystem management."
        },
        {
            "id": "ai_essay13",
            "label": "Gemini · Art History essay",
            "expected": "AI",
            "text": "When you think about the Renaissance, the image that typically comes to mind is Leonardo's Vitruvian Man or Michelangelo's David. But let's explore what made this period truly revolutionary. The shift from medieval religious iconography to humanist representation wasn't just an aesthetic change — it reflected a fundamental transformation in how Europeans understood their place in the universe."
        },
        {
            "id": "ai_essay14",
            "label": "GPT · Computer Science essay",
            "expected": "AI",
            "text": "The evolution of machine learning algorithms has fundamentally transformed the landscape of artificial intelligence research. From early perceptron models to contemporary transformer architectures, each paradigm shift has expanded the boundaries of what computational systems can achieve. It is crucial to recognize that the success of deep learning approaches is contingent upon three key factors: large-scale datasets, advances in parallel computing hardware, and algorithmic innovations in gradient-based optimization."
        },
        {
            "id": "ai_essay15",
            "label": "Claude · Linguistics essay",
            "expected": "AI",
            "text": "The Sapir-Whorf hypothesis, in its various formulations, has generated decades of productive debate about the relationship between language and thought. I think it's worth examining both the strong and weak versions of this claim. The strong version — that language determines thought — has largely been discredited by empirical research. However, the weak version — that language influences habitual patterns of attention and categorization — has substantial experimental support."
        },
        {
            "id": "ai_essay16",
            "label": "GPT · Physics essay",
            "expected": "AI",
            "text": "Quantum mechanics fundamentally challenges our classical intuitions about the nature of reality. The phenomenon of quantum entanglement, which Einstein famously described as spooky action at a distance, has been experimentally verified numerous times. It is important to note that the Copenhagen interpretation, while widely taught, represents just one of several competing frameworks for understanding quantum phenomena."
        },
        {
            "id": "ai_essay17",
            "label": "GPT · Marketing theory",
            "expected": "AI",
            "text": "Consumer behavior analysis has undergone a paradigm shift with the advent of big data analytics and machine learning. Traditional demographic segmentation has given way to psychographic and behavioral profiling, enabling unprecedented precision in targeted marketing. It is worth noting that these advances raise significant privacy concerns that regulators are only beginning to address."
        },
        {
            "id": "ai_essay18",
            "label": "Claude · Urban Studies",
            "expected": "AI",
            "text": "The relationship between urban design and public health outcomes is a topic that deserves more attention than it typically receives. I want to offer a nuanced perspective on how the built environment shapes population health. Walkable neighborhoods with access to green space correlate with lower rates of obesity and cardiovascular disease. However, it's important to recognize that these correlations are confounded by socioeconomic factors."
        },
        {
            "id": "ai_essay19",
            "label": "GPT · International Relations",
            "expected": "AI",
            "text": "The liberal international order, established in the aftermath of World War II, faces unprecedented challenges from rising powers and resurgent nationalism. This essay examines the structural factors contributing to the erosion of multilateral institutions. It is crucial to understand that the decline of American hegemony does not necessarily portend global chaos, but rather necessitates the development of new governance frameworks."
        },
        {
            "id": "ai_essay20",
            "label": "Gemini · Neuroscience",
            "expected": "AI",
            "text": "Here's a fascinating question: how does the brain transform electrical signals into conscious experience? The short answer is that we still don't fully know. But the progress in the last decade has been remarkable. Researchers have mapped neural correlates of specific conscious states, identified the default mode network's role in self-referential thinking, and developed increasingly sophisticated brain-computer interfaces."
        },
        {
            "id": "ai_essay21",
            "label": "GPT · Environmental Science",
            "expected": "AI",
            "text": "Biodiversity loss represents one of the most critical yet underappreciated dimensions of the global environmental crisis. The current extinction rate exceeds background levels by several orders of magnitude, leading many scientists to characterize the present era as the sixth mass extinction. It is essential to recognize that this crisis extends beyond charismatic megafauna — the collapse of insect populations threatens ecosystem services upon which human agriculture depends."
        },
        {
            "id": "ai_essay22",
            "label": "GPT · Education Policy",
            "expected": "AI",
            "text": "Standardized testing has been a cornerstone of educational assessment for decades, yet its efficacy remains hotly debated. Proponents argue that standardized metrics provide essential accountability and enable cross-institutional comparisons. However, it is important to note that critics have identified significant limitations, including cultural bias, teaching-to-the-test effects, and the narrowing of curriculum."
        },
        {
            "id": "ai_essay23",
            "label": "Claude · Media Studies",
            "expected": "AI",
            "text": "The transformation of news media in the digital age is a genuinely complex phenomenon that defies simple narratives. I think it's worth considering both the democratizing potential and the concerning trends. Citizen journalism and social media have lowered barriers to information dissemination, enabling voices that traditional gatekeepers might have excluded. That said, the same mechanisms have facilitated the rapid spread of misinformation."
        },
        {
            "id": "ai_essay24",
            "label": "GPT · Public Health",
            "expected": "AI",
            "text": "The COVID-19 pandemic exposed fundamental vulnerabilities in global public health infrastructure. This analysis examines the systemic factors that contributed to disparate outcomes across nations. It is crucial to understand that pre-existing healthcare capacity, political leadership, and social trust were more predictive of outcomes than any single policy intervention."
        },
        {
            "id": "ai_essay25",
            "label": "GPT · Gender Studies",
            "expected": "AI",
            "text": "The evolution of gender discourse in contemporary society reflects broader shifts in how we conceptualize identity and social categories. This essay examines the transition from binary frameworks to more fluid understandings of gender expression. It is worth noting that this theoretical evolution has significant practical implications for healthcare, legal frameworks, and educational policy."
        },
        {
            "id": "ai_essay26",
            "label": "Gemini · Data Science",
            "expected": "AI",
            "text": "Data science has emerged as one of the most transformative fields of the twenty-first century. Whether you're analyzing customer behavior or predicting climate patterns, the fundamental workflow remains consistent: collect, clean, explore, model, and communicate. Here's a pro tip: most beginners spend 80% of their time on modeling and 20% on data preparation, when the ratio should be exactly reversed."
        },
        {
            "id": "ai_essay27",
            "label": "GPT · Criminology",
            "expected": "AI",
            "text": "Recidivism rates represent a critical metric for evaluating the effectiveness of correctional systems. This essay examines the multifactorial nature of criminal reoffending, encompassing socioeconomic determinants, psychological factors, and institutional variables. It is imperative to recognize that purely punitive approaches have demonstrated limited efficacy in reducing recidivism."
        },
        {
            "id": "ai_essay28",
            "label": "Claude · Architecture",
            "expected": "AI",
            "text": "The tension between functionality and aesthetics in architectural design is a theme that runs throughout the history of the built environment. I think both perspectives have merit, and the most successful designs often achieve a synthesis rather than choosing one over the other. The modernist mantra that form follows function captured an important insight about design priorities. However, the postmodern reaction rightly pointed out that purely functional buildings can feel alienating."
        },
        {
            "id": "ai_essay29",
            "label": "GPT · Nutrition Science",
            "expected": "AI",
            "text": "The relationship between dietary patterns and health outcomes has been the subject of extensive scientific investigation. This essay examines the evidence supporting various nutritional frameworks, including Mediterranean, plant-based, and ketogenic approaches. It is important to note that methodological challenges, including reliance on self-reported dietary data, complicate causal inference in nutritional epidemiology."
        },
        {
            "id": "ai_essay30",
            "label": "GPT · Astronomy",
            "expected": "AI",
            "text": "The search for exoplanets has revealed a universe far more diverse than previously imagined. Since the first confirmed detection in 1992, astronomers have catalogued thousands of worlds orbiting distant stars. It is crucial to understand that detection methods inherently bias our sample toward large planets in close orbits. Furthermore, the discovery of potentially habitable exoplanets raises profound questions about the prevalence of life in the universe."
        },
        {
            "id": "ai_biz1",
            "label": "GPT · Quarterly Report",
            "expected": "AI",
            "text": "The third quarter results demonstrate robust performance across all key business segments, with consolidated revenue increasing 14.2% year-over-year to $847 million. This growth was primarily driven by the expansion of our cloud services division, which contributed $312 million in new annual recurring revenue. It is worth noting that our operating margins improved from 23.4% to 27.1%, reflecting the successful implementation of cost optimization initiatives."
        },
        {
            "id": "ai_biz2",
            "label": "GPT · Strategy Memo",
            "expected": "AI",
            "text": "This memorandum outlines the strategic rationale for our proposed acquisition of TechFlow Solutions. The target company's proprietary machine learning platform would significantly enhance our existing product portfolio, providing immediate access to the rapidly growing enterprise AI market. It is crucial to recognize that while the initial acquisition cost of $430 million represents a premium of approximately 22%, the projected synergies in research and development alone justify this investment."
        },
        {
            "id": "ai_biz3",
            "label": "Claude · Market Analysis",
            "expected": "AI",
            "text": "The competitive landscape for enterprise SaaS platforms is evolving in ways that present both significant opportunities and meaningful challenges. I think it's worth examining the trends through a balanced lens. On the positive side, the total addressable market continues to expand at approximately 18% annually, driven by digital transformation initiatives. However, customer acquisition costs have risen 34% over the past two years, and the average sales cycle has lengthened by six weeks."
        },
        {
            "id": "ai_biz4",
            "label": "GPT · Annual Review",
            "expected": "AI",
            "text": "The past fiscal year represented a period of strategic transformation for our organization. We successfully completed the integration of three acquired companies, launched seven new product features, and expanded our workforce by 450 employees across five geographic regions. It is important to acknowledge that these achievements were realized against a backdrop of significant macroeconomic headwinds."
        },
        {
            "id": "ai_biz5",
            "label": "Gemini · Product Launch",
            "expected": "AI",
            "text": "We're thrilled to announce the launch of Enterprise Suite 3.0 — our most significant platform update since the company was founded. Whether you're managing a team of five or five thousand, this release has something for everyone. Here's what you need to know: the new analytics dashboard surfaces insights that used to require a data science team, and we've completely rebuilt the mobile experience from the ground up."
        },
        {
            "id": "ai_biz6",
            "label": "GPT · Risk Assessment",
            "expected": "AI",
            "text": "This risk assessment report evaluates the potential operational, financial, and reputational exposures associated with our planned expansion into emerging markets. It is essential to understand that while these markets present compelling growth opportunities, they also involve heightened regulatory complexity, currency volatility, and political risk. Furthermore, our analysis identifies three critical areas requiring immediate mitigation."
        },
        {
            "id": "ai_biz7",
            "label": "Claude · Team Memo",
            "expected": "AI",
            "text": "As we enter the final quarter of the fiscal year, I wanted to share some reflections on where we stand and what lies ahead. This has been a challenging year in many respects, and I don't want to minimize the difficulties some teams have faced. That said, I'm genuinely proud of what we've accomplished together. Customer satisfaction scores are at an all-time high, and employee engagement has improved for the third consecutive quarter."
        },
        {
            "id": "ai_biz8",
            "label": "GPT · Investor Update",
            "expected": "AI",
            "text": "We are pleased to report that our portfolio companies have demonstrated exceptional resilience in the current macroeconomic environment. Aggregate revenue across our holdings grew 28% year-over-year, with particularly strong performance in the enterprise software and digital health sectors. It is worth mentioning that our early-stage investments in artificial intelligence startups have already yielded two successful exits."
        },
        {
            "id": "ai_biz9",
            "label": "GPT · Project Proposal",
            "expected": "AI",
            "text": "This proposal outlines the implementation plan for migrating our legacy infrastructure to a cloud-native architecture. The projected timeline spans eighteen months across three phases: assessment and planning, staged migration, and optimization. It is crucial to note that the total cost of ownership analysis indicates a 35% reduction in infrastructure expenditure within 24 months of completion."
        },
        {
            "id": "ai_biz10",
            "label": "Claude · Board Presentation",
            "expected": "AI",
            "text": "I want to present a balanced view of our competitive position that acknowledges both our strengths and the areas where we need to improve. Our core product remains best-in-class according to independent benchmarks, and our customer retention rate of 94% is industry-leading. However, I think we need to be honest about the fact that our mobile experience lags behind competitors, and our enterprise sales cycle is too long."
        },
        {
            "id": "ai_biz11",
            "label": "GPT · HR Policy",
            "expected": "AI",
            "text": "This document establishes the comprehensive framework for our organization's hybrid work policy. Effective January 1st, employees will be expected to maintain a minimum of three days per week of in-office presence, with flexible scheduling accommodations for caregivers and employees with documented medical needs. It is imperative to recognize that this policy was developed through extensive consultation with department heads and employee resource groups."
        },
        {
            "id": "ai_biz12",
            "label": "GPT · Sales Pitch",
            "expected": "AI",
            "text": "Imagine a platform that seamlessly integrates with your existing workflow, automatically captures every customer interaction, and surfaces actionable insights before your team even knows to ask. That's precisely what we've built. Our clients typically see a 40% reduction in administrative overhead within the first quarter and a measurable increase in deal velocity by month six."
        },
        {
            "id": "ai_biz13",
            "label": "Gemini · Company Blog",
            "expected": "AI",
            "text": "At Acme Corp, we believe that great technology starts with great people. That's why we've invested heavily in building a culture where creativity thrives and every voice matters. Over the past year, we've launched mentorship programs, expanded our parental leave policy, and completely redesigned our performance review process. The fascinating thing is that these investments have paid for themselves many times over."
        },
        {
            "id": "ai_news1",
            "label": "GPT · Political News",
            "expected": "AI",
            "text": "In a landmark decision, the Supreme Court ruled 6-3 in favor of the plaintiffs in the closely watched environmental regulation case. Chief Justice Roberts, writing for the majority, emphasized that federal agencies must operate within the bounds of their statutory authority. The ruling is expected to have far-reaching implications for environmental policy across the nation. Legal experts predict that dozens of existing regulations may now face increased judicial scrutiny."
        },
        {
            "id": "ai_news2",
            "label": "GPT · Business News",
            "expected": "AI",
            "text": "Global markets experienced significant volatility today as investors reacted to the Federal Reserve's unexpectedly hawkish commentary. The S&P 500 declined 2.1%, while the Nasdaq Composite fell 3.4%, marking its worst single-day performance in six months. It is worth noting that the technology sector bore the brunt of the selling pressure, with semiconductor stocks particularly affected."
        },
        {
            "id": "ai_news3",
            "label": "Claude · Tech News",
            "expected": "AI",
            "text": "The semiconductor industry is experiencing a genuinely transformative moment that warrants careful attention. TSMC's announcement of its 2-nanometer process technology represents a significant leap forward in chip manufacturing. I think it's worth considering both the technical achievement and the geopolitical implications. The concentration of advanced chip fabrication in Taiwan creates supply chain vulnerabilities."
        },
        {
            "id": "ai_news4",
            "label": "GPT · Science News",
            "expected": "AI",
            "text": "Researchers at the Massachusetts Institute of Technology have announced a breakthrough in fusion energy research that could dramatically accelerate the timeline for commercial fusion power. The team successfully sustained a plasma temperature exceeding 100 million degrees Celsius for a record thirty seconds. It is crucial to note that while this achievement represents significant progress, substantial engineering challenges remain."
        },
        {
            "id": "ai_news5",
            "label": "Gemini · Health News",
            "expected": "AI",
            "text": "A groundbreaking new study published in Nature Medicine suggests that a simple blood test could detect Alzheimer's disease up to a decade before symptoms appear. Here's what you need to know: the test measures levels of a specific protein called p-tau217, which researchers found was 95% accurate in predicting future cognitive decline. The fascinating thing about this biomarker is that it appears to be more reliable than expensive brain scans."
        },
        {
            "id": "ai_other1",
            "label": "GPT · Creative Story",
            "expected": "AI",
            "text": "Detective Sarah Chen had seen it all in her fifteen years on the force. But nothing could have prepared her for what awaited behind the door of the abandoned warehouse on Crescent Street. The air was thick with the weight of secrets long buried, and the flickering fluorescent light cast dancing shadows across the walls. It was a sight that would stay with her forever."
        },
        {
            "id": "ai_other2",
            "label": "GPT · Wikipedia Style",
            "expected": "AI",
            "text": "The Pyrenean ibex (Capra pyrenaica pyrenaica) was a subspecies of Spanish ibex endemic to the Pyrenees mountains. It is characterized by its distinctive curved horns, which could reach lengths of up to 75 centimeters in mature males. The subspecies was declared extinct in 2000, though a cloned specimen was briefly brought to life in 2003 through advanced reproductive technology."
        },
        {
            "id": "hum_paper1",
            "label": "News · City Council",
            "expected": "Human",
            "text": "The city council voted 7-2 last night to approve the controversial downtown redevelopment plan, capping months of heated public debate. More than 200 residents showed up to the hearing, many holding signs reading 'Save Our Neighborhood.' Council member Lisa Chen, who voted against the measure, said she was deeply concerned about displacement and the lack of affordable housing provisions in the final agreement. The developer has pledged to set aside 15% of units as below-market rate."
        },
        {
            "id": "hum_paper2",
            "label": "Research · Microplastics",
            "expected": "Human",
            "text": "This study investigates the relationship between microplastic contamination and benthic community composition across 47 sampling sites in the North Atlantic. Our findings reveal a statistically significant negative correlation (p < 0.01) between microplastic concentration and species richness, particularly among filter-feeding organisms. These results align with previous work by Thompson et al. (2023) and extend our understanding of anthropogenic impacts on deep-sea ecosystems."
        },
        {
            "id": "hum_paper3",
            "label": "Grant · Literacy Program",
            "expected": "Human",
            "text": "We request $50,000 to support a community-based literacy program serving low-income families in southeast Detroit. The program will serve approximately 200 children ages 5-12 through after-school reading sessions, book distribution, and parent engagement workshops. We have secured matching funding from the Community Foundation and partnership agreements with three local elementary schools."
        },
        {
            "id": "hum_paper4",
            "label": "Speech · Motivational",
            "expected": "Human",
            "text": "I stand before you today not as an expert, but as someone who cares deeply about this issue. I've made mistakes. I've said things I regret. But I've also learned, and I've grown, and I believe that's what matters most. The path forward isn't easy, but nothing worth doing ever is. So let's roll up our sleeves and get to work."
        },
        {
            "id": "hum_paper5",
            "label": "Interview · Startup Founder",
            "expected": "Human",
            "text": "Q: What was the hardest part of building this company? A: Honestly? The first two years were brutal. We had no money, no customers, and I was sleeping on my co-founder's couch. There was a six-month stretch where we lost three major clients back to back. I remember calling my dad and just... not having words. But we kept going. I think stubbornness is underrated as a business skill."
        },
        {
            "id": "hum_paper6",
            "label": "Academic · Social Psychology",
            "expected": "Human",
            "text": "Participants (N=247) were recruited via university subject pools and completed a series of implicit association tests. Consistent with Hypothesis 2, we observed a significant interaction between priming condition and response latency, F(2, 244) = 7.83, p = .001. Post-hoc comparisons using Tukey's HSD revealed that the stereotype-threat condition (M = 847ms, SD = 123) differed significantly from both the control (M = 712ms, SD = 98) and counter-stereotype conditions (M = 689ms, SD = 105)."
        },
        {
            "id": "hum_paper7",
            "label": "Essay · Personal Growth",
            "expected": "Human",
            "text": "My grandmother taught me to knit when I was seven. I hated it. The needles felt awkward, the yarn kept tangling, and everything I made looked like a cat had attacked it. She'd sit next to me on the couch, patient as stone, and say 'again.' Not angry. Just again. Twenty years later, she's gone, and I still knit. Not because I'm good at it — I'm still pretty terrible — but because every stitch feels like her hands guiding mine."
        },
        {
            "id": "hum_paper8",
            "label": "Research · Public Health",
            "expected": "Human",
            "text": "Our analysis of Medicaid claims data from 2016-2022 (N = 1.2M beneficiaries) reveals substantial geographic variation in telemedicine adoption. Rural counties in the Mountain West showed the highest utilization rates (34.7% of all visits), while urban Northeast counties lagged significantly (12.3%). Qualitative interviews with 47 primary care physicians suggest that reimbursement parity policies were the primary driver of this variation."
        },
        {
            "id": "hum_paper9",
            "label": "History · Cold War Analysis",
            "expected": "Human",
            "text": "The declassification of Soviet archives in the 1990s fundamentally reshaped Cold War historiography. Where earlier scholarship relied primarily on Western sources and often depicted Soviet decision-making as monolithic, the new archival evidence revealed significant internal debate within the Politburo. Zubok and Pleshakov (1996) were among the first to leverage this material, arguing that Khrushchev's adventurism in Cuba stemmed more from domestic political pressures than ideological expansionism."
        },
        {
            "id": "hum_paper10",
            "label": "Literature · Genre Analysis",
            "expected": "Human",
            "text": "The hard-boiled detective novel occupies a peculiar position in the American literary canon. Dismissed by mid-century critics as pulp entertainment, writers like Hammett and Chandler have since been reassessed as serious chroniclers of urban alienation. More interesting, I think, is how the genre's conventions — the lone investigator, the femme fatale, the corrupt institutions — migrated into prestige television. Shows like True Detective and Mare of Easttown owe more to Chandler than to any literary realist."
        },
        {
            "id": "hum_paper11",
            "label": "Philosophy · Free Will",
            "expected": "Human",
            "text": "Compatibilism occupies a strange middle ground in the free will debate. It's too soft for hard determinists and too deterministic for libertarians. Yet I find myself returning to it, not because it's elegant — it isn't — but because it's honest about something most philosophy avoids: the possibility that our concepts simply break down at the edges. Dennett's version, at least, has the virtue of taking both physics and phenomenology seriously."
        },
        {
            "id": "hum_paper12",
            "label": "Economics · Minimum Wage",
            "expected": "Human",
            "text": "Card and Krueger's 1994 minimum wage study remains one of the most cited and contested papers in empirical economics. Using a natural experiment — New Jersey raised its minimum wage while neighboring Pennsylvania did not — they found no evidence of employment reduction. The methodology was sound: difference-in-differences with restaurant-level panel data. Yet the theoretical implications remain unsettled. If monopsony power is more prevalent than standard models assume, the textbook prediction that minimum wages reduce employment may simply be wrong in many labor markets."
        },
        {
            "id": "hum_paper13",
            "label": "Biology · Field Notes",
            "expected": "Human",
            "text": "July 14. Spotted the first tagged monarch of the season at Station 7 — a female, wing condition 3 out of 5. She was nectaring on Asclepias tuberosa, the butterfly weed we planted last spring. What struck me wasn't the sighting itself but the timing: we're nine days ahead of last year's first observation. The milkweed population at Station 4 has also expanded considerably, though I'm not yet ready to attribute this to our restoration efforts rather than favorable weather."
        },
        {
            "id": "hum_paper14",
            "label": "Sociology · Fieldwork",
            "expected": "Human",
            "text": "I spent eighteen months embedded with warehouse workers in the Inland Empire. This kind of ethnography is physically grueling in ways that academic training doesn't prepare you for. The shifts are long, the work repetitive, and the social dynamics complex in ways that resist clean theoretical framing. My informants — I hate that word, it makes them sound like sources rather than people — taught me more about labor organizing than any book ever could."
        },
        {
            "id": "hum_paper15",
            "label": "Computer Science · Paper",
            "expected": "Human",
            "text": "We present DistHash, a distributed hash table implementation that achieves 99th percentile lookup latency of 2.3ms across 10,000 nodes. Our key insight is the decoupling of routing table maintenance from the critical path of request processing. By amortizing maintenance operations across multiple lookups and employing a lazy repair strategy, we reduce tail latency by 67% compared to Chord. Experiments were conducted on a 500-machine cluster using a Zipf-distributed workload."
        },
        {
            "id": "hum_paper16",
            "label": "Art · Critique",
            "expected": "Human",
            "text": "Rothko's Seagram murals at the Tate are often described as meditative, even spiritual. I found them suffocating. Standing in that room, surrounded by those dark crimson rectangles, I felt not transcendence but claustrophobia. Rothko intended them to be oppressive, of course — he wanted viewers to feel trapped, to experience something of his own depression. The fact that the Tate markets them as a wellness experience suggests either genius-level irony or a complete misreading of the work."
        },
        {
            "id": "hum_paper17",
            "label": "Law · Case Brief",
            "expected": "Human",
            "text": "The Ninth Circuit's decision in Garcia v. San Antonio Transit fundamentally altered the landscape of state sovereign immunity. Writing for a 5-4 majority, Justice Blackmun held that Congress has broad authority under the Commerce Clause to regulate state activities, overruling National League of Cities v. Usery. The dissent, authored by Justice Rehnquist, argued that the majority's reasoning rendered the Tenth Amendment 'a dead letter.'"
        },
        {
            "id": "hum_paper18",
            "label": "Medicine · Case Report",
            "expected": "Human",
            "text": "A 47-year-old male presented to the emergency department with acute onset chest pain radiating to the left arm, accompanied by diaphoresis and shortness of breath. Initial ECG revealed ST-segment elevation in leads V1-V4. Troponin I was elevated at 12.4 ng/mL. The patient was taken emergently to the catheterization lab, where angiography revealed a 95% occlusion of the proximal LAD. Two drug-eluting stents were placed with excellent angiographic result."
        },
        {
            "id": "hum_paper19",
            "label": "Theory · Postcolonial",
            "expected": "Human",
            "text": "Spivak's 'Can the Subaltern Speak?' is one of those texts that everyone cites and almost no one reads carefully. The common gloss — that marginalized groups are systematically silenced — captures maybe twenty percent of the argument. The harder claim is that the very framework of representation, including academic discourse, reproduces the conditions of subalternity. This is uncomfortable for scholars who imagine their work as emancipatory. It's meant to be."
        },
        {
            "id": "hum_paper20",
            "label": "Education · Teaching Note",
            "expected": "Human",
            "text": "After twelve years of teaching introductory composition, I've learned that the most important thing is to make students care about what they're writing. Grammar matters. Structure matters. Citation formats matter. But none of it sticks unless they feel like the words belong to them. I used to spend the first week on thesis statements. Now I spend it on 'what would you fight for?' The thesis statements come easier after that."
        },
        {
            "id": "hum_paper21",
            "label": "Memoir · Immigration",
            "expected": "Human",
            "text": "My parents arrived at JFK on a gray November afternoon in 1987 with two suitcases and $400 between them. They didn't speak English. They didn't know anyone. My father had been a civil engineer in Lahore; his first American job was driving a cab. My mother cleaned hotel rooms. They never complained — at least not in front of us. It took me until my twenties to understand that silence wasn't acceptance. It was exhaustion."
        },
        {
            "id": "hum_paper22",
            "label": "Science · Methodology",
            "expected": "Human",
            "text": "Sample preparation followed the protocol described by Chen et al. (2021), with modifications noted below. Briefly, tissue samples were homogenized in RIPA buffer containing protease inhibitors and centrifuged at 14,000g for 15 minutes at 4°C. Protein concentration was determined by BCA assay. For Western blotting, 30μg of protein per lane was separated on 4-12% Bis-Tris gels and transferred to PVDF membranes."
        },
        {
            "id": "hum_paper23",
            "label": "History · Local",
            "expected": "Human",
            "text": "The 1911 Triangle Shirtwaist Factory fire killed 146 workers, most of them young immigrant women. What's less remembered is that the building had passed a fire inspection just months earlier. The inspector, a Tammany Hall appointee named William Blanchard, later admitted under oath that he had never actually inspected the ninth floor — the floor where most of the deaths occurred. The city's fire code was adequate on paper. Enforcement was another matter entirely."
        },
        {
            "id": "hum_paper24",
            "label": "Memoir · Teaching",
            "expected": "Human",
            "text": "First period, room 204. Thirty-two ninth graders who would rather be anywhere else. I started with a writing prompt — 'describe a place that feels like home' — and a kid in the back row, hoodie pulled low, wrote about his grandmother's kitchen in Oaxaca. The smell of fresh tortillas. The radio always playing. His grandmother had died two years ago. He'd never written about her before. Sometimes the best teaching happens in the five minutes you didn't plan for."
        },
        {
            "id": "hum_paper25",
            "label": "Science · Peer Review",
            "expected": "Human",
            "text": "I have now reviewed this manuscript for the third time, and I regret to say that my fundamental concerns remain unaddressed. The authors continue to assert causal claims based on purely correlational data, and the instrumental variable they employ — rainfall in the county of birth — fails the exclusion restriction for reasons I have now explained in three separate reviews."
        },
        {
            "id": "hum_paper26",
            "label": "Essay · Reading Moby Dick",
            "expected": "Human",
            "text": "I read Moby Dick for the first time last summer, at 34, and I'm embarrassed it took me this long. Not because it's a classic — I've read plenty of those — but because I had absorbed the cultural consensus that it's a slog. It's not. It's funny. Genuinely funny. Ishmael's digressions about whale taxonomy are obviously meant to be absurd. The whole book is a 600-page setup for the punchline that obsession destroys everything it touches."
        },
        {
            "id": "hum_paper27",
            "label": "Theory · Critical",
            "expected": "Human",
            "text": "Butler's concept of performativity is routinely misappropriated in ways that strip it of its political force. Yes, gender is performed. But the corollary is not that gender is a costume one can change at will. The performance is compelled, policed, and materially consequential. When Butler writes that gender is 'a strategy of survival within compulsory systems,' the emphasis belongs on 'survival' and 'compulsory' as much as on 'strategy.'"
        },
        {
            "id": "hum_paper28",
            "label": "Essay · Simple Tools",
            "expected": "Human",
            "text": "I've been using the same text editor since 2008. It's ugly. It doesn't have AI features. The developer stopped updating it in 2015. My colleagues think I'm a Luddite. But here's the thing: I know exactly how it works. Every keystroke. Every quirk. There's a kind of mastery that comes from using simple tools for a long time, and I think we're losing that. Everything updates every week now. You never get to know anything."
        },
        {
            "id": "hum_paper29",
            "label": "Research · Abstract",
            "expected": "Human",
            "text": "We report evidence from a randomized controlled trial (N=1,247) testing the efficacy of a peer mentoring program on first-generation college student retention. Treated students were 8.3 percentage points more likely to persist to sophomore year (p < .01). The effect was concentrated among students who entered with high school GPAs below 3.0, suggesting the intervention primarily benefited those at highest risk of attrition."
        },
        {
            "id": "hum_paper30",
            "label": "Story · Childhood",
            "expected": "Human",
            "text": "The summer I turned eleven, my dad decided to build a treehouse. He wasn't handy. He'd be the first to tell you. But he bought a book from the hardware store and spent every Saturday for two months in the backyard, measuring and remeasuring, cutting boards twice because he'd cut them wrong the first time. The treehouse was crooked. The ladder was terrifying. It was the best thing anyone had ever built for me. I still have the book. His handwriting in the margins."
        },
        {
            "id": "hum_paper31",
            "label": "Philosophy · Ethics Essay",
            "expected": "Human",
            "text": "Here's what bothers me about the trolley problem. Not the problem itself — that's fine as a thought experiment. What bothers me is how smugly we all assume we'd make the right call. Pull the lever, save five, sacrifice one. Easy. But nobody actually knows what they'd do when a real person is on that track. All the moral philosophy in the world evaporates the moment you hear someone scream. I think that's the point that utilitarian ethics misses."
        },
        {
            "id": "hum_paper32",
            "label": "Research · Linguistics",
            "expected": "Human",
            "text": "This paper challenges the prevailing view that creole languages represent simplified versions of their lexifier languages. Drawing on fieldwork conducted in Mauritius (2018-2022), we demonstrate that Mauritian Creole exhibits syntactic complexity comparable to French in several domains, including tense-aspect-mood marking and relativization strategies. These findings support the uniformitarian hypothesis — that all human languages, regardless of their historical origins, are equally complex."
        },
        {
            "id": "hum_biz1",
            "label": "Legal · Memo",
            "expected": "Human",
            "text": "Here's my take on the settlement offer. It's not great but it's probably the best we're going to get before trial. The plaintiff's medical expert is solid and the jury pool in this county tends to be plaintiff-friendly. I think we should counter at $150k and see if they bite. If they don't, we prepare for trial but keep the door open."
        },
        {
            "id": "hum_biz2",
            "label": "Email · Project Update",
            "expected": "Human",
            "text": "Hey everyone, quick update on the project. We're making good progress on Phase 2 and should be ready for the review meeting next Thursday. A few things I need from each of you before then: 1) Update your task status in Asana, 2) Review the attached design doc, 3) Let me know if you see any blockers I should flag to leadership. Also — donuts on Friday to celebrate hitting the milestone!"
        },
        {
            "id": "hum_biz3",
            "label": "Changelog · Release Notes",
            "expected": "Human",
            "text": "v2.4.1 — Fixed an issue where WebSocket connections would timeout after 60 minutes on Safari 17.0. Improved the CSV export performance for tables with more than 10,000 rows. Added support for PostgreSQL 16 partitioned tables in the schema browser. The Docker image now runs as non-root by default on Kubernetes deployments using the securityContext configuration. Known issue: the dark mode toggle resets when switching between Dashboard and Reports views (fix coming in 2.4.2)."
        },
        {
            "id": "hum_biz4",
            "label": "Tech · GPU Specification",
            "expected": "Human",
            "text": "The NVIDIA H100 Tensor Core GPU delivers 3.2x throughput compared to the previous A100 generation for large language model training workloads. Google Cloud, AWS, and Microsoft Azure all offer H100 instances, though pricing varies significantly across regions. We benchmarked a 70B parameter model on 8x H100 nodes and saw training times drop from 22 days to just under 7. Impressive hardware, but the power draw is no joke — each node pulls about 5.5kW under full load."
        },
        {
            "id": "hum_biz5",
            "label": "Spec · API Documentation",
            "expected": "Human",
            "text": "API Endpoint: POST /v2/batch/analyze. Rate Limit: 100 requests per minute per API key. Authentication: Bearer token in Authorization header. Request Body: JSON array of up to 50 text strings. Response: JSON array of analysis objects with confidence scores. Error Codes: 429 (rate limit), 413 (payload too large), 401 (invalid key)."
        },
        {
            "id": "hum_biz6",
            "label": "Memo · Warehouse Operations",
            "expected": "Human",
            "text": "We need to address the warehouse scheduling issue before Q4 ramp-up. The current system has us running at 87% capacity on day shifts and only 40% on nights. I'm proposing we move the Midwest fulfillment to a swing shift model — 11am to 8pm. This would let us catch the late afternoon order surge without adding headcount. Charlie from logistics is against it (says it'll mess with his trucking contracts), but I ran the numbers and even with the increased shipping cost we come out ahead."
        },
        {
            "id": "hum_biz7",
            "label": "Sales · Deal Debrief",
            "expected": "Human",
            "text": "The Phoenix deal closed yesterday. $2.1M ARR, three-year contract, no termination clause until month 18. They pushed hard on pricing — we ended up at a 12% discount off list — but they committed to a case study and agreed to speak at our user conference. Net-net: good deal, not great. Cisco is circling, so we need to nail the implementation. I'm assigning Patel as the technical account manager. He's our best closer on healthcare accounts."
        },
        {
            "id": "hum_biz8",
            "label": "Postmortem · Incident Report",
            "expected": "Human",
            "text": "On May 12 at 14:37 UTC, users began experiencing elevated error rates on the checkout service. Root cause: a configuration change deployed at 14:30 inadvertently routed 30% of traffic to a staging database that contained stale inventory data. Impact: approximately 12,000 users were unable to complete purchases over a 47-minute window. The change was rolled back at 15:17. Preventative measures: we've added a pre-deploy validation step that verifies database connection strings against production environment variables."
        },
        {
            "id": "hum_biz9",
            "label": "Investment · Thesis Memo",
            "expected": "Human",
            "text": "I'm going to make the case that we should pass on the Series B for ClearView AI. The technology is impressive, but here's my concern: they're selling to hospital systems, and hospital procurement cycles are 18+ months. They've burned through $8M of their $12M Series A in 14 months, and their current pipeline won't close before they run out of cash. I know the AI label is hot right now, but their unit economics don't work. We'd be throwing good money after bad."
        },
        {
            "id": "hum_biz10",
            "label": "Board · Update",
            "expected": "Human",
            "text": "The board meeting went about as expected. Audit committee signed off on the Q2 numbers without issues — Margie did a great job prepping them. The real fireworks were around the acquisition proposal. Tom wants to move faster; I want another quarter of due diligence. We compromised on a 45-day exclusivity window with a break fee. Not ideal, but it keeps both sides at the table. Elizabeth is drafting the term sheet now."
        },
        {
            "id": "hum_biz11",
            "label": "Job · Posting",
            "expected": "Human",
            "text": "We're hiring a senior frontend developer to join our core product team. You'll be building React components, improving our design system, and occasionally yelling at Webpack configs. We're looking for someone with 4+ years of experience who cares about accessibility and actually documents their code. Remote-friendly (US time zones only, sorry). Competitive salary, good benefits, and a team that genuinely likes working together."
        },
        {
            "id": "hum_biz12",
            "label": "Guide · CLI Setup",
            "expected": "Human",
            "text": "To get started, install the CLI tool: npm install -g @company/cli. Initialize your project with cli init. This creates a .company/config.yml file. Open it and add your API key. If you see an authentication error, check that your key has the correct IAM permissions — most issues are key-related. Use cli deploy --env staging first. Never deploy to production on a Friday."
        },
        {
            "id": "hum_news1",
            "label": "News · Financial",
            "expected": "Human",
            "text": "Tesla shares fell 8% in after-hours trading following the company's quarterly earnings report, which showed declining margins despite record vehicle deliveries. CEO Elon Musk attributed the squeeze to 'aggressive pricing strategies' aimed at maintaining market share. Analysts at Morgan Stanley described the results as 'concerning but not catastrophic,' noting that the energy storage division showed promising growth."
        },
        {
            "id": "hum_news2",
            "label": "News · Sports",
            "expected": "Human",
            "text": "The Lakers overcame a 15-point deficit in the fourth quarter to stun the Celtics 112-108 in Game 3 of the NBA Finals. LeBron James scored 18 of his 34 points in the final period, including a go-ahead three-pointer with 47 seconds remaining. 'I just trusted my training,' James said afterward. 'At this stage, it's all about execution.' The series now stands at 2-1, with Game 4 Thursday in Boston."
        },
        {
            "id": "hum_news3",
            "label": "News · Local Politics",
            "expected": "Human",
            "text": "The mayor announced the new policy at a press conference that lasted just 12 minutes — which should tell you something about how much detail was actually shared. When reporters asked about the budget impact, she deferred to the CFO, who wasn't present. When they asked about the timeline, she said 'soon.' The plan itself is ambitious on paper. Whether it works in practice remains to be seen."
        },
        {
            "id": "hum_news4",
            "label": "News · Investigative",
            "expected": "Human",
            "text": "Internal documents obtained by the Times reveal that the chemical company knew about groundwater contamination at its West Virginia facility as early as 2003 — six years before it was disclosed to regulators. Former employees, speaking on condition of anonymity, described a corporate culture that prioritized production targets over safety protocols. 'Everyone knew,' said one engineer who worked at the plant from 2001 to 2008. 'Nobody wanted to be the one to say it.'"
        },
        {
            "id": "hum_news5",
            "label": "News · Feature",
            "expected": "Human",
            "text": "Maria Gonzalez has been making tamales in her East LA kitchen every Saturday for 42 years. She starts at 4 AM, even though her children keep telling her to sleep in. 'What would I do with extra sleep?' she says, laughing. Her tamales have fed three generations of the neighborhood — wedding receptions, quinceañeras, funerals. When the pandemic shut down her weekend stand at the farmers market, her customers started showing up at her front door. She never missed a Saturday."
        },
        {
            "id": "hum_other1",
            "label": "Poem · Human",
            "expected": "Human",
            "text": "The cat sleeps in a patch of afternoon sun. / The world keeps turning, wars and peace and all that. / I'm thinking about what she said last night. / Or maybe I'm thinking about nothing at all. / The cat stretches, resettles, goes on sleeping. / Some things are that simple."
        },
        {
            "id": "hum_other2",
            "label": "Creative · Old House",
            "expected": "Human",
            "text": "The first time I saw the old house, I knew there was something wrong with it. Not in a ghost story way. More like the way the windows seemed to stare at nothing in particular, and how the front door hung just slightly crooked in its frame. It wasn't haunted. It was just sad. I bought it anyway."
        },
        {
            "id": "hum_other3",
            "label": "Social · Reddit Post",
            "expected": "Human",
            "text": "honestly i don't know what i'm doing half the time. like yesterday i spent three hours trying to fix a bug and it turned out i just forgot a comma somewhere. my roommate walked in and was like 'dude you good?' and i just stared at the screen. anyway i think we should grab coffee later and maybe actually read the docs this time. or not. whatever."
        }
    ]
}