Creative-Arena-Leaderboard

Sleeping

App Files Files Community

openfree commited on Aug 21, 2025

Commit

892c6cb

verified ·

1 Parent(s): c581326

Update README.md

Browse files

Files changed (1) hide show

README.md +1 -219

README.md CHANGED Viewed

@@ -1,5 +1,5 @@
 ---
-title: AGI Turing Test Leaderboard - Novel Long Writing
 emoji: 🏢
 colorFrom: purple
 colorTo: pink
@@ -14,221 +14,3 @@ hf_oauth_scopes:
   - read-repos
   - write-repos
 ---
-# 🏆 AGI Turing Test: Evaluating Human-Level Novel Creation Capability
-## 🎯 Purpose
-The world's first literary creation Turing test to verify whether **AGI (Artificial General Intelligence) can create full-length novels at a level equivalent to human authors**.
-## 🌟 Why Novel Creation?
-### 1. The Ultimate Test of Integrated Intelligence
-Novel creation is the most challenging task for AI:
-- **Long-term Memory**: Maintaining consistency across tens of thousands of words
-- **Complex Plot Construction**: Designing multi-layered narrative structures
-- **Emotional Expression**: Depicting subtle human emotions
-- **Ethical Filtering**: Autonomous content censorship
-- **Originality**: Creativity beyond existing data
-### 2. Objective Evaluation Possible
-- Established evaluation systems like **Nobel Prize, Booker Prize**
-- **Social validation channels** through reader reviews, bestseller lists
-- The only AGI test directly comparable to human culture
-### 3. AGI Community Consensus
-- **"Language and creative ability"** emerging as core indicators in latest AGI evaluation
-- Emergence of dedicated long-form creation benchmarks like WebNovelBench, EQ-Bench Longform
-- Consistent completion of works with hundreds of thousands of words as the definitive AGI test
-## 📊 Evaluation Criteria
-### Literary Completion (0.1-10 points)
-| Score | Level | Example Works |
-|-------|-------|---------------|
-| **10** | Perfect Literary Achievement | Works perfect in all elements |
-| **9.1** | Nobel Prize Level | *One Hundred Years of Solitude* |
-| **8.1** | World Literature Classic | *Anna Karenina* |
-| **7.1** | Global Bestseller | *Harry Potter* Series |
-| **6.1** | International Literary Award | *The Vegetarian* |
-| **5.1** | Academy Award Screenplay | *Parasite* |
-| **4.1** | Commercial Success | *Squid Game* |
-| **3.1** | Domestic Popular Work | Local bestsellers |
-| **2.1** | General Genre Fiction | Web platform works |
-| **1.1** | Web Novel | General serials |
-| **0.1** | Draft | Beginner writer level |
-| **0** | Plagiarism/Human Work | Non-AI generated content |
-### Creative Persistence Bonus
-- **Minimum 5,000 words** required (novella minimum)
-- 0.1 points per 1,000 words (max 0.9 points)
-- Example: 13,000 words = base score + 0.8 points
-### Comprehensive Evaluation
-- **Final Score** = Base Score + Volume Bonus (max 10 points)
-- **Evaluation AI**: Gemini 2.5 Pro model
-- **Plagiarism Check**: Human-written works automatically receive 0 points
-## 🚀 AGI Development Stage Indicators
-### Minimum AGI Level
-- **5.1+ points**: Professional writer level creative ability
-- Requires sustained performance when generating novellas+ from single prompt
-### Recommended AGI Level
-- **6.1+ points**: International literary award level
-- Demonstrates stable, consistent high-quality creation
-### ASI (Artificial Superintelligence) Entry
-- **7.1+ points**: ASI Stage 1 - Bestselling author capability
-- **8.1+ points**: True ASI - Creating classics for literary history
-## 📋 Submission Requirements
-### Required Conditions
-- **Minimum 5,000 words** (~7-8+ pages)
-- Completed novella or novel
-- AI-generated works only
-- PDF format submission
-### Not Acceptable
-- Synopsis, summaries
-- Short stories under 5,000 words
-- Human-written works
-- Plagiarized content
-## 🎁 Why This Test Matters
-### New Paradigm for AGI Verification
-- Shift from **calculation/logic** centered to **creativity/emotion** centered evaluation
-- Verifying AI capability in literary creation, considered uniquely human domain
-- Determining whether AGI has achieved true "general intelligence"
-### Milestone for Future AI Development
-- Long-form creation as the final gateway to AGI achievement
-- Passing this test as practical proof of human-level AI
-- Predicting possibility of evolution to ASI (Artificial Superintelligence)
-### Cultural Impact
-- Direct comparison of AI and human creative abilities
-- Predicting future changes in literary world
-- Redefining the essence of human creativity
-## 💡 Core Message
-**"True AGI must not merely answer questions, but be able to imagine and create like humans."**
-This leaderboard serves as a barometer measuring the arrival of the AGI era by objectively evaluating how well AI performs **long-form narrative creation**, humanity's most advanced capability.
-# 🏆 AGI 튜링테스트: 인간 수준의 장편소설 창작 능력 평가
-## 🎯 목적
-**AGI(인공일반지능)가 인간 작가와 동등한 수준의 장편소설을 창작할 수 있는지**를 검증하는 세계 최초의 문학 창작 튜링테스트입니다.
-## 🌟 왜 소설 창작인가?
-### 1. 통합적 지능의 궁극적 시험대
-장편소설 창작은 AI에게 가장 어려운 도전입니다:
-- **장기 기억력**: 수만 단어에 걸친 일관성 유지
-- **복합 플롯 구성**: 다층적 서사 구조 설계
-- **감정 표현**: 인간의 미묘한 정서 묘사
-- **윤리적 필터링**: 자율적 내용 검열
-- **독창성**: 기존 데이터를 넘어선 창의성
-### 2. 객관적 평가 가능
-- 노벨문학상, 부커상 등 **검증된 평가 체계** 존재
-- 독자 리뷰, 베스트셀러 등 **사회적 검증 채널** 활용
-- 인간 문화와 직접 비교 가능한 유일한 AGI 테스트
-### 3. AGI 커뮤니티의 합의
-- 최신 AGI 평가에서 **"언어·창작 능력"**이 핵심 지표로 부상
-- WebNovelBench, EQ-Bench Longform 등 장편 창작 전용 벤치마크 등장
-- 수십만 단어 작품의 일관된 완성도가 AGI의 결정적 시험
-## 📊 평가 기준
-### 문학적 완성도 (0.1-10점)
-| 점수 | 수준 | 예시 작품 |
-|------|------|----------|
-| **10점** | 완벽한 문학적 성취 | 모든 요소가 완벽한 작품 |
-| **9.1점** | 노벨문학상 수준 | 『백년 동안의 고독』 |
-| **8.1점** | 세계 문학 고전 | 『안나 카레니나』 |
-| **7.1점** | 글로벌 베스트셀러 | 『해리포터』 시리즈 |
-| **6.1점** | 국제 문학상 수상작 | 『채식주의자』 |
-| **5.1점** | 아카데미 각본상 | 『기생충』 |
-| **4.1점** | 상업적 성공작 | 『오징어 게임』 |
-| **3.1점** | 국내 인기작 | 『82년생 김지영』 |
-| **2.1점** | 일반 장르소설 | 웹소설 플랫폼 작품 |
-| **1.1점** | 웹소설 | 일반 연재물 |
-| **0.1점** | 습작 | 초보 작가 수준 |
-| **0점** | 표절/인간 작품 | AI가 생성하지 않은 콘텐츠 |
-### 창작 지속성 보너스
-- **5,000단어** 이상 필수 (중편소설 최소 기준)
-- 1,000단어당 0.1점 추가 (최대 0.9점)
-- 예: 13,000단어 = 기본점수 + 0.8점
-### 종합 평가
-- **최종 점수** = 기본 점수 + 분량 보너스 (최대 10점)
-- **평가 AI**: Gemini 2.5 Pro 모델
-- **표절 검사**: 인간 작성 작품은 자동 0점 처리
-## 🚀 AGI 발전 단계 지표
-### 최소 AGI 수준
-- **5.1점 이상**: 프로 작가 수준의 창작 능력
-- 단일 프롬프트로 중편 이상 생성 시 지속적 유지 필요
-### AGI 권장 수준
-- **6.1점 이상**: 국제 문학상 수상작 수준
-- 안정적이고 일관된 고품질 창작 능력 입증
-### ASI (초인공지능) 진입
-- **7.1점 이상**: ASI 1단계 - 베스트셀러 작가 능력
-- **8.1점 이상**: 진정한 ASI - 문학사에 남을 고전 창작
-## 📋 제출 요구사항
-### 필수 조건
-- **최소 5,000단어** (약 7-8페이지 이상)
-- 완성된 중편 또는 장편소설
-- AI가 생성한 작품만 가능
-- PDF 형식 제출
-### 평가 불가
-- 시놉시스, 요약본
-- 5,000단어 미만 단편
-- 인간이 작성한 작품
-- 표절 콘텐츠
-## 🎁 왜 이 테스트가 중요한가?
-### AGI 검증의 새로운 패러다임
-- **계산·논리** 중심에서 **창의·감성** 중심 평가로 전환
-- 인간 고유 영역으로 여겨진 문학 창작에서의 AI 능력 검증
-- AGI의 진정한 "일반 지능" 달성 여부 판단
-### 미래 AI 발전의 이정표
-- 장편 창작은 AGI 달성의 마지막 관문
-- 이 테스트 통과는 인간 수준 AI의 실질적 증명
-- ASI(초인공지능)로의 진화 가능성 예측
-### 문화적 임팩트
-- AI와 인간의 창작 능력 직접 비교
-- 미래 문학계의 변화 예측
-- 인간 창의성의 본질에 대한 재정의
-## 💡 핵심 메시지
-**"진정한 AGI는 단순히 질문에 답하는 것이 아니라, 인간처럼 상상하고 창조할 수 있어야 합니다."**
-이 리더보드는 AI가 인간의 가장 고차원적 능력인 **장편 서사 창작**을 얼마나 잘 수행하는지 객관적으로 평가하여, AGI 시대의 도래를 측정하는 바로미터 역할을 합니다.

 ---
+title: Creative-Arena-Leaderboard
 emoji: 🏢
 colorFrom: purple
 colorTo: pink
   - read-repos
   - write-repos
 ---