KoreanPartyCommunication / QUICKSTART.md
hanjunlee's picture
Upload 23 files
3a36548 verified
# ๋น ๋ฅธ ์‹œ์ž‘ ๊ฐ€์ด๋“œ
## 5๋ถ„ ์•ˆ์— ์‹œ์ž‘ํ•˜๊ธฐ
### 1๋‹จ๊ณ„: ์„ค์น˜ (1๋ถ„)
```bash
setup.bat
```
### 2๋‹จ๊ณ„: ํ—ˆ๊น…ํŽ˜์ด์Šค ํ† ํฐ ์„ค์ • (2๋ถ„)
1. https://huggingface.co/settings/tokens ์ ‘์†
2. "New token" โ†’ ์ด๋ฆ„: `party-crawler` โ†’ ๊ถŒํ•œ: **Write** โ†’ ์ƒ์„ฑ ํ›„ ๋ณต์‚ฌ
3. `.env` ํŒŒ์ผ์„ ๋ฉ”๋ชจ์žฅ์œผ๋กœ ์—ด๊ณ  ์ž…๋ ฅ:
```
HF_TOKEN=์—ฌ๊ธฐ์—_๋ณต์‚ฌํ•œ_ํ† ํฐ_๋ถ™์—ฌ๋„ฃ๊ธฐ
HF_REPO_ID=your_username/minjoo-press-releases
HF_REPO_ID_PPP=your_username/ppp-press-releases
HF_REPO_ID_REBUILDING=your_username/rebuilding-press-releases
HF_REPO_ID_REFORM=your_username/reform-press-releases
HF_REPO_ID_BASIC_INCOME=your_username/basic-income-press-releases
HF_REPO_ID_JINBO=your_username/jinbo-press-releases
```
> **์ค‘์š”**: `your_username`์„ ์‹ค์ œ ํ—ˆ๊น…ํŽ˜์ด์Šค ์‚ฌ์šฉ์ž๋ช…์œผ๋กœ ๋ณ€๊ฒฝํ•˜์„ธ์š”!
### 3๋‹จ๊ณ„: ์‹คํ–‰ (2๋ถ„)
#### ์ „์ฒด ์ •๋‹น ํ•œ ๋ฒˆ์— ์ˆ˜์ง‘ (์ถ”์ฒœ)
```bash
python main.py
```
#### ํŠน์ • ์ •๋‹น๋งŒ ์ˆ˜์ง‘
```bash
python main.py --party minjoo # ๋”๋ถˆ์–ด๋ฏผ์ฃผ๋‹น
python main.py --party ppp # ๊ตญ๋ฏผ์˜ํž˜
python main.py --party rebuilding # ์กฐ๊ตญํ˜์‹ ๋‹น
python main.py --party reform # ๊ฐœํ˜์‹ ๋‹น
python main.py --party basic_income # ๊ธฐ๋ณธ์†Œ๋“๋‹น
python main.py --party jinbo # ์ง„๋ณด๋‹น
```
#### ๋‚ ์งœ ๋ฒ”์œ„ ์ง€์ •
```bash
python main.py --start-date 2024-01-01
python main.py --party reform --start-date 2024-01-01 --end-date 2024-06-30
```
## ์™„๋ฃŒ!
๋ฐ์ดํ„ฐ ์ €์žฅ ์œ„์น˜:
- **๋กœ์ปฌ**: `./data/` ํด๋” (CSV, Excel)
- **ํ—ˆ๊น…ํŽ˜์ด์Šค**: ๊ฐ ์ •๋‹น๋ณ„ ์ €์žฅ์†Œ์— ์ž๋™ ์—…๋กœ๋“œ
## ์ „์ฒด ์˜ต์…˜ ์š”์•ฝ
| ๋ช…๋ น์–ด | ์„ค๋ช… |
|--------|------|
| `python main.py` | 6๊ฐœ ์ •๋‹น ์ „์ฒด ์ฆ๋ถ„ ์—…๋ฐ์ดํŠธ |
| `python main.py --party [์ฝ”๋“œ]` | ํŠน์ • ์ •๋‹น๋งŒ |
| `python main.py --start-date YYYY-MM-DD` | ์‹œ์ž‘ ๋‚ ์งœ ์ง€์ • |
| `python unified_scheduler.py` | ๋งค์ผ ์ž๋™ ์‹คํ–‰ (์Šค์ผ€์ค„๋Ÿฌ) |
## ์ •๋‹น ์ฝ”๋“œ ๋ชฉ๋ก
| ์ฝ”๋“œ | ์ •๋‹น |
|------|------|
| `minjoo` | ๋”๋ถˆ์–ด๋ฏผ์ฃผ๋‹น |
| `ppp` | ๊ตญ๋ฏผ์˜ํž˜ |
| `rebuilding` | ์กฐ๊ตญํ˜์‹ ๋‹น |
| `reform` | ๊ฐœํ˜์‹ ๋‹น |
| `basic_income` | ๊ธฐ๋ณธ์†Œ๋“๋‹น |
| `jinbo` | ์ง„๋ณด๋‹น |
| `all` | ์ „์ฒด (๊ธฐ๋ณธ๊ฐ’) |
## ๋ฌธ์ œ ํ•ด๊ฒฐ
| ๋ฌธ์ œ | ํ•ด๊ฒฐ |
|------|------|
| "HF_TOKEN์ด ์„ค์ •๋˜์ง€ ์•Š์•˜์Šต๋‹ˆ๋‹ค" | `.env` ํŒŒ์ผ์˜ `HF_TOKEN` ํ™•์ธ |
| "Module not found" | `setup.bat` ๋‹ค์‹œ ์‹คํ–‰ |
| ํฌ๋กค๋ง์ด ๋А๋ ค์š” | `crawler_config.json`์—์„œ `concurrent_requests`๋ฅผ 30์œผ๋กœ ์ฆ๊ฐ€ |
| ํŠน์ • ์ •๋‹น๋งŒ ์‹คํŒจ | `python main.py --party [์ฝ”๋“œ]`๋กœ ๊ฐœ๋ณ„ ์‹คํ–‰ํ•˜์—ฌ ํ™•์ธ |
## ๋„์›€๋ง
```bash
python main.py --help
```
์ „์ฒด ๋ฌธ์„œ: `README.md`