RanenSim committed on
Commit
01f9953
·
1 Parent(s): 6d47469

feat: rename model

.gitattributes CHANGED
@@ -11,6 +11,7 @@
  *.mlmodel filter=lfs diff=lfs merge=lfs -text
  *.model filter=lfs diff=lfs merge=lfs -text
  *.msgpack filter=lfs diff=lfs merge=lfs -text
+ *.nemo filter=lfs diff=lfs merge=lfs -text
  *.npy filter=lfs diff=lfs merge=lfs -text
  *.npz filter=lfs diff=lfs merge=lfs -text
  *.onnx filter=lfs diff=lfs merge=lfs -text
.gitignore ADDED
@@ -0,0 +1 @@
+ ASR/whisper/training_metrics.png
ASR/parakeet/README.md ADDED
@@ -0,0 +1,115 @@
+ ---
+ language:
+ - en
+ license: other
+ tags:
+ - nemo
+ - parakeet
+ - tdt
+ - automatic-speech-recognition
+ - air-traffic-control
+ - atc
+ - singapore
+ - military
+ base_model: nvidia/parakeet-tdt-0.6b-v2
+ pipeline_tag: automatic-speech-recognition
+ metrics:
+ - wer
+ model-index:
+ - name: parakeet-tdt-0.6b-v2-atc-singapore
+   results:
+   - task:
+       type: automatic-speech-recognition
+     metrics:
+     - name: Validation WER
+       type: wer
+       value: 1.14
+ ---
+
+ # Parakeet-TDT 0.6B v2 - Singapore Military ATC
+
+ Fine-tuned NVIDIA Parakeet-TDT 0.6B v2 for Singapore Air Force air traffic control speech recognition.
+
+ ## Performance
+
+ | Checkpoint | Validation WER | Notes |
+ |------------|----------------|-------|
+ | `model.ckpt` | **1.14%** | Best checkpoint |
+ | `epoch=54-val_wer=0.0245-last.ckpt` | 2.45% | Final checkpoint, not published here |
+
+ ## Model Details
+
+ | Key | Value |
+ |-----|-------|
+ | Base model | `nvidia/parakeet-tdt-0.6b-v2` |
+ | Framework | NeMo |
+ | Model class | `EncDecRNNTBPEModel` |
+ | Format | Raw `.ckpt` checkpoint + tokenizer artifacts |
+ | Checkpoint size | 7.0 GB |
+ | Domain | Singapore military ATC (Tengah WSAT, Paya Lebar WSAP) |
+
+ ## Included Files
+
+ - `model.ckpt` - best fine-tuned checkpoint
+ - `artifacts/705f11d22dc04b169effc35ce5cd1361_tokenizer.model`
+ - `artifacts/a4715c7f6b2d4c2bb709306073d0c0a4_tokenizer.vocab`
+ - `artifacts/4cf78c8ca4ca44fca36c3754478fb188_vocab.txt`
+
+ ## Usage
+
+ This repo currently publishes the fine-tuned model as a raw NeMo checkpoint rather than a packaged `.nemo` archive. The tokenizer artifact paths therefore need to be pointed at the local `artifacts/` folder before restore.
+
+ ```python
+ from pathlib import Path
+
+ import torch
+ from omegaconf import OmegaConf
+ from nemo.collections.asr.models import EncDecRNNTBPEModel
+
+ model_dir = Path("ASR/parakeet")
+ ckpt_path = model_dir / "model.ckpt"
+ artifacts_dir = model_dir / "artifacts"
+
+ bundle = torch.load(ckpt_path, map_location="cpu", weights_only=False)
+ cfg = bundle["hyper_parameters"]["cfg"]
+ cfg = OmegaConf.create(OmegaConf.to_container(cfg, resolve=False))
+ cfg.tokenizer.model_path = str(
+     artifacts_dir / "705f11d22dc04b169effc35ce5cd1361_tokenizer.model"
+ )
+ cfg.tokenizer.vocab_path = str(
+     artifacts_dir / "4cf78c8ca4ca44fca36c3754478fb188_vocab.txt"
+ )
+ cfg.tokenizer.spe_tokenizer_vocab = str(
+     artifacts_dir / "a4715c7f6b2d4c2bb709306073d0c0a4_tokenizer.vocab"
+ )
+
+ model = EncDecRNNTBPEModel.load_from_checkpoint(str(ckpt_path), cfg=cfg)
+ model.eval().cuda()
+
+ hypotheses = model.transcribe(
+     ["audio.wav"],
+     return_hypotheses=True,
+     timestamps=True,
+ )
+
+ if isinstance(hypotheses, tuple):
+     hypotheses = hypotheses[0]
+
+ hyp = hypotheses[0]
+ print(hyp.text)
+ print(hyp.timestamp["word"])
+ ```
+
+ Tested in ASTRA with `nemo_toolkit[asr]==2.7.3`.
+
+ ## Output Format
+
+ The model outputs normalized spoken text intended for downstream ATC formatting:
+
+ | Input audio says | Model outputs |
+ |-----------------|---------------|
+ | "CAMEL climb flight level zero nine zero" | `camel climb flight level zero nine zero` |
+ | "Contact Tengah Approach one three zero decimal zero" | `contact tengah approach one three zero decimal zero` |
+ | "Squawk four five two one" | `squawk four five two one` |
+
+ ASTRA then applies a deterministic formatter to convert normalized speech into display text such as `CAMEL climb FL090`.
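The README mentions a deterministic formatter but does not include it. As a rough illustration only, the sketch below covers just the flight-level row of the table above. The function name `format_flight_level`, the digit table, and the `callsigns` parameter are assumptions made for this example and are not ASTRA's actual rules.

```python
import re

# Spoken digit words mapped to characters (hypothetical table for this sketch).
DIGITS = {
    "zero": "0", "one": "1", "two": "2", "three": "3", "four": "4",
    "five": "5", "six": "6", "seven": "7", "eight": "8", "nine": "9",
}

_DIGIT_WORD = "|".join(DIGITS)
# "flight level" followed by three spoken digits.
_FLIGHT_LEVEL = re.compile(
    rf"\bflight level ((?:{_DIGIT_WORD})(?: (?:{_DIGIT_WORD})){{2}})\b"
)


def format_flight_level(text, callsigns=frozenset({"camel"})):
    """Collapse spoken flight levels to FLnnn and upper-case known callsigns."""
    def to_fl(match):
        return "FL" + "".join(DIGITS[word] for word in match.group(1).split())

    text = _FLIGHT_LEVEL.sub(to_fl, text)
    return " ".join(w.upper() if w in callsigns else w for w in text.split())


print(format_flight_level("camel climb flight level zero nine zero"))
# → CAMEL climb FL090
```

Keeping the rules as plain regular expressions keeps the step deterministic: the same normalized transcript always yields the same display text, and text with no matching pattern passes through unchanged.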
ASR/parakeet/artifacts/4cf78c8ca4ca44fca36c3754478fb188_vocab.txt ADDED
@@ -0,0 +1,1023 @@
+ t
+ th
+ a
+ ##in
+ the
+ ##re
+ w
+ o
+ s
+ ##at
+ ##ou
+ ##er
+ ##nd
+ i
+ b
+ c
+ ##on
+ h
+ ##ing
+ to
+ m
+ ##en
+ f
+ p
+ ##an
+ d
+ ##es
+ ##or
+ ##ll
+ of
+ and
+ y
+ l
+ I
+ ##it
+ in
+ ##is
+ ##ed
+ g
+ you
+ ##ar
+ that
+ ##om
+ ##as
+ n
+ ##ve
+ ##us
+ ##ic
+ ##ow
+ ##al
+ it
+ be
+ wh
+ ##le
+ ##ion
+ ##ut
+ ##ot
+ we
+ is
+ e
+ ##et
+ ##ay
+ re
+ on
+ T
+ A
+ ha
+ ##ent
+ ##ke
+ ##ct
+ S
+ ##ig
+ ##ver
+ Th
+ ##all
+ ##id
+ for
+ ##ro
+ he
+ ##se
+ this
+ ##ld
+ ##ly
+ go
+ k
+ st
+ ##st
+ ##ch
+ li
+ u
+ ##am
+ ##ur
+ ##ce
+ ##ith
+ ##im
+ so
+ have
+ do
+ ##ht
+ ##th
+ an
+ with
+ ##ad
+ r
+ ##ir
+ was
+ as
+ W
+ are
+ ##ust
+ ##ally
+ j
+ se
+ ##ation
+ ##od
+ ##ere
+ like
+ not
+ kn
+ ##ight
+ B
+ they
+ And
+ know
+ ##ome
+ ##op
+ can
+ or
+ sh
+ me
+ ##ill
+ ##ant
+ ##ck
+ what
+ at
+ ab
+ ##ould
+ ##ol
+ So
+ C
+ ##use
+ ##ter
+ ##il
+ but
+ just
+ ne
+ de
+ ##ra
+ ##ore
+ there
+ ##ul
+ ##out
+ con
+ all
+ The
+ ##ers
+ H
+ fr
+ pro
+ ##ge
+ ##ea
+ Y
+ O
+ M
+ ##pp
+ com
+ ##ess
+ ch
+ al
+ ##est
+ ##ate
+ ##qu
+ lo
+ ex
+ ##very
+ su
+ ##ain
+ one
+ ##ca
+ ##art
+ ##ist
+ ##if
+ ##ive
+ if
+ ##ink
+ ##nt
+ ##ab
+ about
+ going
+ v
+ wor
+ ##um
+ ##ok
+ your
+ my
+ ##ind
+ get
+ ##cause
+ from
+ don
+ ##ri
+ ##pe
+ ##un
+ ##ity
+ up
+ P
+ out
+ ##ort
+ L
+ ##ment
+ ##el
+ N
+ some
+ ##ich
+ ##and
+ think
+ ##em
+ ##oug
+ G
+ ##os
+ D
+ ##res
+ because
+ by
+ ##ake
+ int
+ ##ie
+ us
+ tr
+ then
+ ##ack
+ pl
+ here
+ pe
+ ##her
+ will
+ F
+ which
+ ##ard
+ right
+ thing
+ want
+ ##ies
+ ##ople
+ It
+ them
+ ##ame
+ We
+ ##our
+ say
+ R
+ people
+ see
+ who
+ ##ast
+ ##ure
+ ##ect
+ ##ear
+ tim
+ E
+ You
+ would
+ when
+ ##ven
+ our
+ ##ci
+ really
+ more
+ ##ound
+ ##ose
+ ##ak
+ co
+ ##ide
+ ##ough
+ had
+ ##so
+ qu
+ ##eah
+ were
+ ##ine
+ act
+ ##ther
+ these
+ how
+ now
+ sa
+ ##ud
+ Wh
+ man
+ ##ous
+ ##one
+ ##pt
+ ##ff
+ ##ong
+ has
+ any
+ very
+ But
+ look
+ ##iv
+ ##itt
+ time
+ mo
+ ar
+ ##hing
+ le
+ work
+ their
+ ##are
+ his
+ ##per
+ ##ions
+ im
+ ag
+ J
+ no
+ en
+ got
+ ##ag
+ sp
+ ##ans
+ ##act
+ te
+ also
+ ##iz
+ ##ice
+ That
+ cl
+ been
+ way
+ fe
+ did
+ ##ple
+ ##ually
+ other
+ U
+ ##ite
+ ##age
+ ##omet
+ ##ber
+ ##reat
+ ##ree
+ into
+ ##own
+ tw
+ part
+ ##alk
+ where
+ need
+ every
+ ##pl
+ ad
+ ##ry
+ over
+ ##ble
+ ##ap
+ ##ue
+ kind
+ po
+ back
+ cont
+ ##iff
+ somet
+ pr
+ ##nder
+ ##ire
+ good
+ than
+ ##ace
+ gu
+ ##ep
+ ##og
+ ##ick
+ ##way
+ lot
+ un
+ things
+ In
+ ##ish
+ ##kay
+ well
+ could
+ pre
+ two
+ ##irst
+ diff
+ ##ach
+ ##cc
+ ##ittle
+ ##int
+ He
+ those
+ ##ence
+ ##ip
+ ##ase
+ him
+ make
+ little
+ ##ical
+ gr
+ year
+ ##ass
+ thr
+ ##uch
+ ##ated
+ This
+ off
+ res
+ ##ac
+ ##ance
+ actually
+ talk
+ ##ult
+ ##able
+ ##orm
+ dis
+ first
+ ##ations
+ something
+ she
+ ##sel
+ let
+ ##ord
+ may
+ ##ia
+ am
+ her
+ said
+ bo
+ ##be
+ ##ount
+ much
+ per
+ even
+ differe
+ ##vel
+ ##ary
+ app
+ ##ving
+ comm
+ imp
+ ##ys
+ again
+ ##ress
+ yeah
+ down
+ ##ang
+ mean
+ ##na
+ ##ens
+ does
+ fo
+ comp
+ ro
+ bl
+ ##ody
+ K
+ through
+ start
+ ##uct
+ only
+ bet
+ under
+ br
+ take
+ ##ning
+ bu
+ use
+ Ch
+ ##xt
+ ##co
+ ##ory
+ ##ild
+ put
+ call
+ new
+ ##other
+ ##ting
+ happ
+ ##ater
+ inc
+ ##ition
+ different
+ should
+ ##ade
+ ##ign
+ ##thing
+ day
+ ##fore
+ Yeah
+ ##ark
+ ##ile
+ ##ial
+ come
+ They
+ being
+ try
+ ##ious
+ sc
+ bit
+ spe
+ ##ub
+ ##fe
+ doing
+ St
+ ##vers
+ ##av
+ ##ty
+ ##ian
+ ##onna
+ ##red
+ ##wn
+ ke
+ ##form
+ ##ors
+ fl
+ ##fter
+ ##ail
+ ##ents
+ gonna
+ point
+ ##ces
+ There
+ ##self
+ many
+ If
+ same
+ sy
+ quest
+ most
+ great
+ What
+ fu
+ ##ug
+ show
+ ##we
+ ##ual
+ ##ons
+ Be
+ ##ically
+ ser
+ rem
+ ind
+ pers
+ V
+ ##he
+ str
+ ##ved
+ still
+ ##ank
+ rec
+ wr
+ ##ought
+ ##day
+ ##ath
+ end
+ bas
+ ##ft
+ ##erm
+ ##body
+ ##ph
+ ##ject
+ ##ict
+ play
+ Is
+ ##ates
+ ph
+ ##oth
+ acc
+ ##get
+ years
+ em
+ id
+ Oh
+ ##ves
+ ##ever
+ inter
+ rel
+ before
+ feel
+ ##igh
+ three
+ ##iss
+ des
+ ##ne
+ why
+ uh
+ To
+ cons
+ hel
+ after
+ ##ower
+ ##urn
+ okay
+ long
+ bel
+ around
+ ##ful
+ ##te
+ ##ise
+ ob
+ supp
+ ##ady
+ ##ange
+ ##aking
+ pos
+ ##atch
+ tra
+ ##gr
+ might
+ ##ert
+ help
+ ##ost
+ too
+ ##cial
+ world
+ give
+ ##ike
+ Okay
+ ##ways
+ min
+ ##ward
+ ##ily
+ gen
+ find
+ dec
+ ##ular
+ ##ob
+ tell
+ Now
+ sm
+ cour
+ real
+ ##cess
+ ##nds
+ big
+ num
+ ##ction
+ add
+ set
+ um
+ ##ood
+ ##ible
+ own
+ life
+ ##ities
+ its
+ God
+ ##pect
+ didn
+ ##stem
+ ##les
+ ##uc
+ ##ib
+ ##ating
+ ##olog
+ person
+ inv
+ ##ably
+ sure
+ reg
+ ##lic
+ stu
+ cr
+ ev
+ ##ments
+ another
+ la
+ last
+ sub
+ att
+ op
+ inst
+ sl
+ happen
+ rep
+ import
+ ##ific
+ ##ix
+ made
+ ear
+ ac
+ def
+ ##ute
+ next
+ ##ative
+ form
+ guys
+ system
+ ##ew
+ able
+ ##ied
+ always
+ ##ren
+ ##erest
+ As
+ mod
+ done
+ ##ings
+ love
+ ##ism
+ ask
+ ##old
+ ##ered
+ trans
+ count
+ ##ility
+ high
+ fin
+ prob
+ pol
+ exam
+ pres
+ maybe
+ ##ell
+ stud
+ prod
+ car
+ ##ock
+ used
+ ##oy
+ ##stand
+ No
+ mon
+ ##ks
+ interest
+ ent
+ ##ited
+ sort
+ For
+ today
+ ##ics
+ vide
+ bec
+ Well
+ Al
+ important
+ such
+ run
+ keep
+ fact
+ ##ata
+ ##ss
+ never
+ ##ween
+ stuff
+ ##ract
+ question
+ ##als
+ sim
+ ##vern
+ ##ather
+ course
+ Of
+ ##oc
+ ##ness
+ ##arch
+ ##ize
+ All
+ ##ense
+ ##blem
+ probably
+ ##hip
+ number
+ ##ention
+ saying
+ commun
+ An
+ ##akes
+ belie
+ between
+ better
+ ##cus
+ place
+ gener
+ ca
+ ins
+ ass
+ ##cond
+ ##cept
+ ##ull
+ understand
+ fun
+ thought
+ ##gan
+ ##iew
+ ##cy
+ ##ution
+ ##ope
+ ##ason
+ problem
+ doesn
+ ##ational
+ read
+ trying
+ sch
+ el
+ ##ah
+ ##atter
+ exper
+ four
+ ele
+ cou
+ ##ont
+ called
+ partic
+ open
+ gl
+ everything
+ eff
+ getting
+ ty
+ Am
+ Because
+ ##ave
+ met
+ Like
+ ##oney
+
+ ##e
+ ##t
+ ##o
+ ##a
+ ##n
+ ##i
+ ##s
+ ##h
+ ##r
+ ##l
+ ##d
+ ##u
+ ##c
+ ##y
+ ##m
+ ##g
+ ##w
+ ##f
+ ##p
+ ##,
+ ##b
+ ##.
+ ##k
+ ##v
+ ##'
+ ##I
+ ##T
+ ##A
+ ##S
+ ##j
+ ##x
+ ##W
+ ##B
+ ##C
+ ##?
+ ##0
+ ##O
+ ##-
+ ##M
+ ##H
+ ##Y
+ ##q
+ ##1
+ ##P
+ ##z
+ ##L
+ ##D
+ ##N
+ ##G
+ ##F
+ ##R
+ ##E
+ ##2
+ ##J
+ ##U
+ ##:
+ ##5
+ ##9
+ ##3
+ ##K
+ ##4
+ ##V
+ ##8
+ ##6
+ ##7
+ ##!
+ ##%
+ ##Q
+ ##$
+ ##Z
+ ##X
+ ##é
+ ##/
+ ##í
+ ##á
+ ##£
+ ##ó
+ ##ā
+ ##ü
+ ##ñ
+ ##ö
+ ##è
+ ##ç
+ ##à
+ ##¿
+ ##μ
+ ##π
+ ##ä
+ ##ú
+ ##θ
+ ##ã
+ ##φ
+ ##ī
+ ##σ
+ ##ê
+ ##ρ
+ ##â
+ ##ô
+ ##^
+ ##€
+ ##É
+ ##ū
+ ##Δ
+ ##λ
+ ##α
+ ##τ
+ ##æ
+ ##а
+ ##о
+ ##ν
+ ##î
+ ##γ
+ ##ψ
+ ##ē
+ ##т
+ ##ß
+ ##ω
+ ##ï
+ ##ć
+ ##č
+ ##ε
+ ##е
+ ##и
+ ##ò
+ ##р
+ ##β
+ ##ø
+ ##ł
+ ##δ
+ ##η
+ ##п
+ ##ë
+ ##н
+ ##с
+ ##š
+ ##Ü
+ ##å
+ ##ń
+ ##ś
+ ##я
+ ##đ
+ ##л
+ ##м
+ ##Ö
+ ##û
+ ##ș
+ ##в
+ ##Á
+ ##Ø
+ ##ù
+ ##ο
+ ##ч
+ ##ь
+ ##ž
+ ##Φ
+ ##у
+ ##ę
+ ##ι
+ ##б
+ ##г
+ ##к
+ ##ő
+ ##Ś
+ ##Ω
+ ##κ
+ ##υ
+ ##ì
+ ##Č
+ ##έ
+ ##х
+ ##ы
+ ##Å
+ ##Ç
+ ##ż
+ ##ί
+ ##ζ
+ ##χ
+ ##э
+ ##Æ
+ ##Í
+ ##õ
+ ##ě
+ ##ħ
+ ##Ł
+ ##œ
+ ##Ž
+ ##ț
+ ##Γ
+ ##П
+ ##д
+ ##з
+ ##ф
+ ##¡
+ ##À
+ ##Î
+ ##Ā
+ ##ė
+ ##Š
+ ##ź
+ ##Κ
+ ##Ψ
+ ##ά
+ ##ξ
+ ##ό
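The entries in this file appear to mirror the SentencePiece pieces in the `tokenizer.vocab` file from the same commit: a piece that starts a word (leading `▁`) is listed bare, and a piece that continues a word gets a `##` prefix. The sketch below illustrates that mapping for the first few pieces. It is inferred from comparing the two files, the name `to_wordpiece_style` is hypothetical, and it is not NeMo's own conversion code.

```python
WORD_START = "\u2581"  # SentencePiece word-boundary marker "▁"


def to_wordpiece_style(piece):
    """Map a SentencePiece piece to the ##-continuation style seen in vocab.txt."""
    if piece.startswith(WORD_START):
        # Word-initial piece: drop the boundary marker.
        return piece[len(WORD_START):]
    # Word-internal piece: mark it as a continuation.
    return "##" + piece


# First few pieces of tokenizer.vocab (after <unk>), scores dropped.
pieces = ["▁t", "▁th", "▁a", "in", "▁the", "re"]
print([to_wordpiece_style(p) for p in pieces])
# → ['t', 'th', 'a', '##in', 'the', '##re']
```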
ASR/parakeet/artifacts/705f11d22dc04b169effc35ce5cd1361_tokenizer.model ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:5a3a82c48998709f9bc9f5db2924d99a899d20bf82550ca52272da7c7557b0a0
+ size 251396
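The three lines above are a Git LFS pointer, not the tokenizer model itself; the 251,396-byte binary is held in LFS storage. A minimal sketch of reading such a pointer follows. `parse_lfs_pointer` is a hypothetical helper, and it assumes the simple `key value` line layout shown in this diff.

```python
def parse_lfs_pointer(text):
    """Split each 'key value' line of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The size field is a byte count; oid has the form "<hash-method>:<hex digest>".
    fields["size"] = int(fields["size"])
    return fields


pointer = """\
version https://git-lfs.github.com/spec/v1
oid sha256:5a3a82c48998709f9bc9f5db2924d99a899d20bf82550ca52272da7c7557b0a0
size 251396
"""
info = parse_lfs_pointer(pointer)
print(info["oid"].split(":", 1)[0], info["size"])  # → sha256 251396
```

If a checkout shows this small text file where the binary model is expected, the LFS objects have probably not been fetched yet.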
ASR/parakeet/artifacts/a4715c7f6b2d4c2bb709306073d0c0a4_tokenizer.vocab ADDED
@@ -0,0 +1,1024 @@
+ <unk> 0
+ ▁t -0
+ ▁th -1
+ ▁a -2
+ in -3
+ ▁the -4
+ re -5
+ ▁w -6
+ ▁o -7
+ ▁s -8
+ at -9
+ ou -10
+ er -11
+ nd -12
+ ▁i -13
+ ▁b -14
+ ▁c -15
+ on -16
+ ▁h -17
+ ing -18
+ ▁to -19
+ ▁m -20
+ en -21
+ ▁f -22
+ ▁p -23
+ an -24
+ ▁d -25
+ es -26
+ or -27
+ ll -28
+ ▁of -29
+ ▁and -30
+ ▁y -31
+ ▁l -32
+ ▁I -33
+ it -34
+ ▁in -35
+ is -36
+ ed -37
+ ▁g -38
+ ▁you -39
+ ar -40
+ ▁that -41
+ om -42
+ as -43
+ ▁n -44
+ ve -45
+ us -46
+ ic -47
+ ow -48
+ al -49
+ ▁it -50
+ ▁be -51
+ ▁wh -52
+ le -53
+ ion -54
+ ut -55
+ ot -56
+ ▁we -57
+ ▁is -58
+ ▁e -59
+ et -60
+ ay -61
+ ▁re -62
+ ▁on -63
+ ▁T -64
+ ▁A -65
+ ▁ha -66
+ ent -67
+ ke -68
+ ct -69
+ ▁S -70
+ ig -71
+ ver -72
+ ▁Th -73
+ all -74
+ id -75
+ ▁for -76
+ ro -77
+ ▁he -78
+ se -79
+ ▁this -80
+ ld -81
+ ly -82
+ ▁go -83
+ ▁k -84
+ ▁st -85
+ st -86
+ ch -87
+ ▁li -88
+ ▁u -89
+ am -90
+ ur -91
+ ce -92
+ ith -93
+ im -94
+ ▁so -95
+ ▁have -96
+ ▁do -97
+ ht -98
+ th -99
+ ▁an -100
+ ▁with -101
+ ad -102
+ ▁r -103
+ ir -104
+ ▁was -105
+ ▁as -106
+ ▁W -107
+ ▁are -108
+ ust -109
+ ally -110
+ ▁j -111
+ ▁se -112
+ ation -113
+ od -114
+ ere -115
+ ▁like -116
+ ▁not -117
+ ▁kn -118
+ ight -119
+ ▁B -120
+ ▁they -121
+ ▁And -122
+ ▁know -123
+ ome -124
+ op -125
+ ▁can -126
+ ▁or -127
+ ▁sh -128
+ ▁me -129
+ ill -130
+ ant -131
+ ck -132
+ ▁what -133
+ ▁at -134
+ ▁ab -135
+ ould -136
+ ol -137
+ ▁So -138
+ ▁C -139
+ use -140
+ ter -141
+ il -142
+ ▁but -143
+ ▁just -144
+ ▁ne -145
+ ▁de -146
+ ra -147
+ ore -148
+ ▁there -149
+ ul -150
+ out -151
+ ▁con -152
+ ▁all -153
+ ▁The -154
+ ers -155
+ ▁H -156
+ ▁fr -157
+ ▁pro -158
+ ge -159
+ ea -160
+ ▁Y -161
+ ▁O -162
+ ▁M -163
+ pp -164
+ ▁com -165
+ ess -166
+ ▁ch -167
+ ▁al -168
+ est -169
+ ate -170
+ qu -171
+ ▁lo -172
+ ▁ex -173
+ very -174
+ ▁su -175
+ ain -176
+ ▁one -177
+ ca -178
+ art -179
+ ist -180
+ if -181
+ ive -182
+ ▁if -183
+ ink -184
+ nt -185
+ ab -186
+ ▁about -187
+ ▁going -188
+ ▁v -189
+ ▁wor -190
+ um -191
+ ok -192
+ ▁your -193
+ ▁my -194
+ ind -195
+ ▁get -196
+ cause -197
+ ▁from -198
+ ▁don -199
+ ri -200
+ pe -201
+ un -202
+ ity -203
+ ▁up -204
+ ▁P -205
+ ▁out -206
+ ort -207
+ ▁L -208
+ ment -209
+ el -210
+ ▁N -211
+ ▁some -212
+ ich -213
+ and -214
+ ▁think -215
+ em -216
+ oug -217
+ ▁G -218
+ os -219
+ ▁D -220
+ res -221
+ ▁because -222
+ ▁by -223
+ ake -224
+ ▁int -225
+ ie -226
+ ▁us -227
+ ▁tr -228
+ ▁then -229
+ ack -230
+ ▁pl -231
+ ▁here -232
+ ▁pe -233
+ her -234
+ ▁will -235
+ ▁F -236
+ ▁which -237
+ ard -238
+ ▁right -239
+ ▁thing -240
+ ▁want -241
+ ies -242
+ ople -243
+ ▁It -244
+ ▁them -245
+ ame -246
+ ▁We -247
+ our -248
+ ▁say -249
+ ▁R -250
+ ▁people -251
+ ▁see -252
+ ▁who -253
+ ast -254
+ ure -255
+ ect -256
+ ear -257
+ ▁tim -258
+ ▁E -259
+ ▁You -260
+ ▁would -261
+ ▁when -262
+ ven -263
+ ▁our -264
+ ci -265
+ ▁really -266
+ ▁more -267
+ ound -268
+ ose -269
+ ak -270
+ ▁co -271
+ ide -272
+ ough -273
+ ▁had -274
+ so -275
+ ▁qu -276
+ eah -277
+ ▁were -278
+ ine -279
+ ▁act -280
+ ther -281
+ ▁these -282
+ ▁how -283
+ ▁now -284
+ ▁sa -285
+ ud -286
+ ▁Wh -287
+ ▁man -288
+ ous -289
+ one -290
+ pt -291
+ ff -292
+ ong -293
+ ▁has -294
+ ▁any -295
+ ▁very -296
+ ▁But -297
+ ▁look -298
+ iv -299
+ itt -300
+ ▁time -301
+ ▁mo -302
+ ▁ar -303
+ hing -304
+ ▁le -305
+ ▁work -306
+ ▁their -307
+ are -308
+ ▁his -309
+ per -310
+ ions -311
+ ▁im -312
+ ▁ag -313
+ ▁J -314
+ ▁no -315
+ ▁en -316
+ ▁got -317
+ ag -318
+ ▁sp -319
+ ans -320
+ act -321
+ ▁te -322
+ ▁also -323
+ iz -324
+ ice -325
+ ▁That -326
+ ▁cl -327
+ ▁been -328
+ ▁way -329
+ ▁fe -330
+ ▁did -331
+ ple -332
+ ually -333
+ ▁other -334
+ ▁U -335
+ ite -336
+ age -337
+ omet -338
+ ber -339
+ reat -340
+ ree -341
+ ▁into -342
+ own -343
+ ▁tw -344
+ ▁part -345
+ alk -346
+ ▁where -347
+ ▁need -348
+ ▁every -349
+ pl -350
+ ▁ad -351
+ ry -352
+ ▁over -353
+ ble -354
+ ap -355
+ ue -356
+ ▁kind -357
+ ▁po -358
+ ▁back -359
+ ▁cont -360
+ iff -361
+ ▁somet -362
+ ▁pr -363
+ nder -364
+ ire -365
+ ▁good -366
+ ▁than -367
+ ace -368
+ ▁gu -369
+ ep -370
+ og -371
+ ick -372
+ way -373
+ ▁lot -374
+ ▁un -375
+ ▁things -376
+ ▁In -377
+ ish -378
+ kay -379
+ ▁well -380
+ ▁could -381
+ ▁pre -382
+ ▁two -383
+ irst -384
+ ▁diff -385
+ ach -386
+ cc -387
+ ittle -388
+ int -389
+ ▁He -390
+ ▁those -391
+ ence -392
+ ip -393
+ ase -394
+ ▁him -395
+ ▁make -396
+ ▁little -397
+ ical -398
+ ▁gr -399
+ ▁year -400
+ ass -401
+ ▁thr -402
+ uch -403
+ ated -404
+ ▁This -405
+ ▁off -406
+ ▁res -407
+ ac -408
+ ance -409
+ ▁actually -410
+ ▁talk -411
+ ult -412
+ able -413
+ orm -414
+ ▁dis -415
+ ▁first -416
+ ations -417
+ ▁something -418
+ ▁she -419
+ sel -420
+ ▁let -421
+ ord -422
+ ▁may -423
+ ia -424
+ ▁am -425
+ ▁her -426
+ ▁said -427
+ ▁bo -428
+ be -429
+ ount -430
+ ▁much -431
+ ▁per -432
+ ▁even -433
+ ▁differe -434
+ vel -435
+ ary -436
+ ▁app -437
+ ving -438
+ ▁comm -439
+ ▁imp -440
+ ys -441
+ ▁again -442
+ ress -443
+ ▁yeah -444
+ ▁down -445
+ ang -446
+ ▁mean -447
+ na -448
+ ens -449
+ ▁does -450
+ ▁fo -451
+ ▁comp -452
+ ▁ro -453
+ ▁bl -454
+ ody -455
+ ▁K -456
+ ▁through -457
+ ▁start -458
+ uct -459
+ ▁only -460
+ ▁bet -461
+ ▁under -462
+ ▁br -463
+ ▁take -464
+ ning -465
+ ▁bu -466
+ ▁use -467
+ ▁Ch -468
+ xt -469
+ co -470
+ ory -471
+ ild -472
+ ▁put -473
+ ▁call -474
+ ▁new -475
+ other -476
+ ting -477
+ ▁happ -478
+ ater -479
+ ▁inc -480
+ ition -481
+ ▁different -482
+ ▁should -483
+ ade -484
+ ign -485
+ thing -486
+ ▁day -487
+ fore -488
+ ▁Yeah -489
+ ark -490
+ ile -491
+ ial -492
+ ▁come -493
+ ▁They -494
+ ▁being -495
+ ▁try -496
+ ious -497
+ ▁sc -498
+ ▁bit -499
+ ▁spe -500
+ ub -501
+ fe -502
+ ▁doing -503
+ ▁St -504
+ vers -505
+ av -506
+ ty -507
+ ian -508
+ onna -509
+ red -510
+ wn -511
+ ▁ke -512
+ form -513
+ ors -514
+ ▁fl -515
+ fter -516
+ ail -517
+ ents -518
+ ▁gonna -519
+ ▁point -520
+ ces -521
+ ▁There -522
+ self -523
+ ▁many -524
+ ▁If -525
+ ▁same -526
+ ▁sy -527
+ ▁quest -528
+ ▁most -529
+ ▁great -530
+ ▁What -531
+ ▁fu -532
+ ug -533
+ ▁show -534
+ we -535
+ ual -536
+ ons -537
+ ▁Be -538
+ ically -539
+ ▁ser -540
+ ▁rem -541
+ ▁ind -542
+ ▁pers -543
+ ▁V -544
+ he -545
+ ▁str -546
+ ved -547
+ ▁still -548
+ ank -549
+ ▁rec -550
+ ▁wr -551
+ ought -552
+ day -553
+ ath -554
+ ▁end -555
+ ▁bas -556
+ ft -557
+ erm -558
+ body -559
+ ph -560
+ ject -561
+ ict -562
+ ▁play -563
+ ▁Is -564
+ ates -565
+ ▁ph -566
+ oth -567
+ ▁acc -568
+ get -569
+ ▁years -570
+ ▁em -571
+ ▁id -572
+ ▁Oh -573
+ ves -574
+ ever -575
+ ▁inter -576
+ ▁rel -577
+ ▁before -578
+ ▁feel -579
+ igh -580
+ ▁three -581
+ iss -582
+ ▁des -583
+ ne -584
+ ▁why -585
+ ▁uh -586
+ ▁To -587
+ ▁cons -588
+ ▁hel -589
+ ▁after -590
+ ower -591
+ urn -592
+ ▁okay -593
+ ▁long -594
+ ▁bel -595
+ ▁around -596
+ ful -597
+ te -598
+ ise -599
+ ▁ob -600
+ ▁supp -601
+ ady -602
+ ange -603
+ aking -604
+ ▁pos -605
+ atch -606
+ ▁tra -607
+ gr -608
+ ▁might -609
+ ert -610
+ ▁help -611
+ ost -612
+ ▁too -613
+ cial -614
+ ▁world -615
+ ▁give -616
+ ike -617
+ ▁Okay -618
+ ways -619
+ ▁min -620
+ ward -621
+ ily -622
+ ▁gen -623
+ ▁find -624
+ ▁dec -625
+ ular -626
+ ob -627
+ ▁tell -628
+ ▁Now -629
+ ▁sm -630
+ ▁cour -631
+ ▁real -632
+ cess -633
+ nds -634
+ ▁big -635
+ ▁num -636
+ ction -637
+ ▁add -638
+ ▁set -639
+ ▁um -640
+ ood -641
+ ible -642
+ ▁own -643
+ ▁life -644
+ ities -645
+ ▁its -646
+ ▁God -647
+ pect -648
+ ▁didn -649
+ stem -650
+ les -651
+ uc -652
+ ib -653
+ ating -654
+ olog -655
+ ▁person -656
+ ▁inv -657
+ ably -658
+ ▁sure -659
+ ▁reg -660
+ lic -661
+ ▁stu -662
+ ▁cr -663
+ ▁ev -664
+ ments -665
+ ▁another -666
+ ▁la -667
+ ▁last -668
+ ▁sub -669
+ ▁att -670
+ ▁op -671
+ ▁inst -672
+ ▁sl -673
+ ▁happen -674
+ ▁rep -675
+ ▁import -676
+ ific -677
+ ix -678
+ ▁made -679
+ ▁ear -680
+ ▁ac -681
+ ▁def -682
+ ute -683
+ ▁next -684
+ ative -685
+ ▁form -686
+ ▁guys -687
+ ▁system -688
+ ew -689
+ ▁able -690
+ ied -691
+ ▁always -692
+ ren -693
+ erest -694
+ ▁As -695
+ ▁mod -696
+ ▁done -697
+ ings -698
+ ▁love -699
+ ism -700
+ ▁ask -701
+ old -702
+ ered -703
+ ▁trans -704
+ ▁count -705
+ ility -706
+ ▁high -707
+ ▁fin -708
+ ▁prob -709
+ ▁pol -710
+ ▁exam -711
+ ▁pres -712
+ ▁maybe -713
+ ell -714
+ ▁stud -715
+ ▁prod -716
+ ▁car -717
+ ock -718
+ ▁used -719
+ oy -720
+ stand -721
+ ▁No -722
+ ▁mon -723
+ ks -724
+ ▁interest -725
+ ▁ent -726
+ ited -727
+ ▁sort -728
+ ▁For -729
+ ▁today -730
+ ics -731
+ ▁vide -732
+ ▁bec -733
+ ▁Well -734
+ ▁Al -735
+ ▁important -736
+ ▁such -737
+ ▁run -738
+ ▁keep -739
+ ▁fact -740
+ ata -741
+ ss -742
+ ▁never -743
+ ween -744
+ ▁stuff -745
+ ract -746
+ ▁question -747
+ als -748
+ ▁sim -749
+ vern -750
+ ather -751
+ ▁course -752
+ ▁Of -753
+ oc -754
+ ness -755
+ arch -756
+ ize -757
+ ▁All -758
+ ense -759
+ blem -760
+ ▁probably -761
+ hip -762
+ ▁number -763
+ ention -764
+ ▁saying -765
+ ▁commun -766
+ ▁An -767
+ akes -768
+ ▁belie -769
+ ▁between -770
+ ▁better -771
+ cus -772
+ ▁place -773
+ ▁gener -774
+ ▁ca -775
+ ▁ins -776
+ ▁ass -777
+ cond -778
+ cept -779
+ ull -780
+ ▁understand -781
+ ▁fun -782
+ ▁thought -783
+ gan -784
+ iew -785
+ cy -786
+ ution -787
+ ope -788
+ ason -789
+ ▁problem -790
+ ▁doesn -791
+ ational -792
+ ▁read -793
+ ▁trying -794
+ ▁sch -795
+ ▁el -796
+ ah -797
+ atter -798
+ ▁exper -799
+ ▁four -800
+ ▁ele -801
+ ▁cou -802
+ ont -803
+ ▁called -804
+ ▁partic -805
+ ▁open -806
+ ▁gl -807
+ ▁everything -808
+ ▁eff -809
+ ▁getting -810
+ ▁ty -811
+ ▁Am -812
+ ▁Because -813
+ ave -814
+ ▁met -815
+ ▁Like -816
+ oney -817
+ ▁ -818
+ e -819
+ t -820
+ o -821
+ a -822
+ n -823
+ i -824
+ s -825
+ h -826
+ r -827
+ l -828
+ d -829
+ u -830
+ c -831
+ y -832
+ m -833
+ g -834
+ w -835
+ f -836
+ p -837
+ , -838
+ b -839
+ . -840
+ k -841
+ v -842
+ ' -843
+ I -844
+ T -845
+ A -846
+ S -847
+ j -848
+ x -849
+ W -850
+ B -851
+ C -852
+ ? -853
+ 0 -854
+ O -855
+ - -856
+ M -857
+ H -858
+ Y -859
+ q -860
+ 1 -861
+ P -862
+ z -863
+ L -864
+ D -865
+ N -866
+ G -867
+ F -868
+ R -869
+ E -870
+ 2 -871
+ J -872
+ U -873
+ : -874
+ 5 -875
+ 9 -876
+ 3 -877
+ K -878
+ 4 -879
+ V -880
+ 8 -881
+ 6 -882
+ 7 -883
+ ! -884
+ % -885
+ % -885
888
+ Q -886
889
+ $ -887
890
+ Z -888
891
+ X -889
892
+ é -890
893
+ / -891
894
+ í -892
895
+ á -893
896
+ £ -894
897
+ ó -895
898
+ ā -896
899
+ ü -897
900
+ ñ -898
901
+ ö -899
902
+ è -900
903
+ ç -901
904
+ à -902
905
+ ¿ -903
906
+ μ -904
907
+ π -905
908
+ ä -906
909
+ ú -907
910
+ θ -908
911
+ ã -909
912
+ φ -910
913
+ ī -911
914
+ σ -912
915
+ ê -913
916
+ ρ -914
917
+ â -915
918
+ ô -916
919
+ ^ -917
920
+ € -918
921
+ É -919
922
+ ū -920
923
+ Δ -921
924
+ λ -922
925
+ α -923
926
+ τ -924
927
+ æ -925
928
+ а -926
929
+ о -927
930
+ ν -928
931
+ î -929
932
+ γ -930
933
+ ψ -931
934
+ ē -932
935
+ т -933
936
+ ß -934
937
+ ω -935
938
+ ï -936
939
+ ć -937
940
+ č -938
941
+ ε -939
942
+ е -940
943
+ и -941
944
+ ò -942
945
+ р -943
946
+ β -944
947
+ ø -945
948
+ ł -946
949
+ δ -947
950
+ η -948
951
+ п -949
952
+ ë -950
953
+ н -951
954
+ с -952
955
+ š -953
956
+ Ü -954
957
+ å -955
958
+ ń -956
959
+ ś -957
960
+ я -958
961
+ đ -959
962
+ л -960
963
+ м -961
964
+ Ö -962
965
+ û -963
966
+ ș -964
967
+ в -965
968
+ Á -966
969
+ Ø -967
970
+ ù -968
971
+ ο -969
972
+ ч -970
973
+ ь -971
974
+ ž -972
975
+ Φ -973
976
+ у -974
977
+ ę -975
978
+ ι -976
979
+ б -977
980
+ г -978
981
+ к -979
982
+ ő -980
983
+ Ś -981
984
+ Ω -982
985
+ κ -983
986
+ υ -984
987
+ ì -985
988
+ Č -986
989
+ έ -987
990
+ х -988
991
+ ы -989
992
+ Å -990
993
+ Ç -991
994
+ ż -992
995
+ ί -993
996
+ ζ -994
997
+ χ -995
998
+ э -996
999
+ Æ -997
1000
+ Í -998
1001
+ õ -999
1002
+ ě -1000
1003
+ ħ -1001
1004
+ Ł -1002
1005
+ œ -1003
1006
+ Ž -1004
1007
+ ț -1005
1008
+ Γ -1006
1009
+ П -1007
1010
+ д -1008
1011
+ з -1009
1012
+ ф -1010
1013
+ ¡ -1011
1014
+ À -1012
1015
+ Î -1013
1016
+ Ā -1014
1017
+ ė -1015
1018
+ Š -1016
1019
+ ź -1017
1020
+ Κ -1018
1021
+ Ψ -1019
1022
+ ά -1020
1023
+ ξ -1021
1024
+ ό -1022
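The entries above pair a subword piece with a score, where `▁` marks the start of a word. As a rough illustration, the sketch below reads such entries and joins pieces back into text. It assumes the underlying file is tab-separated (`piece<TAB>score`), which the rendered diff does not confirm, and the helper names `parse_vocab` and `detokenize` are made up for this example.

```python
from typing import List, Tuple

WORD_START = "\u2581"  # the "▁" marker that prefixes word-initial pieces


def parse_vocab(text: str) -> List[Tuple[str, float]]:
    """Parse 'piece<TAB>score' lines into (piece, score) pairs.

    Assumes a tab separator; lines without one are skipped.
    """
    entries = []
    for line in text.splitlines():
        piece, sep, score = line.rpartition("\t")
        if not sep or not piece:
            continue  # blank or malformed line
        entries.append((piece, float(score)))
    return entries


def detokenize(pieces: List[str]) -> str:
    """Join subword pieces and turn word-start markers back into spaces."""
    return "".join(pieces).replace(WORD_START, " ").strip()


if __name__ == "__main__":
    sample = "▁prob\t-709\nably\t-658\n▁\t-818\n"
    print(parse_vocab(sample))
    print(detokenize(["▁prob", "ably"]))
```

The word-start marker is itself a vocabulary entry (`▁` with score -818 above), so the parser must not treat it as whitespace.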
ASR/parakeet/model.ckpt ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8f2ff18de72abdbe7fbde1dc50be1cf5dd156f58d258940c25d33db14cc23bfc
3
+ size 7415423311
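The three lines above are a Git LFS pointer, not the checkpoint itself: they record the spec version, the SHA-256 of the real file, and its size in bytes. A minimal sketch of how a downloaded `model.ckpt` could be checked against such a pointer follows. The function names are hypothetical and this is not part of the repository's tooling.

```python
import hashlib
import os


def parse_lfs_pointer(text: str) -> dict:
    """Split 'key value' pointer lines into a small dict."""
    fields = {}
    for line in text.splitlines():
        key, _, value = line.strip().partition(" ")
        if key and value:
            fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the pointer's size and SHA-256."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    h = hashlib.sha256()
    with open(path, "rb") as f:
        # Stream in chunks so a multi-gigabyte checkpoint is never fully in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    pointer = parse_lfs_pointer(
        "version https://git-lfs.github.com/spec/v1\n"
        "oid sha256:8f2ff18de72abdbe7fbde1dc50be1cf5dd156f58d258940c25d33db14cc23bfc\n"
        "size 7415423311\n"
    )
    print(pointer["size"], "bytes")
```

Checking the size first is a cheap way to catch the common case where only the pointer text was cloned and the LFS object was never fetched.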
ASR/{README.md → whisper/README.md} RENAMED
@@ -26,10 +26,12 @@ model-index:
26
  value: 0.66
27
  ---
28
 
29
- # Whisper Large v3 Singapore Military ATC (CTranslate2 float16)
30
 
31
  Fine-tuned Whisper Large v3 for Singapore Air Force air traffic control speech recognition.
32
 
 
 
33
  ## Performance
34
 
35
  | Run | WER | Base | Data | Key Change |
@@ -95,7 +97,7 @@ See [hyperparameters.md](./hyperparameters.md) for full training configuration.
95
  ```python
96
  from faster_whisper import WhisperModel
97
 
98
- model = WhisperModel("path/to/ASR", device="cuda", compute_type="float16")
99
  segments, info = model.transcribe(
100
  "audio.wav",
101
  language="en",
@@ -113,7 +115,6 @@ segments, info = model.transcribe(
113
  ),
114
  )
115
  text = " ".join(seg.text.strip() for seg in segments)
116
- # "camel cleared i l s approach runway three six"
117
  ```
118
 
119
  ## Output Format
@@ -126,4 +127,4 @@ The model outputs **normalized spoken text** (lowercase, fully expanded):
126
  | "Contact Tengah Approach one three zero decimal zero" | `contact tengah approach one three zero decimal zero` |
127
  | "Squawk seven seven zero zero" | `squawk seven seven zero zero` |
128
 
129
- A companion rule-based formatter (23 deterministic rules, <1ms, 0 VRAM) converts to display text (e.g., `CAMEL climb FL090`). See the [ASTRA simpilot](https://github.com/aether-raid) pipeline for the full integration.
 
26
  value: 0.66
27
  ---
28
 
29
+ # Whisper Large v3 - Singapore Military ATC (Legacy CTranslate2 Backend)
30
 
31
  Fine-tuned Whisper Large v3 for Singapore Air Force air traffic control speech recognition.
32
 
33
+ > **Legacy backend.** This folder is kept for compatibility with the original ASTRA faster-whisper / CTranslate2 inference path. The newer NeMo checkpoint lives in [../parakeet](../parakeet).
34
+
35
  ## Performance
36
 
37
  | Run | WER | Base | Data | Key Change |
 
97
  ```python
98
  from faster_whisper import WhisperModel
99
 
100
+ model = WhisperModel("path/to/ASR/whisper", device="cuda", compute_type="float16")
101
  segments, info = model.transcribe(
102
  "audio.wav",
103
  language="en",
 
115
  ),
116
  )
117
  text = " ".join(seg.text.strip() for seg in segments)
 
118
  ```
119
 
120
  ## Output Format
 
127
  | "Contact Tengah Approach one three zero decimal zero" | `contact tengah approach one three zero decimal zero` |
128
  | "Squawk seven seven zero zero" | `squawk seven seven zero zero` |
129
 
130
+ A companion rule-based formatter converts this into display text such as `CAMEL climb FL090`.
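To make the formatter step concrete, here is a minimal sketch of two such rules: collapsing a spoken flight level into `FLnnn` and upper-casing known callsigns. It is not the ASTRA formatter. The rule set, the `CALLSIGNS` subset, and the function name `format_display` are assumptions chosen only to reproduce the `CAMEL climb FL090` example.

```python
import re

# Spoken digit words as they appear in the normalized ASR output.
DIGITS = {
    "zero": "0", "one": "1", "two": "2", "three": "3", "four": "4",
    "five": "5", "six": "6", "seven": "7", "eight": "8", "nine": "9",
}

# Hypothetical subset of callsigns; a real list would be much longer.
CALLSIGNS = {"camel", "ninja", "beetle", "taipan"}

_DIGIT_WORD = "|".join(DIGITS)
# "flight level" followed by two or three spoken digits.
_FLIGHT_LEVEL = re.compile(r"\bflight level((?: (?:%s)){2,3})\b" % _DIGIT_WORD)


def _flight_level(match: re.Match) -> str:
    digits = "".join(DIGITS[word] for word in match.group(1).split())
    return "FL" + digits.zfill(3)


def format_display(text: str) -> str:
    """Apply the two illustrative rules to normalized ASR text."""
    text = _FLIGHT_LEVEL.sub(_flight_level, text)
    words = [w.upper() if w in CALLSIGNS else w for w in text.split(" ")]
    return " ".join(words)


if __name__ == "__main__":
    print(format_display("camel climb flight level zero nine zero"))
```

Text that no rule covers passes through unchanged, so rules can be added one at a time without affecting the rest of the output.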
ASR/{config.json → whisper/config.json} RENAMED
File without changes
ASR/{hyperparameters.md → whisper/hyperparameters.md} RENAMED
File without changes
ASR/{model.bin → whisper/model.bin} RENAMED
File without changes
ASR/{preprocessor_config.json → whisper/preprocessor_config.json} RENAMED
File without changes
ASR/{tokenizer.json → whisper/tokenizer.json} RENAMED
File without changes
ASR/{vocabulary.json → whisper/vocabulary.json} RENAMED
File without changes
README.md CHANGED
@@ -3,6 +3,8 @@ language:
3
  - en
4
  license: other
5
  tags:
 
 
6
  - whisper
7
  - qwen3
8
  - ctranslate2
@@ -21,30 +23,41 @@ Fine-tuned models for Singapore military air traffic control, built for the [AST
21
 
22
  ## Pipeline
23
 
24
- ```
25
- Audio --> VAD (Silero) --> ASR (Whisper) --> Rule Formatter --> Display Text
26
  "camel climb flight level zero nine zero"
27
- "CAMEL climb FL090"
28
  ```
29
 
30
- The production pipeline uses a **rule-based formatter** (23 deterministic rules, <1ms, 0 VRAM) instead of the LLM. The LLM is retained for reference.
31
 
32
  ## Models
33
 
34
- ### [ASR/](./ASR) Whisper Large v3 (CTranslate2 float16)
35
 
36
  Fine-tuned for Singapore military ATC speech. Uses CTranslate2 float16 format for fast inference with [faster-whisper](https://github.com/SYSTRAN/faster-whisper).
37
 
38
  | Metric | Value |
39
  |--------|-------|
40
- | WER | **0.82%** |
41
  | Base model | `openai/whisper-large-v3` |
42
  | Size | 2.9 GB |
43
- | Training | Full fine-tune with enhanced VHF radio augmentation |
 
44
 
45
- ### [LLM/](./LLM) Qwen3-1.7B Display Formatter (Legacy)
46
 
47
- > **Legacy.** Superseded by a deterministic rule-based formatter. Retained for reference.
48
 
49
  Converts normalized ASR output into structured ATC display text.
50
 
@@ -56,26 +69,20 @@ Converts normalized ASR output into structured ATC display text.
56
 
57
  ## Architecture
58
 
 
 
59
  ```
60
- Audio --> VAD (Silero) --> ASR (Whisper ct2) --> Post-processing --> Rule Formatter --> Display Text
61
- ```
62
-
63
- | Component | Technology | Latency | VRAM |
64
- |-----------|-----------|---------|------|
65
- | VAD | Silero VAD (ONNX) | ~50ms | <100 MB |
66
- | ASR | Whisper Large v3 (CTranslate2) | ~500ms-2s | ~2 GB |
67
- | Formatter | 23 deterministic rules | <1ms | 0 MB |
68
 
69
- Total VRAM: ~2 GB (ASR only).
70
 
71
  ## Domain
72
 
73
- Singapore military ATC covering:
74
- - **Airbases**: Tengah (WSAT, runway 18/36), Paya Lebar (WSAP, runway 02/20)
75
- - **Aircraft**: F-16C/D, F-15SG, C-130, Hercules
76
- - **Approaches**: ILS, GCA, PAR, TACAN, DVOR/DME, VOR/DME, Visual Straight-in
77
- - **100+ callsigns**: CAMEL, NINJA, BEETLE, TAIPAN, MAVERICK, JAGUAR, LANCER, etc.
78
- - **Categories**: departure, approach, handoff, maneuver, landing, emergency, ground, recovery, pilot reports, military-specific ops
79
 
80
  ## Training History
81
 
@@ -87,9 +94,9 @@ Singapore military ATC covering:
87
  | ct2_run6 | 0.40% | jacktol/whisper-large-v3-finetuned-for-ATC | +augmentation, weight decay |
88
  | ct2_run7 | 0.24% | jacktol/whisper-large-v3-finetuned-for-ATC | Frozen encoder, +50 real recordings |
89
  | ct2_run8 | 0.66% | openai/whisper-large-v3 | Full retrain from base, enhanced augmentation |
90
- | **ct2_run9** | **0.82%** | openai/whisper-large-v3 | Expanded dataset (+MNSC, +deepdml, 17.8k train), 19 epochs |
91
 
92
- ### LLM (Legacy)
93
 
94
  | Run | Accuracy | Key Change |
95
  |-----|----------|------------|
@@ -98,24 +105,31 @@ Singapore military ATC covering:
98
 
99
  ## Quick Start
100
 
101
- ### ASR
102
 
103
  ```python
104
  from faster_whisper import WhisperModel
105
 
106
- model = WhisperModel("./ASR", device="cuda", compute_type="float16")
107
  segments, info = model.transcribe("audio.wav", language="en", beam_size=5)
108
  text = " ".join(seg.text.strip() for seg in segments)
109
  ```
110
 
111
- ### Download
112
 
113
  ```bash
114
- # Full repo (ASR + LLM)
115
  huggingface-cli download aether-raid/astra-atc-models --local-dir ./models
116
 
117
- # ASR only (recommended)
118
- huggingface-cli download aether-raid/astra-atc-models --include "ASR/*" --local-dir ./models
 
 
 
119
 
120
  # LLM only (legacy)
121
  huggingface-cli download aether-raid/astra-atc-models --include "LLM/*" --local-dir ./models
 
3
  - en
4
  license: other
5
  tags:
6
+ - nemo
7
+ - parakeet
8
  - whisper
9
  - qwen3
10
  - ctranslate2
 
23
 
24
  ## Pipeline
25
 
26
+ ```text
27
+ Audio --> VAD (Silero) --> ASR (Whisper or Parakeet) --> Rule Formatter --> Display Text
28
  "camel climb flight level zero nine zero"
29
+ "CAMEL climb FL090"
30
  ```
31
 
32
+ The production pipeline uses a deterministic rule-based formatter instead of the legacy LLM formatter.
33
 
34
  ## Models
35
 
36
+ ### [ASR/whisper/](./ASR/whisper) - Whisper Large v3 (Legacy CTranslate2 backend)
37
 
38
  Fine-tuned for Singapore military ATC speech. Uses CTranslate2 float16 format for fast inference with [faster-whisper](https://github.com/SYSTRAN/faster-whisper).
39
 
40
  | Metric | Value |
41
  |--------|-------|
42
+ | WER | **0.66%** |
43
  | Base model | `openai/whisper-large-v3` |
44
  | Size | 2.9 GB |
45
+ | Runtime | `faster-whisper` / CTranslate2 |
46
+
47
+ ### [ASR/parakeet/](./ASR/parakeet) - Parakeet-TDT 0.6B v2 (NeMo checkpoint)
48
+
49
+ Fine-tuned NeMo Parakeet model for Singapore military ATC speech. Published as a raw checkpoint together with the tokenizer artifacts required to restore it.
50
+
51
+ | Metric | Value |
52
+ |--------|-------|
53
+ | Validation WER | **1.14%** |
54
+ | Base model | `nvidia/parakeet-tdt-0.6b-v2` |
55
+ | Size | 7.0 GB |
56
+ | Runtime | `nemo_toolkit[asr]` |
57
 
58
+ ### [LLM/](./LLM) - Qwen3-1.7B Display Formatter (Legacy)
59
 
60
+ > **Legacy.** Superseded by the deterministic rule formatter. Retained for reference only.
61
 
62
  Converts normalized ASR output into structured ATC display text.
63
 
 
69
 
70
  ## Architecture
71
 
72
+ ```text
73
+ Audio --> VAD (Silero) --> ASR backend --> Post-processing --> Rule Formatter --> Display Text
74
  ```
75
 
76
+ | Component | Technology | Notes |
77
+ |-----------|------------|-------|
78
+ | VAD | Silero VAD | Shared frontend for both ASR backends |
79
+ | ASR (legacy) | Whisper Large v3 (CTranslate2) | Lower-memory legacy backend |
80
+ | ASR (current NeMo path) | Parakeet-TDT 0.6B v2 | Fine-tuned NeMo checkpoint |
81
+ | Formatter | Deterministic rules | Converts normalized speech to ATC display text |
82
 
83
  ## Domain
84
 
85
+ Singapore military ATC covering Tengah and Paya Lebar operations, military phraseology, 100+ callsigns, and approach / recovery / emergency traffic.
86
 
87
  ## Training History
88
 
 
94
  | ct2_run6 | 0.40% | jacktol/whisper-large-v3-finetuned-for-ATC | +augmentation, weight decay |
95
  | ct2_run7 | 0.24% | jacktol/whisper-large-v3-finetuned-for-ATC | Frozen encoder, +50 real recordings |
96
  | ct2_run8 | 0.66% | openai/whisper-large-v3 | Full retrain from base, enhanced augmentation |
97
+ | parakeet_atc | 1.14% | nvidia/parakeet-tdt-0.6b-v2 | NeMo fine-tune, best checkpoint at epoch 20 |
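The WER column in these tables is word error rate: substitutions, deletions, and insertions divided by the number of reference words. The sketch below shows the standard word-level edit-distance computation, assuming whitespace tokenization of already-normalized text. It is not the evaluation script behind the reported numbers, which this diff does not include.

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance / number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] = edit distance between the reference words seen so far and hyp[:j]
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        cur = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cur[j] = min(
                prev[j] + 1,                           # deletion
                cur[j - 1] + 1,                        # insertion
                prev[j - 1] + (ref_word != hyp_word),  # substitution or match
            )
        prev = cur
    return prev[-1] / len(ref)


if __name__ == "__main__":
    ref = "squawk seven seven zero zero"
    hyp = "squawk seven zero zero"
    print(f"{wer(ref, hyp):.2%}")
```

Because the models emit normalized spoken text (lowercase, digits spelled out), a dropped digit word counts as one full word error under this metric.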
98
 
99
+ ### LLM
100
 
101
  | Run | Accuracy | Key Change |
102
  |-----|----------|------------|
 
105
 
106
  ## Quick Start
107
 
108
+ ### Whisper ASR
109
 
110
  ```python
111
  from faster_whisper import WhisperModel
112
 
113
+ model = WhisperModel("./ASR/whisper", device="cuda", compute_type="float16")
114
  segments, info = model.transcribe("audio.wav", language="en", beam_size=5)
115
  text = " ".join(seg.text.strip() for seg in segments)
116
  ```
117
 
118
+ ### Parakeet ASR
119
+
120
+ See [ASR/parakeet/README.md](./ASR/parakeet/README.md) for the NeMo restore example and tokenizer artifact requirements.
121
+
122
+ ## Download
123
 
124
  ```bash
125
+ # Full repo
126
  huggingface-cli download aether-raid/astra-atc-models --local-dir ./models
127
 
128
+ # Whisper ASR only
129
+ huggingface-cli download aether-raid/astra-atc-models --include "ASR/whisper/*" --local-dir ./models
130
+
131
+ # Parakeet ASR only
132
+ huggingface-cli download aether-raid/astra-atc-models --include "ASR/parakeet/*" --local-dir ./models
133
 
134
  # LLM only (legacy)
135
  huggingface-cli download aether-raid/astra-atc-models --include "LLM/*" --local-dir ./models