DarylT01 commited on
Commit
74e28e3
·
1 Parent(s): 15e4f2e

fix(txt): add all texts, add utils for text processing

Browse files
Files changed (4) hide show
  1. adjectives.txt +250 -3
  2. adverbs.txt +283 -3
  3. conjunctions.txt +30 -3
  4. utils.py +51 -0
adjectives.txt CHANGED
@@ -1,3 +1,250 @@
1
- and
2
- but
3
- or
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ abundant
2
+ acidic
3
+ aggressive
4
+ agreeable
5
+ alive
6
+ ambitious
7
+ ancient
8
+ angry
9
+ ashy
10
+ attractive
11
+ bald
12
+ beautiful
13
+ better
14
+ bewildered
15
+ big
16
+ billions
17
+ bitter
18
+ black
19
+ blue
20
+ brave
21
+ breezy
22
+ brief
23
+ broad
24
+ bumpy
25
+ calm
26
+ careful
27
+ chilly
28
+ chubby
29
+ clean
30
+ clever
31
+ clumsy
32
+ cold
33
+ colossal
34
+ cool
35
+ crashing
36
+ creamy
37
+ crooked
38
+ cuddly
39
+ curved
40
+ damaged
41
+ damp
42
+ dazzling
43
+ dead
44
+ deafening
45
+ deep
46
+ defeated
47
+ delicious
48
+ delightful
49
+ dirty
50
+ disgusting
51
+ drab
52
+ dry
53
+ eager
54
+ easy
55
+ echoing
56
+ elegant
57
+ embarrassed
58
+ faint
59
+ faithful
60
+ famous
61
+ fancy
62
+ fat
63
+ few
64
+ fierce
65
+ fit
66
+ flabby
67
+ flaky
68
+ flat
69
+ fluffy
70
+ freezing
71
+ fresh
72
+ full
73
+ future
74
+ gentle
75
+ gifted
76
+ gigantic
77
+ glamorous
78
+ gorgeous
79
+ gray
80
+ greasy
81
+ great
82
+ green
83
+ grumpy
84
+ hallowed
85
+ handsome
86
+ happy
87
+ harsh
88
+ helpful
89
+ helpless
90
+ high
91
+ hissing
92
+ hollow
93
+ hot
94
+ howling
95
+ huge
96
+ hundreds
97
+ icy
98
+ immense
99
+ important
100
+ incalculable
101
+ inexpensive
102
+ itchy
103
+ jealous
104
+ jolly
105
+ juicy
106
+ kind
107
+ large
108
+ lazy
109
+ lemon
110
+ limited
111
+ little
112
+ lively
113
+ long
114
+ loose
115
+ loud
116
+ low
117
+ magnificent
118
+ mammoth
119
+ mango
120
+ many
121
+ massive
122
+ mealy
123
+ melodic
124
+ melted
125
+ microscopic
126
+ millions
127
+ miniature
128
+ modern
129
+ moldy
130
+ muscular
131
+ mushy
132
+ mysterious
133
+ narrow
134
+ nervous
135
+ nice
136
+ noisy
137
+ numerous
138
+ nutritious
139
+ nutty
140
+ obedient
141
+ obnoxious
142
+ odd
143
+ old
144
+ orange
145
+ panicky
146
+ petite
147
+ pitiful
148
+ plain
149
+ plump
150
+ polite
151
+ poor
152
+ powerful
153
+ prehistoric
154
+ prickly
155
+ proud
156
+ puny
157
+ purple
158
+ purring
159
+ putrid
160
+ quaint
161
+ quick
162
+ quiet
163
+ rancid
164
+ rapid
165
+ rapping
166
+ raspy
167
+ red
168
+ refined
169
+ repulsive
170
+ rhythmic
171
+ rich
172
+ ripe
173
+ rotten
174
+ rough
175
+ round
176
+ salmon
177
+ salty
178
+ savory
179
+ scarce
180
+ scary
181
+ scrawny
182
+ screeching
183
+ scruffy
184
+ shaggy
185
+ shallow
186
+ shapely
187
+ sharp
188
+ short
189
+ shrilling
190
+ shy
191
+ silly
192
+ skinny
193
+ slimy
194
+ slow
195
+ small
196
+ some
197
+ sour
198
+ sparse
199
+ spicy
200
+ spoiled
201
+ square
202
+ squeaking
203
+ stale
204
+ steep
205
+ sticky
206
+ stocky
207
+ strong
208
+ substantial
209
+ sweet
210
+ swift
211
+ tall
212
+ tangy
213
+ tart
214
+ tasteless
215
+ tasty
216
+ teeny
217
+ tender
218
+ thankful
219
+ thoughtless
220
+ thousands
221
+ thundering
222
+ tight
223
+ tinkling
224
+ tiny
225
+ ugly
226
+ uneven
227
+ unimportant
228
+ uninterested
229
+ unkempt
230
+ unsightly
231
+ uptight
232
+ vast
233
+ victorious
234
+ wailing
235
+ warm
236
+ weak
237
+ wet
238
+ whining
239
+ whispering
240
+ white
241
+ wide
242
+ witty
243
+ wonderful
244
+ wooden
245
+ worried
246
+ wrong
247
+ yellow
248
+ young
249
+ yummy
250
+ zealous
adverbs.txt CHANGED
@@ -1,3 +1,283 @@
1
- nor
2
- for
3
- yet
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ above
2
+ abroad
3
+ abruptly
4
+ accidentally
5
+ accordingly
6
+ actually
7
+ additionally
8
+ adequately
9
+ afterwards
10
+ almost
11
+ already
12
+ also
13
+ always
14
+ ambiguously
15
+ analytically
16
+ angrily
17
+ annually
18
+ anyway
19
+ anywhere
20
+ appropriately
21
+ arguably
22
+ around
23
+ away
24
+ awkwardly
25
+ backward
26
+ backwards
27
+ badly
28
+ barely
29
+ basically
30
+ beautifully
31
+ behind
32
+ below
33
+ besides
34
+ boldly
35
+ briefly
36
+ brightly
37
+ busily
38
+ carefully
39
+ carelessly
40
+ casually
41
+ cautiously
42
+ certainly
43
+ cheerfully
44
+ cleverly
45
+ close
46
+ coldly
47
+ comfortably
48
+ commonly
49
+ comparatively
50
+ completely
51
+ confidently
52
+ consciously
53
+ consecutively
54
+ consequently
55
+ continuously
56
+ conveniently
57
+ conversely
58
+ cunningly
59
+ curiously
60
+ currently
61
+ daily
62
+ dearly
63
+ deeply
64
+ differently
65
+ doubtfully
66
+ down
67
+ downhill
68
+ downstairs
69
+ eagerly
70
+ early
71
+ earnestly
72
+ easily
73
+ east
74
+ eastwards
75
+ elegantly
76
+ enough
77
+ especially
78
+ eventually
79
+ ever
80
+ everyday
81
+ everywhere
82
+ exceedingly
83
+ exceptionally
84
+ extremely
85
+ faithfully
86
+ far
87
+ fast
88
+ fearfully
89
+ fiercely
90
+ finally
91
+ foolishly
92
+ forward
93
+ frequently
94
+ fully
95
+ furthermore
96
+ generally
97
+ generously
98
+ gently
99
+ gracefully
100
+ graciously
101
+ gradually
102
+ graphically
103
+ happily
104
+ hard
105
+ hardly
106
+ hence
107
+ here
108
+ highly
109
+ honestly
110
+ however
111
+ humbly
112
+ hungrily
113
+ immediately
114
+ in
115
+ incessantly
116
+ incidentally
117
+ indeed
118
+ inside
119
+ instead
120
+ instinctively
121
+ interestingly
122
+ joyfully
123
+ joyously
124
+ just
125
+ kindly
126
+ last
127
+ late
128
+ lately
129
+ later
130
+ lazily
131
+ least
132
+ legally
133
+ legibly
134
+ likely
135
+ likewise
136
+ logically
137
+ loosely
138
+ lots
139
+ loudly
140
+ lovingly
141
+ madly
142
+ meanwhile
143
+ momentarily
144
+ monthly
145
+ moreover
146
+ most
147
+ mostly
148
+ much
149
+ mysteriously
150
+ namely
151
+ nearby
152
+ nearly
153
+ neatly
154
+ necessarily
155
+ never
156
+ nevertheless
157
+ next
158
+ nicely
159
+ noisily
160
+ nonetheless
161
+ normally
162
+ north
163
+ northeast
164
+ northwards
165
+ northwest
166
+ now
167
+ nowhere
168
+ obediently
169
+ occasionally
170
+ often
171
+ once
172
+ otherwise
173
+ out
174
+ outside
175
+ over
176
+ overseas
177
+ patiently
178
+ perfectly
179
+ physically
180
+ poorly
181
+ possibly
182
+ previously
183
+ privately
184
+ probably
185
+ promptly
186
+ properly
187
+ proudly
188
+ psychologically
189
+ publicly
190
+ purely
191
+ purposefully
192
+ quarterly
193
+ quickly
194
+ randomly
195
+ rapidly
196
+ rarely
197
+ readily
198
+ really
199
+ regularly
200
+ repeatedly
201
+ roughly
202
+ rudely
203
+ sadly
204
+ scarcely
205
+ sceptically
206
+ seldom
207
+ seriously
208
+ shortly
209
+ sickly
210
+ silently
211
+ similarly
212
+ simply
213
+ simultaneously
214
+ sincerely
215
+ skillfully
216
+ slightly
217
+ slowly
218
+ smoothly
219
+ so
220
+ sometimes
221
+ somewhat
222
+ somewhere
223
+ soon
224
+ south
225
+ southeast
226
+ southwards
227
+ southwest
228
+ still
229
+ straight
230
+ strangely
231
+ strongly
232
+ subsequently
233
+ successfully
234
+ suddenly
235
+ sufficiently
236
+ suitably
237
+ superficially
238
+ supremely
239
+ suspiciously
240
+ sweetly
241
+ tactfully
242
+ tastefully
243
+ tenaciously
244
+ then
245
+ there
246
+ therefore
247
+ thoroughly
248
+ thus
249
+ tidily
250
+ tightly
251
+ today
252
+ tomorrow
253
+ tonight
254
+ too
255
+ toward
256
+ towards
257
+ tremendously
258
+ truthfully
259
+ twice
260
+ under
261
+ underneath
262
+ unexpectedly
263
+ unlikely
264
+ unwillingly
265
+ up
266
+ uphill
267
+ upstairs
268
+ usually
269
+ vaguely
270
+ very
271
+ vicariously
272
+ victoriously
273
+ virtually
274
+ vividly
275
+ weekly
276
+ well
277
+ west
278
+ westwards
279
+ widely
280
+ willingly
281
+ wisely
282
+ yearly
283
+ yesterday
conjunctions.txt CHANGED
@@ -1,3 +1,30 @@
1
- subsequently
2
- meanwhile
3
- afterwards
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ after
2
+ although
3
+ and
4
+ as
5
+ because
6
+ before
7
+ both
8
+ but
9
+ either
10
+ for
11
+ how
12
+ if
13
+ neither
14
+ nor
15
+ not
16
+ or
17
+ rather
18
+ since
19
+ such
20
+ that
21
+ though
22
+ unless
23
+ until
24
+ when
25
+ whenever
26
+ whereas
27
+ wherever
28
+ whether
29
+ while
30
+ yet
utils.py ADDED
@@ -0,0 +1,51 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ def file_preprocess(file_name: str):
2
+ """
3
+ Preprocesses the file by:
4
+ 1. removing duplicates
5
+ 2. sorting the words
6
+ 3. removing non-alphabetic words
7
+ """
8
+ words = []
9
+ alphabets = set("abcdefghijklmnopqrstuvwxyz")
10
+ file = open(file_name, "r")
11
+ words = file.read().splitlines()
12
+ file.close()
13
+ # print(
14
+ # sorted(
15
+ # list(
16
+ # set(
17
+ # word
18
+ # for word in words
19
+ # if all(char.lower() in alphabets for char in word)
20
+ # )
21
+ # )
22
+ # )
23
+ # )
24
+ file = open(file_name, "w")
25
+ file.write(
26
+ "\n".join(
27
+ sorted(
28
+ list(
29
+ set(
30
+ word
31
+ for word in words
32
+ if all(char.lower() in alphabets for char in word)
33
+ )
34
+ )
35
+ )
36
+ )
37
+ )
38
+ file.close()
39
+
40
+
41
+ def find_duplicates_in_files(file_names: list[str]):
42
+ words = []
43
+ for file_name in file_names:
44
+ file = open(file_name, "r")
45
+ words += file.read().splitlines()
46
+ file.close()
47
+ print(sorted(list(set([word for word in words if words.count(word) > 1]))))
48
+
49
+
50
+ file_preprocess("adjectives.txt")
51
+ # find_duplicates_in_files(["conjunctions.txt", "adverbs.txt", "adjectives.txt"])