Sweaterdog commited on
Commit
3731117
·
verified ·
1 Parent(s): da96660

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +121 -1
README.md CHANGED
@@ -2,5 +2,125 @@
2
  license: apache-2.0
3
  ---
4
 
 
5
 
6
- This model is a dummy merge of 64 different LLMs. It is stupid, the router sucks, and won't produce coherent text. Test to see if viable for GRaPE (it wasn't)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2
  license: apache-2.0
3
  ---
4
 
5
+ ![GRaPE_Logo](https://cdn-uploads.huggingface.co/production/uploads/66960602f0ffd8e3a381106a/XjHkzctrE41e1qqJYeDzN.png)
6
 
7
+ _The **G**eneral **R**easoning **A**gent (for) **P**roject **E**xploration_
8
+
9
+ # The GRaPE Family
10
+ | Attribute | Size | Modalities | Domain |
11
+ | :--- | :--- | :--- | :--- |
12
+ | **GRaPE** | 10B A2.7B | Text + Image + Video in, Text out | Complex Reasoning Tasks |
13
+ | **GRaPE Flash** (this model) | 7B A1B | Text in, Text out | High-Speed Applications |
14
+ | **GRaPE Mini** | 1.7B | Text in, Text out | Edge Deployment |
15
+
16
+ ***
17
+
18
+ # Capabilities
19
+
20
+ The GRaPE Family was trained on about **14 billion** tokens of data after pre-training. About half was code related tasks, with the rest being heavy on STEAM. Ensuring the model has a sound logical basis.
21
+ > Disclaimer: *GRaPE Flash does not have thinking capabilities, primarily in favor of instant responses.*
22
+ ***
23
+
24
+ # Architecture
25
+
26
+ * GRaPE: Built on the `Qwen3 VL MoE` Architecture, allowing for long-context understanding and reasoning over visual tasks, as well as any text-based task. Allowing for deep understanding.
27
+
28
+ * GRaPE Flash: Built on the `OlMoE` Architecture, allowing for incredibly fast speeds where it matters. Allows for retaining factual information, but lacks in logical tasks.
29
+
30
+ * GRaPE Mini: Built on the `Qwen3` Architecture, allowing for edge case deployments, where logic nor speed can be sacrificed.
31
+
32
+ ***
33
+
34
+ # Demos
35
+ > *The following demos were ran for the model in this repo, GRaPE Flash, which does not have any reasoning abilities*
36
+
37
+ `Make me a landing page in HTML for a music creation software named "Aurora Beats". Output only the HTML file. Make it amazing, and extravagant`
38
+ ![image](https://cdn-uploads.huggingface.co/production/uploads/66960602f0ffd8e3a381106a/QuKyKsI4XvFcn3_7wRe-1.png)
39
+ [View It Here](https://github.com/Sweaterdog/GRaPE-Demos/blob/main/grape-flash/Aurora_Demo.html)
40
+ ***
41
+ `Make me a blackjack game using pygame. Generate all assets within pygame. Make no mistakes. Make it excellent.`
42
+ <details>
43
+ <summary>View Output</summary>
44
+ ```
45
+ user@test $ python blackjack.py
46
+ pygame 2.6.1 (SDL 2.28.4, Python 3.12.2)
47
+ Hello from the pygame community. https://www.pygame.org/contribute.html
48
+ Traceback (most recent call last):
49
+ File "/home/sweaterdog/Desktop/blackjack.py", line 88, in <module>
50
+ screen = pygame.display.set_mode((width, height))
51
+ ^^^^^
52
+ NameError: name 'width' is not defined
53
+ ```
54
+ </details>
55
+
56
+ [View It Here](https://github.com/Sweaterdog/GRaPE-Demos/blob/main/grape-flash/blackjack.py)
57
+ ***
58
+ `What is the electron configuration for Lithium?`
59
+ <details>
60
+ <summary>View Output</summary>
61
+ The electron configuration for Lithium, represented by the symbol 'Li', is:
62
+
63
+ \[ \text{Lithium} = [He] 2s^1 \]
64
+
65
+ This means that lithium has one electron in its 2s orbital and no electrons in the 2p orbitals.
66
+
67
+ *Note: The [He] notation refers to the electron configuration of helium, which is used here as a reference for understanding electron
68
+ configurations.*
69
+ </details>
70
+ GRaPE Flash got the question correct.
71
+ ***
72
+ `Who ended up beheaded in the Reign of Terror, due to people being anxious of being on his list?`
73
+ <details>
74
+ <summary>View Output</summary>
75
+ The Reign of Terror, a period of extreme and radical social and political change during the French Revolution, was marked by the rise
76
+ of the guillotine as a means of execution. Among the many individuals who met their end on this infamous instrument were several
77
+ notable figures whose lives were cut short due to the fervor of the revolution.
78
+
79
+ One such person was King Louis XVI, who was executed by guillotine in January 1793, signaling the beginning of the Reign of Terror.
80
+ His execution was a pivotal moment, symbolizing the overthrow of the monarchy and the birth of the first French Republic.
81
+
82
+ Another prominent victim was Marie Antoinette, the Queen consort of France, who was tried and executed by guillotine in October 1793.
83
+ Her trial and execution were emblematic of the radical changes and the extreme measures taken during this period to reshape the French
84
+ society and government.
85
+
86
+ However, among the thousands who faced the guillotine during this time, there were also common citizens and political figures whose
87
+ names are less remembered today. Among these was a baker named Jean, who was executed for allegedly hoarding grain, a merchant named
88
+ Pierre, who was accused of speculating on bread prices, and a young soldier named Henri, who was executed for desertion.
89
+
90
+ The Reign of Terror was characterized by mass executions, political purges, and the rise of the radical Jacobin faction led by
91
+ Maximilien Robespierre. This period was marked by intense fear and suspicion, leading to widespread accusations and executions. The
92
+ guillotine became a symbol of this brutal application of the death penalty, designed to execute individuals swiftly and efficiently.
93
+
94
+ The Reign of Terror ended with the fall of Robespierre in July 1794, known as the Thermidorian Reaction. This period saw the rise of
95
+ more moderate and less violent leadership, which began the transition back to more normalcy and stability in French society.
96
+
97
+ In this turbulent era, the guillotine was a brutal reminder of the revolutionary zeal and the accompanying paranoia that characterized
98
+ the time. The list of those executed included not just monarchists and aristocrats but also those who were seen as enemies of the
99
+ revolution, regardless of their political leanings. The Reign of Terror, while aiming to create a new social order, ultimately left a
100
+ haunting legacy of fear, sacrifice, and the brutal cost of revolution.
101
+ </details>
102
+ GRaPE Flash got the question correct, but buried the answer, Maximilien Robespierre, late into the response./
103
+ ***
104
+ `What notes are in the F# Mixolydian Scale?`
105
+ <details>
106
+ <summary>View Output</summary>
107
+ The F# Mixolydian Scale consists of the following notes: F#, G#, A#, B, C#, D#, and E#.
108
+
109
+ This scale is often used in various musical genres, particularly in blues, rock, and modal music. It's characterized by a relaxed and
110
+ easy-going vibe due to its unique modal structure.
111
+
112
+ Here's a breakdown of the intervals:
113
+
114
+ * F# to G#: Whole step (major second)
115
+ * G# to A#: Whole step
116
+ * A# to B: Half step (minor second)
117
+ * B to C#: Whole step
118
+ * C# to D#: Whole step
119
+ * D# to E#: Whole step
120
+ * E# to F#: Half step
121
+
122
+ The F# Mixolydian Scale is often used in songs where the composer wants a sense of jazzy or bluesy vibe, and it provides a nice
123
+ contrast to other scales due to its unique character. Remember to always approach your music with emotion and soul, and let the notes
124
+ flow from your heart!
125
+ </details>
126
+ GRaPE Flash got this question wrong. The correct answer is `F#, G#, A#, B, C#, D#, and E`. It had listed the notes in the F# Major Scale.