Commit History
updated date and model count 0c07f8b
Corey Morris committed on
Added new hugging face results 3f507e0
Corey Morris committed on
Updated to reflect number of models. Previously, I think there were duplicates d396c1e
Corey Morris committed on
Show a random question from the moral scenarios evaluation 19c7c67
Corey Morris committed on
Updated model count 4f20e65
Corey Morris committed on
Added statement of removal of models 96ffe12
Corey Morris committed on
removed commented code 7fc9618
Corey Morris committed on
updated update date 280db99
Corey Morris committed on
Fixed type error e79bcf3
Corey Morris committed on
WIP commit. Currently have nlargest error d506f10
Corey Morris committed on
Updated the last updated date to 18Aug 42ff7b9
Corey Morris committed on
Updated description with more models 7f24726
Corey Morris committed on
fixed error d7b89ce
Corey Morris committed on
Added google analytics snippet 9444cd2
Corey Morris committed on
Increased size of scatter plot 2b16774
Corey Morris committed on
Made the radar plot larger f52387e
Corey Morris committed on
Moved radar plots to higher in the page 12a9766
Corey Morris committed on
Modified title and explanation to better reflect what the site is 18ec1ba
Corey Morris committed on
Moved radar chart to after analysis fb25b1e
Corey Morris committed on
Added a default model to compare 7b77065
Corey Morris committed on
Improved clarity of explanation for Radar charts a450af5
Corey Morris committed on
Fixed some of the duplicate model issue 618dcce
Corey Morris committed on
Table now displays the columns that have the top differences dc21a69
Corey Morris committed on
removed charts with hardcoded tasks. removed hardcoding of model for other charts a125eb8
Corey Morris committed on
Finding top differences between tasks from the target model 627e0f9
Corey Morris committed on
Added explanation for the plot and a dataframe of the models 2db58a0
Corey Morris committed on
Added radar chart. Compares a model to the 5 models that have the closest performance on MMLU_average 9695a47
Corey Morris committed on
Added header back for the table 2a7f691
Corey Morris committed on
Added citation for the site ea8703d
Corey Morris committed on
Changed streamlit to wide layout to see more of the table 1e6b767
Corey Morris committed on
Updated updated date 28d4d6a
Corey Morris committed on
Added filter for parameter count. Fixed model filter so that it only filters on the Model name (index of the table) 8474e43
Corey Morris committed on
Modified the selection of models and evaluations so that most do not show up by default, for a better user experience with 700+ models 0a33874
Corey Morris committed on
Added search for Model name and Task name 3abc48f
Corey Morris committed on
Added reasoning for having scatter plots cb21769
Corey Morris committed on
Updated title now that there are over 700 open source models in the dataset a9f9804
Corey Morris committed on
Added statement and hypothesis about moral scenarios d97426f
Corey Morris committed on
Plots have a default title f9a0f38
Corey Morris committed on
Refactor of create_plot bdad6e6
Corey Morris committed on
Added finding from moral scenarios about threshold 1f8cc2a
Corey Morris committed on
Added bar chart for abstract algebra data. a79afe8
Corey Morris committed on
Added introduction, links, and reduced the number of plots displayed a5fb364
Corey Morris committed on
Modified download CSV feature so that the index column now has a title of model name 6a7ad7c
Corey Morris committed on
Added citation 4fbdb10
Corey Morris committed on
Add dashed line at the appropriate scale of the largest and smallest values on the plot so that plotly still zooms in to show that 7ed3839
Corey Morris committed on
for the custom charts, remove columns with NaN values b94ee8f
Corey Morris committed on
Refactoring. Moved ResultDataProcessor class to a separate file to make it easier to use with experimentation in a jupyter notebook 843a5ef
Corey Morris committed on
Renamed class. Removed columns that were not useful. 03ade34
Corey Morris committed on
Moved CSV download to just below the table a34a60b
Corey Morris committed on