Spaces:
Sleeping
Sleeping
Commit
·
f0b8b43
1
Parent(s):
90e01a8
Update README with end deliverable and LLMs note
Browse filesAdded details about the end deliverable and clarified the role of LLMs in coding.
README.md
CHANGED
|
@@ -16,6 +16,10 @@ In this proof-of-concept, we aim to determine whether a coordinated system of AI
|
|
| 16 |
|
| 17 |
If successful, Open Data BIDSifier will serve as a foundation for an AI agent that can identify and harmonize different datasets from open data, making sure these are immediately usable for machine learning and statistical analysis.
|
| 18 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 19 |
## Tools and Structure
|
| 20 |
|
| 21 |
We will use this repository (https://github.com/stefanches7/AI-assisted-Neuroimaging-harmonization) as an intermittant commit place. Please, make yourself familiar with Git and Github. [This intro](https://docs.github.com/de/get-started/start-your-journey/hello-world) can be useful for that.
|
|
@@ -40,8 +44,9 @@ Suggestions smaller LLMs: SmoLM, LLaMA-7B, Qwen-7B
|
|
| 40 |
### Coding & Vibe Coding
|
| 41 |
|
| 42 |
We will use **Python** to code. I recommend using **Anaconda** package manager as a tool to manage the Python package environments. If you are not sure what the previous 2 sentences really mean, I recommend [reading this intro to Python & Conda](https://www.anaconda.com/topics/choosing-between-anaconda-vs-python#:~:text=Anaconda%20is%20a%20distribution%20that,machine%20learning%2C%20and%20scientific%20computing.)
|
|
|
|
| 43 |
|
| 44 |
-
LLMs can assist in writing code, but can also prove counterproductive and write bad (spaghetti), duplicated and erroneous code
|
| 45 |
|
| 46 |
## Rough Work Plan
|
| 47 |
|
|
@@ -92,6 +97,13 @@ Time planned: ~3 hours.
|
|
| 92 |
|
| 93 |
Use [ResEncL](https://huggingface.co/AnonRes/ResEncL-OpenMind-MAE) or another model of your choice with JAX, and feed it the resulting harmonized dataset. Analyse and record the bugs.
|
| 94 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 95 |
Time planned: ~4 hours.
|
| 96 |
|
| 97 |
### Teams and Participants / Skills
|
|
|
|
| 16 |
|
| 17 |
If successful, Open Data BIDSifier will serve as a foundation for an AI agent that can identify and harmonize different datasets from open data, making sure these are immediately usable for machine learning and statistical analysis.
|
| 18 |
|
| 19 |
+
## End deliverable
|
| 20 |
+
|
| 21 |
+
If successful, the developed data harmonizer tool will be published as a Python executable, as well as a [HuggingFace space](https://huggingface.co/spaces).
|
| 22 |
+
|
| 23 |
## Tools and Structure
|
| 24 |
|
| 25 |
We will use this repository (https://github.com/stefanches7/AI-assisted-Neuroimaging-harmonization) as an intermittant commit place. Please, make yourself familiar with Git and Github. [This intro](https://docs.github.com/de/get-started/start-your-journey/hello-world) can be useful for that.
|
|
|
|
| 44 |
### Coding & Vibe Coding
|
| 45 |
|
| 46 |
We will use **Python** to code. I recommend using **Anaconda** package manager as a tool to manage the Python package environments. If you are not sure what the previous 2 sentences really mean, I recommend [reading this intro to Python & Conda](https://www.anaconda.com/topics/choosing-between-anaconda-vs-python#:~:text=Anaconda%20is%20a%20distribution%20that,machine%20learning%2C%20and%20scientific%20computing.)
|
| 47 |
+
Use your local IDE + Github repository clone to collaborate, and [Google collab space](https://colab.research.google.com/drive/15AG1HpdTtwVgmnihxZRLoKEZu2Q6ChU9?usp=sharing) if the local setup fails for some reason.
|
| 48 |
|
| 49 |
+
**NB! LLMs can assist in writing code, but can also prove counterproductive and write bad (spaghetti), duplicated and erroneous code**. It is instructful to be able check their output and correct it manually.
|
| 50 |
|
| 51 |
## Rough Work Plan
|
| 52 |
|
|
|
|
| 97 |
|
| 98 |
Use [ResEncL](https://huggingface.co/AnonRes/ResEncL-OpenMind-MAE) or another model of your choice with JAX, and feed it the resulting harmonized dataset. Analyse and record the bugs.
|
| 99 |
|
| 100 |
+
Work tasks here:
|
| 101 |
+
|
| 102 |
+
1. Data loading ("Data Engineer")
|
| 103 |
+
2. Running & evaluating the model ("AI Engineer")
|
| 104 |
+
3. GUI Design and code for Python executable ("UX Designer")
|
| 105 |
+
4. GUI Design and code for HuggingFace space ("UX Designer")
|
| 106 |
+
|
| 107 |
Time planned: ~4 hours.
|
| 108 |
|
| 109 |
### Teams and Participants / Skills
|