Spaces:
Sleeping
Sleeping
Commit ·
f0b8b43
1
Parent(s): 90e01a8
Update README with end deliverable and LLMs note
Browse filesAdded details about the end deliverable and clarified the role of LLMs in coding.
README.md
CHANGED
|
@@ -16,6 +16,10 @@ In this proof-of-concept, we aim to determine whether a coordinated system of AI
|
|
| 16 |
|
| 17 |
If successful, Open Data BIDSifier will serve as a foundation for an AI agent that can identify and harmonize different datasets from open data, making sure these are immediately usable for machine learning and statistical analysis.
|
| 18 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 19 |
## Tools and Structure
|
| 20 |
|
| 21 |
We will use this repository (https://github.com/stefanches7/AI-assisted-Neuroimaging-harmonization) as an intermittant commit place. Please, make yourself familiar with Git and Github. [This intro](https://docs.github.com/de/get-started/start-your-journey/hello-world) can be useful for that.
|
|
@@ -40,8 +44,9 @@ Suggestions smaller LLMs: SmoLM, LLaMA-7B, Qwen-7B
|
|
| 40 |
### Coding & Vibe Coding
|
| 41 |
|
| 42 |
We will use **Python** to code. I recommend using **Anaconda** package manager as a tool to manage the Python package environments. If you are not sure what the previous 2 sentences really mean, I recommend [reading this intro to Python & Conda](https://www.anaconda.com/topics/choosing-between-anaconda-vs-python#:~:text=Anaconda%20is%20a%20distribution%20that,machine%20learning%2C%20and%20scientific%20computing.)
|
|
|
|
| 43 |
|
| 44 |
-
LLMs can assist in writing code, but can also prove counterproductive and write bad (spaghetti), duplicated and erroneous code. It is instructful to be able check their output and correct it manually.
|
| 45 |
|
| 46 |
## Rough Work Plan
|
| 47 |
|
|
@@ -92,6 +97,13 @@ Time planned: ~3 hours.
|
|
| 92 |
|
| 93 |
Use [ResEncL](https://huggingface.co/AnonRes/ResEncL-OpenMind-MAE) or another model of your choice with JAX, and feed it the resulting harmonized dataset. Analyse and record the bugs.
|
| 94 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 95 |
Time planned: ~4 hours.
|
| 96 |
|
| 97 |
### Teams and Participants / Skills
|
|
|
|
| 16 |
|
| 17 |
If successful, Open Data BIDSifier will serve as a foundation for an AI agent that can identify and harmonize different datasets from open data, making sure these are immediately usable for machine learning and statistical analysis.
|
| 18 |
|
| 19 |
+
## End deliverable
|
| 20 |
+
|
| 21 |
+
If successful, the developed data harmonizer tool will be published as a Python executable, as well as a [HuggingFace space](https://huggingface.co/spaces).
|
| 22 |
+
|
| 23 |
## Tools and Structure
|
| 24 |
|
| 25 |
We will use this repository (https://github.com/stefanches7/AI-assisted-Neuroimaging-harmonization) as an intermittant commit place. Please, make yourself familiar with Git and Github. [This intro](https://docs.github.com/de/get-started/start-your-journey/hello-world) can be useful for that.
|
|
|
|
| 44 |
### Coding & Vibe Coding
|
| 45 |
|
| 46 |
We will use **Python** to code. I recommend using **Anaconda** package manager as a tool to manage the Python package environments. If you are not sure what the previous 2 sentences really mean, I recommend [reading this intro to Python & Conda](https://www.anaconda.com/topics/choosing-between-anaconda-vs-python#:~:text=Anaconda%20is%20a%20distribution%20that,machine%20learning%2C%20and%20scientific%20computing.)
|
| 47 |
+
Use your local IDE + Github repository clone to collaborate, and [Google collab space](https://colab.research.google.com/drive/15AG1HpdTtwVgmnihxZRLoKEZu2Q6ChU9?usp=sharing) if the local setup fails for some reason.
|
| 48 |
|
| 49 |
+
**NB! LLMs can assist in writing code, but can also prove counterproductive and write bad (spaghetti), duplicated and erroneous code**. It is instructful to be able check their output and correct it manually.
|
| 50 |
|
| 51 |
## Rough Work Plan
|
| 52 |
|
|
|
|
| 97 |
|
| 98 |
Use [ResEncL](https://huggingface.co/AnonRes/ResEncL-OpenMind-MAE) or another model of your choice with JAX, and feed it the resulting harmonized dataset. Analyse and record the bugs.
|
| 99 |
|
| 100 |
+
Work tasks here:
|
| 101 |
+
|
| 102 |
+
1. Data loading ("Data Engineer")
|
| 103 |
+
2. Running & evaluating the model ("AI Engineer")
|
| 104 |
+
3. GUI Design and code for Python executable ("UX Designer")
|
| 105 |
+
4. GUI Design and code for HuggingFace space ("UX Designer")
|
| 106 |
+
|
| 107 |
Time planned: ~4 hours.
|
| 108 |
|
| 109 |
### Teams and Participants / Skills
|