stefanches7 committed on
Commit
f0b8b43
·
1 Parent(s): 90e01a8

Update README with end deliverable and LLMs note

Added details about the end deliverable and clarified the role of LLMs in coding.

Files changed (1)
  1. README.md +13 -1
README.md CHANGED
@@ -16,6 +16,10 @@ In this proof-of-concept, we aim to determine whether a coordinated system of AI
16
 
17
  If successful, Open Data BIDSifier will serve as a foundation for an AI agent that can identify and harmonize different datasets from open data, making sure these are immediately usable for machine learning and statistical analysis.
18
 
 
 
 
 
19
  ## Tools and Structure
20
 
21
  We will use this repository (https://github.com/stefanches7/AI-assisted-Neuroimaging-harmonization) as a place for intermediate commits. Please familiarize yourself with Git and GitHub. [This intro](https://docs.github.com/de/get-started/start-your-journey/hello-world) can be useful for that.
@@ -40,8 +44,9 @@ Suggestions smaller LLMs: SmoLM, LLaMA-7B, Qwen-7B
40
  ### Coding & Vibe Coding
41
 
42
  We will use **Python** to code. I recommend using the **Anaconda** package manager to manage Python package environments. If you are not sure what the previous two sentences mean, I recommend [reading this intro to Python & Conda](https://www.anaconda.com/topics/choosing-between-anaconda-vs-python#:~:text=Anaconda%20is%20a%20distribution%20that,machine%20learning%2C%20and%20scientific%20computing.).
 
43
 
44
- LLMs can assist in writing code, but can also prove counterproductive and write bad (spaghetti), duplicated and erroneous code. It is instructive to be able to check their output and correct it manually.
45
 
46
  ## Rough Work Plan
47
 
@@ -92,6 +97,13 @@ Time planned: ~3 hours.
92
 
93
  Use [ResEncL](https://huggingface.co/AnonRes/ResEncL-OpenMind-MAE) or another model of your choice with JAX, and feed it the resulting harmonized dataset. Analyse and record the bugs.
94
 
 
 
 
 
 
 
 
95
  Time planned: ~4 hours.
96
 
97
  ### Teams and Participants / Skills
 
16
 
17
  If successful, Open Data BIDSifier will serve as a foundation for an AI agent that can identify and harmonize different datasets from open data, making sure these are immediately usable for machine learning and statistical analysis.
18
 
19
+ ## End deliverable
20
+
21
+ If successful, the developed data harmonizer tool will be published as a Python executable, as well as a [HuggingFace space](https://huggingface.co/spaces).
22
+
23
  ## Tools and Structure
24
 
25
  We will use this repository (https://github.com/stefanches7/AI-assisted-Neuroimaging-harmonization) as a place for intermediate commits. Please familiarize yourself with Git and GitHub. [This intro](https://docs.github.com/de/get-started/start-your-journey/hello-world) can be useful for that.
 
44
  ### Coding & Vibe Coding
45
 
46
  We will use **Python** to code. I recommend using the **Anaconda** package manager to manage Python package environments. If you are not sure what the previous two sentences mean, I recommend [reading this intro to Python & Conda](https://www.anaconda.com/topics/choosing-between-anaconda-vs-python#:~:text=Anaconda%20is%20a%20distribution%20that,machine%20learning%2C%20and%20scientific%20computing.).
47
+ Use your local IDE and a clone of the GitHub repository to collaborate, and fall back to the [Google Colab space](https://colab.research.google.com/drive/15AG1HpdTtwVgmnihxZRLoKEZu2Q6ChU9?usp=sharing) if the local setup fails for some reason.
48
 
49
+ **NB! LLMs can assist in writing code, but can also prove counterproductive and write bad (spaghetti), duplicated and erroneous code**. It is instructive to be able to check their output and correct it manually.
50
 
51
  ## Rough Work Plan
52
 
 
97
 
98
  Use [ResEncL](https://huggingface.co/AnonRes/ResEncL-OpenMind-MAE) or another model of your choice with JAX, and feed it the resulting harmonized dataset. Analyse and record the bugs.
99
 
100
+ Work tasks here:
101
+
102
+ 1. Data loading ("Data Engineer")
103
+ 2. Running & evaluating the model ("AI Engineer")
104
+ 3. GUI Design and code for Python executable ("UX Designer")
105
+ 4. GUI Design and code for HuggingFace space ("UX Designer")
106
+
107
  Time planned: ~4 hours.
108
 
109
  ### Teams and Participants / Skills