| Type | Topic | Example | Expected reduction |
| --- | --- | --- | --- |
| General | ⚡ Use the right quantization | On llamacpp, using q4ks instead of no quantization can reduce impact by | 33% |
| General | ⚡ Use the right framework | Using the vllm framework instead of llamacpp for some models can reduce impact by | 18% |
| General | ⚡ Use the lightest possible model that meets your needs | Using the model llama3-8b instead of 13b can reduce impact by | 30% |
| General | 🌫️ Locate servers in a country where energy production has less impact | Using a server located in Sweden instead of the United States can reduce impact by | 93% |
| General | 🌫️⛏️💧 Use as few resources as possible (i.e. the smallest machine/server that suits the need) | Using a small GPU server instead of a big one can reduce impact by | 45% |
| Specific | ⚡ Use the most frugal configuration | Compare with the most frugal configuration: the X-Y model with the Z framework | X% |
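
The source does not say how the percentages in the last column are computed. A minimal sketch, assuming each figure is a relative reduction of a measured impact against a baseline configuration, could look like the following. The function name `relative_reduction` and the input values are hypothetical: the values are chosen only to reproduce the 33% of the quantization row and are not measurements.

```python
def relative_reduction(baseline: float, alternative: float) -> float:
    """Return the percentage by which `alternative` lowers impact versus `baseline`.

    Assumption: both values measure the same impact indicator in the same unit.
    """
    if baseline <= 0:
        raise ValueError("baseline impact must be positive")
    return (baseline - alternative) / baseline * 100.0


# Hypothetical impact values in arbitrary units (not measured data).
baseline_impact = 1.00     # e.g. llamacpp with no quantization
alternative_impact = 0.67  # e.g. llamacpp with q4ks quantization

reduction = relative_reduction(baseline_impact, alternative_impact)
print(f"Expected reduction: {reduction:.0f}%")
```

Under this assumption, a negative result would mean the alternative configuration increases impact instead of reducing it.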