Instructions to use remiai3/RemiAI_Framework with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use remiai3/RemiAI_Framework with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="remiai3/RemiAI_Framework", filename="engine/model.gguf", )
llm.create_chat_completion( messages = [ { "role": "user", "content": "What is the capital of France?" } ] ) - Notebooks
- Google Colab
- Kaggle
- Local Apps
- llama.cpp
How to use remiai3/RemiAI_Framework with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf remiai3/RemiAI_Framework # Run inference directly in the terminal: llama-cli -hf remiai3/RemiAI_Framework
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf remiai3/RemiAI_Framework # Run inference directly in the terminal: llama-cli -hf remiai3/RemiAI_Framework
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf remiai3/RemiAI_Framework # Run inference directly in the terminal: ./llama-cli -hf remiai3/RemiAI_Framework
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf remiai3/RemiAI_Framework # Run inference directly in the terminal: ./build/bin/llama-cli -hf remiai3/RemiAI_Framework
Use Docker
docker model run hf.co/remiai3/RemiAI_Framework
- LM Studio
- Jan
- vLLM
How to use remiai3/RemiAI_Framework with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "remiai3/RemiAI_Framework" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "remiai3/RemiAI_Framework", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/remiai3/RemiAI_Framework
- Ollama
How to use remiai3/RemiAI_Framework with Ollama:
ollama run hf.co/remiai3/RemiAI_Framework
- Unsloth Studio new
How to use remiai3/RemiAI_Framework with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for remiai3/RemiAI_Framework to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for remiai3/RemiAI_Framework to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for remiai3/RemiAI_Framework to start chatting
- Docker Model Runner
How to use remiai3/RemiAI_Framework with Docker Model Runner:
docker model run hf.co/remiai3/RemiAI_Framework
- Lemonade
How to use remiai3/RemiAI_Framework with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull remiai3/RemiAI_Framework
Run and chat with the model
lemonade run user.RemiAI_Framework-{{QUANT_TAG}}List all available models
lemonade list
Create README.md
Browse files
README.md
ADDED
|
@@ -0,0 +1,78 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
language:
|
| 4 |
+
- en
|
| 5 |
+
base_model:
|
| 6 |
+
- google/gemma-2b-it
|
| 7 |
+
pipeline_tag: text-generation
|
| 8 |
+
tags:
|
| 9 |
+
- electron
|
| 10 |
+
- desktopapplication
|
| 11 |
+
- windows
|
| 12 |
+
---
|
| 13 |
+
|
| 14 |
+
# RemiAI / Bujji Open Source Framework
|
| 15 |
+
|
| 16 |
+
[](https://opensource.org/licenses/MIT)
|
| 17 |
+
[](https://www.electronjs.org/)
|
| 18 |
+
[](https://huggingface.co/)
|
| 19 |
+
|
| 20 |
+
**A "No-Setup" Local AI Framework for Students**
|
| 21 |
+
|
| 22 |
+
This project is an open-source, offline AI chat application designed for students and colleges. It allows you to run powerful LLMs (like Llama 3, Mistral, etc.) on your laptop without needing GPU, internet, Python, or complicated installations.
|
| 23 |
+
|
| 24 |
+
**Note** - No need any GPU in your laptop to run, it will use the CPU in your laptop for the response generation(inference) and if you want to modify the project code and use another model make sure that your are using the `.gguf` formated weights only, normal weights like `.safetensors` will not supported in this application.
|
| 25 |
+
---
|
| 26 |
+
|
| 27 |
+
## 🚀 Quick Start (One-Line Command)
|
| 28 |
+
|
| 29 |
+
If you have Git and Node.js installed, open your terminal (Command Prompt or PowerShell) and run:
|
| 30 |
+
|
| 31 |
+
```bash
|
| 32 |
+
git clone https://huggingface.co/remiai3/RemiAI_Framework && cd RemiAI-App && npm install && npm start
|
| 33 |
+
```
|
| 34 |
+
|
| 35 |
+
---
|
| 36 |
+
|
| 37 |
+
## 💻 Manual Installation
|
| 38 |
+
|
| 39 |
+
### 1. Requirements
|
| 40 |
+
* **Node.js**: [Download Here](https://nodejs.org/) (Install the LTS version).
|
| 41 |
+
* **Windows Laptop**: (Code includes optimized `.exe` binaries for Windows).
|
| 42 |
+
|
| 43 |
+
### 2. Download & Setup
|
| 44 |
+
1. **Download** the project zip (or clone the repo).
|
| 45 |
+
2. **Extract** the folder.
|
| 46 |
+
3. **Open Terminal** inside the folder path.
|
| 47 |
+
4. Run the installer for libraries:
|
| 48 |
+
```bash
|
| 49 |
+
npm install
|
| 50 |
+
```
|
| 51 |
+
|
| 52 |
+
### 3. Run the App
|
| 53 |
+
Simply type:
|
| 54 |
+
```bash
|
| 55 |
+
npm start
|
| 56 |
+
```
|
| 57 |
+
The application will launch, the AI engine will start in the background, and you can begin chatting immediately!
|
| 58 |
+
|
| 59 |
+
---
|
| 60 |
+
|
| 61 |
+
## 📦 Features
|
| 62 |
+
|
| 63 |
+
* **Zero Python Dependency**: We use compiled binaries (`.dll` and `.exe` included) so you don't need to install Python, PyTorch, or set up virtual environments.
|
| 64 |
+
* **Plug & Play Models**: Supports `.gguf` format.
|
| 65 |
+
* Want a different model? Download any `.gguf` file, rename it to `model.gguf`, and place it in the project root.
|
| 66 |
+
* **Auto-Optimization**: Automatically detects your CPU features (AVX vs AVX2) to give you the best speed possible.
|
| 67 |
+
* **Privacy First**: Runs 100% offline. No data leaves your device.
|
| 68 |
+
|
| 69 |
+
---
|
| 70 |
+
|
| 71 |
+
## 🛠️ Credits & License
|
| 72 |
+
|
| 73 |
+
* **Created By**: RemiAI Team
|
| 74 |
+
* **License**: MIT License.
|
| 75 |
+
* *You are free to rename, modify, and distribute this application as your own project!*
|
| 76 |
+
|
| 77 |
+
**Note on Models**: The application will only uses the `.gguf` formated weights only to make it as the CPU friendly run the application without any GPU
|
| 78 |
+
---
|