Spaces:
Sleeping
Sleeping
| title: Lex Fridman AI Interviewer | |
| emoji: 🎙️ | |
| colorFrom: blue | |
| colorTo: purple | |
| sdk: gradio | |
| sdk_version: 5.23.0 | |
| app_file: app.py | |
| pinned: false | |
| tags: | |
| - interviewer | |
| - nemotron | |
| - lex-fridman | |
| # 🎙️ Lex Fridman AI Interviewer | |
| AI interviewer in the style of Lex Fridman, powered by **NVIDIA Nemotron 3 Nano 4B** via HF Inference API. | |
| The model asks deep, thoughtful questions — you are the guest. Paste context about yourself for more relevant interviews. | |
| ## Model | |
| - **Base**: [nvidia/NVIDIA-Nemotron-3-Nano-4B-BF16](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Nano-4B-BF16) | |
| - **Architecture**: Hybrid Mamba-2 + Attention (38 Mamba layers + 4 Attention layers) | |
| - **Inference**: HF Inference API (ZeroGPU local deployment blocked by mamba-ssm CUDA kernel requirements) | |
| ## Known Limitation | |
| ZeroGPU deployment with local model was attempted but the hybrid Mamba-2 architecture requires | |
| `mamba_ssm` CUDA Triton kernels (`rmsnorm_fn`) that can't be compiled in the Space build environment. | |
| A pure PyTorch mock was validated locally but fails on ZeroGPU due to device placement / timeout issues. | |
| See `docs/ZEROGPU_LIMITATION.md` in the project repo for details. | |
| ## Project | |
| Part of the [Lex Fridman Interviewer Project](https://huggingface.co/bobber/lex-fridman-interviewer-project). | |