lex-interviewer-chat

Sleeping

Sync space files (2026-04-05)

634ca1b verified about 2 months ago

1.31 kB

	---
	title: Lex Fridman AI Interviewer
	emoji: 🎙️
	colorFrom: blue
	colorTo: purple
	sdk: gradio
	sdk_version: 5.23.0
	app_file: app.py
	pinned: false
	tags:
	- interviewer
	- nemotron
	- lex-fridman
	---

	# 🎙️ Lex Fridman AI Interviewer

	AI interviewer in the style of Lex Fridman, powered by NVIDIA Nemotron 3 Nano 4B via HF Inference API.

	The model asks deep, thoughtful questions — you are the guest. Paste context about yourself for more relevant interviews.

	## Model

	- Base: [nvidia/NVIDIA-Nemotron-3-Nano-4B-BF16](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Nano-4B-BF16)
	- Architecture: Hybrid Mamba-2 + Attention (38 Mamba layers + 4 Attention layers)
	- Inference: HF Inference API (ZeroGPU local deployment blocked by mamba-ssm CUDA kernel requirements)

	## Known Limitation

	ZeroGPU deployment with local model was attempted but the hybrid Mamba-2 architecture requires
	`mamba_ssm` CUDA Triton kernels (`rmsnorm_fn`) that can't be compiled in the Space build environment.
	A pure PyTorch mock was validated locally but fails on ZeroGPU due to device placement / timeout issues.
	See `docs/ZEROGPU_LIMITATION.md` in the project repo for details.

	## Project

	Part of the [Lex Fridman Interviewer Project](https://huggingface.co/bobber/lex-fridman-interviewer-project).