Commit History

Fix Java installation error - Replace openjdk-8-jdk with default-jdk for Debian bookworm compatibility - OpenJDK 8 not available in Debian bookworm repositories - Use default-jdk which provides OpenJDK 17 (compatible with UFPAlign)
1816301

marcosremar2 commited on

Implement UFPAlign with syllable segmentation - Replace MFA with UFPAlign for alignment specific to Brazilian Portuguese - Add interface to show the start and end of each syllable - Display detailed information on phonemes, words, and syllables - Use the official UFPAlign repository from UFPA - Interface optimized for viewing syllable segmentation
1e3ff26

marcosremar2 commited on

Fix MFA dictionary model name - Add portuguese_mfa dictionary download to Dockerfile - Update app.py to use correct portuguese_mfa models for both dictionary and acoustic - Ensure all three required models (dictionary, acoustic, g2p) are downloaded
3722cb7

marcosremar2 commited on

Trigger rebuild with correct Portuguese model names
6d5cb77

marcosremar2 commited on

Add UFPAlign Docker implementation - Fix MFA Portuguese model names in original Dockerfile - Add comprehensive UFPAlign Docker setup with Kaldi toolkit - Include Docker Compose configuration for multi-container deployment - Add Makefile with convenient commands for container management - Support for interactive shell, API service, and batch processing - Complete documentation for UFPAlign Brazilian Portuguese alignment
a80c875

marcosremar2 commited on

Fix acoustic model name to portuguese_brazil_mfa - ensures both acoustic and g2p models use correct portuguese_brazil_mfa naming
c014bac

marcosremar2 commited on

Fix G2P model name to portuguese_brazil_mfa
324c2d5

marcosremar2 commited on

Fix Portuguese model names to portuguese_mfa
9a9cdc9

marcosremar2 commited on

Trigger rebuild - update timestamp
10c5980

marcosremar2 commited on

Fix Python 3.12 compatibility issues - Remove version pinning for packages to use latest compatible versions - Fix AttributeError: module pkgutil has no attribute ImpImporter - Use Python 3.12 compatible package versions
bd041e0

marcosremar2 commited on

Use official Montreal Forced Aligner Docker container - Switch to mmcauliffe/montreal-forced-aligner:latest base image - Drastically simplify Dockerfile and avoid all dependency issues - Official container already includes MFA, Kaldi, OpenFST, pynini, etc.
2fa97f9

marcosremar2 commited on

Switch to conda-forge for Montreal Forced Aligner installation - Avoid compilation issues with pre-built packages from conda-forge - Remove unavailable libopenfst-dev packages from Ubuntu 20.04
b3e02b6

marcosremar2 commited on

Add OpenFST development libraries for pynini compilation - Fix fst/util.h missing header error
5d7e97e

marcosremar2 commited on

Add pynini dependency for Montreal Forced Aligner - Fix ModuleNotFoundError: No module named pynini
b6307b9

marcosremar2 commited on

Fix Montreal Forced Aligner pgvector dependency - Add pgvector==0.2.4 to resolve ModuleNotFoundError
4cf8238

marcosremar2 commited on

Fix pandas version compatibility and add comprehensive API test script
317700f

marcosremar2 commited on

Complete migration to Montreal Forced Aligner with Portuguese models - Replace UFPAlign with MFA for better reliability and community support
97280ac

marcosremar2 commited on

Fix f-string backslash syntax error - replace with string concatenation
0bf2604

marcosremar2 commited on
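
Note on the f-string fix above: before Python 3.12, a backslash could not appear inside the expression part of an f-string, so something like `f"{'\n'.join(lines)}"` failed with a SyntaxError. The sketch below shows the string-concatenation workaround the commit message names. The helper `render_lines` and its `header` parameter are hypothetical and are not taken from the repository.

```python
from typing import List


def render_lines(header: str, lines: List[str]) -> str:
    """Join lines under a header without a backslash inside an f-string expression."""
    # Keep the escape sequence in a plain string, outside any f-string braces.
    newline = "\n"
    # Plain concatenation works on every Python 3 version.
    return header + newline + newline.join(lines)


if __name__ == "__main__":
    print(render_lines("phones:", ["a", "b"]))
```

Keeping the newline in a named variable also avoids a second layer of escaping when the Python snippet is embedded in a shell command inside a Dockerfile.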

Fix word_boundary.int generation with proper Kaldi format and phone boundary types
8bbc39b

marcosremar2 commited on

Fix newline escaping in f-string - use \\n to create literal \n
1001387

marcosremar2 commited on

Fix shell quoting for f-strings - use single quotes to protect double quotes
9540dde

marcosremar2 commited on

Fix Python f-string syntax in Dockerfile - use double quotes to avoid shell conflicts
1b0c784

marcosremar2 commited on

Trigger fresh build - update README with latest status
0094da2

marcosremar2 commited on

Fix Dockerfile Python script creation - replace heredoc with echo statements
e720090

marcosremar2 commited on

Fix Python syntax error in word_boundary.int creation - Replace broken inline Python with proper script file - Create create_word_boundary.py with proper indentation and syntax - Add additional silence phone types (SPN, spn) to mapping - Script creates proper Kaldi word boundary format
1bd9f59

marcosremar2 commited on
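
Note on the word_boundary.int fix above: Kaldi expects one `<phone-id> <type>` pair per line, where the type is `nonword`, `begin`, `end`, `internal`, or `singleton`. With position-dependent phones, the type follows from the `_B`, `_E`, `_I`, `_S` suffix of the symbol in phones.txt. The sketch below is a minimal illustration under that assumption, not the repository's `create_word_boundary.py`. The function name `word_boundary_lines` is hypothetical, and treating every unsuffixed phone (for example `sil` or `SPN`) as `nonword` is an assumption.

```python
from typing import Iterable, List

# Word-position suffixes used by Kaldi's position-dependent phones.
SUFFIX_TO_TYPE = {
    "_B": "begin",
    "_E": "end",
    "_I": "internal",
    "_S": "singleton",
}


def word_boundary_lines(phones_txt: Iterable[str]) -> List[str]:
    """Map "<symbol> <id>" lines from phones.txt to "<id> <type>" lines."""
    out = []
    for line in phones_txt:
        parts = line.split()
        if len(parts) != 2:
            continue  # skip blank or malformed lines
        symbol, phone_id = parts
        # Epsilon and disambiguation symbols (#0, #1, ...) are not phones.
        if symbol == "<eps>" or symbol.startswith("#"):
            continue
        # Assumption: phones without a position suffix are non-word phones.
        kind = SUFFIX_TO_TYPE.get(symbol[-2:], "nonword")
        out.append(f"{phone_id} {kind}")
    return out


if __name__ == "__main__":
    sample = ["<eps> 0", "sil 1", "a_B 2", "a_E 3", "a_I 4", "a_S 5", "#0 6"]
    print("\n".join(word_boundary_lines(sample)))
```

Writing this as a standalone script file, rather than inline Python in a Dockerfile `RUN` line, avoids the shell-quoting and indentation problems described in the neighbouring commits.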

Fix download_model.sh syntax error and create missing word_boundary.int - Remove broken sed command that corrupted download_model.sh - Generate word_boundary.int from phones.txt during build - Create proper phone boundary mappings for Kaldi - Show word_boundary.int content during build for verification
bf9423e

marcosremar2 commited on

Pre-extract data.tar.gz during Docker build to fix word_boundary.int issue - Download and extract data files during build instead of runtime - Modify download_model.sh to skip data download if directory exists - Show data directory structure during build for verification - This should resolve the missing word_boundary.int file error
fa6fa35

marcosremar2 commited on

Add comprehensive debugging for UFPAlign data extraction issue - Log directory structure before and after UFPAlign execution - Check for data/lang/phones/word_boundary.int file existence - Identify where data extraction/copying is failing - Debug both source UFPAlign/data and target egs/UFPAlign/s5/data directories
e9e084d

marcosremar2 commited on

Fix UFPAlign data extraction and environment setup - Add UFPALIGN_DIR environment variable as expected by UFPAlign script - Grant user sudo privileges for file extraction operations - UFPAlign script uses sudo for directory creation and file operations - This should resolve word_boundary.int missing file error
85d4d2b

marcosremar2 commited on

Fix Kaldi egs directory permissions for UFPAlign - Grant user write permissions to /opt/kaldi/egs directory - UFPAlign needs to create working directories in KALDI_ROOT/egs/UFPAlign/s5 - Resolves 'mkdir: Permission denied' error during alignment execution
9519099

marcosremar2 commited on

Fix externally-managed-environment error for Python 3.11+ - Add --break-system-packages flag to all pip install commands - Required for Python 3.11+ which prevents global package installation by default - Safe to use in Docker containers where we want global package installation
86faf8d

marcosremar2 commited on

Fix Java package compatibility for Debian Bookworm - Replace openjdk-8-jdk with default-jdk for Debian Bookworm compatibility - The kaldiasr/kaldi:latest image uses Debian Bookworm where openjdk-8-jdk is not available - default-jdk provides the same functionality and is compatible with UFPAlign
a230ef3

marcosremar2 commited on

Enable UFPAlign native automatic model downloading - Remove manual download endpoint and UI - UFPAlign automatically downloads models via utils/download_model.sh on first use - Cleaner implementation following official UFPAlign behavior - Added wget dependency for downloading FalaBrasil annotator lib - Models download automatically when first alignment is requested
82aaae8

marcosremar2 commited on

Follow official UFPAlign Dockerfile structure with runtime model download - Use official UFPAlign build approach from GitHub - Add runtime model download endpoint /download-models - Update web interface with download models button - Include unzip package for model extraction - Set Portuguese Brazilian locale as per official requirements
25e0c71

marcosremar2 commited on

Implement robust multi-method download with fallback handling - Replace single gdown with comprehensive script trying multiple methods - Add proper error handling and graceful fallback to test structure - Extensive logging and status reporting
272afb2

marcosremar2 commited on

Simplify download process and add build-time listing for debugging
3b005db

marcosremar2 commited on

Add detailed debugging for model detection and file structure
1fcca8c

marcosremar2 commited on

Download both UFPAlign models and data files during build initialization
9dd5d2a

marcosremar2 commited on

Fix Python multiline syntax in Dockerfile with pure bash approach
f5891bf

marcosremar2 commited on

Fix model download with multiple fallback methods
b1f9da8

marcosremar2 commited on

Remove mocks and demo mode, restore strict UFPAlign validation
94e512a

marcosremar2 commited on

Remove restrictive model checks and add demo mode fallback
ddb64c1

marcosremar2 commited on

Add graceful degradation and basic test functionality
5bf7cbf

marcosremar2 commited on

Fix permissions and model download with alternative method
1738fc7

marcosremar2 commited on

Fix Java package: use default-jdk instead of openjdk-8-jdk
e041dcb

marcosremar2 commited on

Fix missing dependencies: add sudo, java, bc for UFPAlign
741fec4

marcosremar2 commited on

Add comprehensive debugging and health checks for UFPAlign
ac13286

marcosremar2 commited on

Fix Dockerfile Python syntax using external script
2c555da

marcosremar2 commited on

Fix model download with graceful degradation
593dae6

marcosremar2 commited on

Fix pip externally-managed-environment error
1d619b8

marcosremar2 commited on