Tonic committed on
Commit 8481b42 · 1 Parent(s): e115d61

attempt more advanced predictions using model ensemble

Files changed (3)
  1. README.md +116 -132
  2. app.py +732 -43
  3. requirements.txt +5 -1
README.md CHANGED
@@ -13,158 +13,142 @@ tags:
  - mcp-server-track
  ---

- # Stock Analysis and Prediction Demo

- A comprehensive stock analysis and prediction tool built with Gradio, featuring multiple prediction strategies and technical analysis indicators. The application is particularly suited for structured financial product creation and analysis.

  ## Features

- - **Multiple Prediction Strategies**:
-   - Chronos ML-based prediction
-   - Technical analysis-based prediction
-
- - **Technical Indicators**:
-   - RSI (Relative Strength Index)
-   - MACD (Moving Average Convergence Divergence)
-   - Bollinger Bands
-   - Simple Moving Averages (20, 50, and 200-day)
-
- - **Trading Signals**:
-   - Buy/Sell recommendations based on multiple indicators
-   - Overall trading signal combining all indicators
-   - Confidence intervals for predictions
-
- - **Interactive Visualizations**:
-   - Price prediction with confidence intervals
-   - Technical indicators overlay
-   - Volume analysis
-   - Historical price trends
-
- - **Structured Product Analysis**:
-   - Extended prediction horizons (up to 1 year)
-   - Historical analysis up to 10 years
-   - Comprehensive risk metrics
-   - Sector and industry analysis
-   - Liquidity assessment
-
- ## Structured Product Features
-
- ### Extended Time Horizons
- - Prediction window up to 365 days
- - Historical data analysis up to 10 years
- - Long-term trend analysis
- - Extended technical indicators
-
- ### Risk Analysis
- - Annualized volatility
- - Maximum drawdown analysis
- - Current drawdown tracking
- - Sharpe and Sortino ratios
- - Risk-adjusted return metrics
-
- ### Product Metrics
- - Market capitalization
- - Sector and industry classification
- - Dividend yield analysis
- - Volume metrics
- - Liquidity scoring
-
- ### Sector Analysis
- - Market cap ranking (Large/Mid/Small)
- - Sector exposure
- - Industry classification
- - Liquidity assessment

- ## Installation

- 1. Clone the repository:
- ```bash
- git clone <repository-url>
- cd stock-prediction
- ```

- 2. Create and activate a virtual environment:
- ```bash
- python -m venv .venv
- source .venv/bin/activate  # On Windows: .venv\Scripts\activate
- ```

- 3. Install dependencies:
  ```bash
  pip install -r requirements.txt
  ```

- ## Usage
-
- 1. Start the Gradio demo:
  ```bash
  python app.py
  ```

- 2. Open your web browser and navigate to the URL shown in the terminal (typically http://localhost:7860)
-
- 3. Enter a stock symbol (e.g., AAPL, GOOGL, MSFT) and select your desired parameters:
-    - Timeframe (1d, 1h, 15m)
-    - Number of days to predict (up to 365 days)
-    - Historical lookback period (up to 10 years)
-    - Prediction strategy (Chronos or Technical)
-
- 4. Click "Analyze Stock" to get:
-    - Price predictions and trading signals
-    - Structured product metrics
-    - Risk analysis
-    - Sector analysis
-
- ## Using for Structured Products
-
- ### Initial Screening
- 1. Use extended lookback period (up to 10 years) for long-term performance analysis
- 2. Look for stocks with stable volatility and good risk-adjusted returns
- 3. Check liquidity scores for trading feasibility
-
- ### Risk Assessment
- 1. Review risk metrics to match client risk profile
- 2. Analyze maximum drawdowns for worst-case scenarios
- 3. Compare risk-adjusted returns using Sharpe and Sortino ratios
-
- ### Product Structuring
- 1. Use prediction horizon (up to 1 year) for product maturity design
- 2. Consider dividend yields for income-generating products
- 3. Use sector analysis for proper diversification
-
- ### Portfolio Construction
- 1. Analyze multiple stocks for diversified bundles
- 2. Use sector metrics to avoid overexposure
- 3. Consider market cap rankings for appropriate sizing
-
- ## Prediction Strategies
-
- ### Chronos Strategy
- Uses Amazon's Chronos model for ML-based price prediction. This strategy:
- - Analyzes historical price patterns
- - Generates probabilistic forecasts
- - Provides confidence intervals
-
- ### Technical Strategy
- Uses traditional technical analysis indicators to generate predictions:
- - RSI for overbought/oversold conditions
- - MACD for trend direction
- - Bollinger Bands for volatility
- - Moving Averages for trend confirmation
-
- ## Trading Signals
-
- The demo provides trading signals based on multiple technical indicators:
- - RSI: Oversold (<30), Overbought (>70), Neutral
- - MACD: Buy (MACD > Signal), Sell (MACD < Signal)
- - Bollinger Bands: Buy (price < lower band), Sell (price > upper band)
- - SMA: Buy (20-day > 50-day), Sell (20-day < 50-day)

- An overall trading signal is calculated by combining all individual signals.

  ## Contributing

- Contributions are welcome! Please feel free to submit a Pull Request.

  ## License

 
  - mcp-server-track
  ---

+ # Advanced Stock Prediction Analysis

+ A comprehensive stock prediction and analysis tool that combines Chronos forecasting with advanced features including regime detection, ensemble methods, and stress testing.

  ## Features

+ ### Core Prediction Engine
+ - **Chronos Forecasting**: State-of-the-art time series forecasting using Amazon's Chronos model
+ - **Technical Analysis**: Traditional technical indicators (RSI, MACD, Bollinger Bands, SMA)
+ - **Multi-timeframe Support**: Daily, hourly, and 15-minute analysis
+ - **Real-time Data**: Live market data via yfinance

+ ### Advanced Features

+ #### 1. Market Regime Detection
+ - **Hidden Markov Models (HMM)**: Automatic detection of market regimes (bull, bear, sideways)
+ - **Volatility-based Fallback**: Simplified regime detection when HMM is unavailable
+ - **Regime-adjusted Signals**: Trading signals that adapt to current market conditions

+ #### 2. Ensemble Methods
+ - **Multi-model Combination**: Combines Chronos, technical, and statistical predictions
+ - **Adaptive Weighting**: User-configurable weights for different models
+ - **Uncertainty Quantification**: Advanced uncertainty estimation with skewness adjustment
+
+ #### 3. Advanced Risk Metrics
+ - **Tail Risk Analysis**: VaR and CVaR calculations
+ - **Market Correlation**: Beta, alpha, and correlation with market indices
+ - **Risk-adjusted Returns**: Sharpe, Sortino, and Calmar ratios
+ - **Drawdown Analysis**: Maximum drawdown and recovery metrics
+
+ #### 4. Stress Testing
+ - **Scenario Analysis**: Market crash, high volatility, bull market scenarios
+ - **Interest Rate Shocks**: Impact of rate changes on predictions
+ - **Custom Scenarios**: User-defined stress test parameters

+ #### 5. Enhanced Uncertainty Quantification
+ - **Skewness-aware**: Accounts for non-normal return distributions
+ - **Adaptive Smoothing**: Reduces prediction drift based on uncertainty
+ - **Confidence Intervals**: Dynamic confidence levels based on market conditions
+
+ ## Installation
+
+ 1. Install dependencies:
  ```bash
  pip install -r requirements.txt
  ```

+ 2. Run the application:
  ```bash
  python app.py
  ```

+ ## Usage

+ ### Basic Analysis
+ 1. Enter a stock symbol (e.g., AAPL, MSFT, GOOGL)
+ 2. Select timeframe (Daily, Hourly, or 15-minute)
+ 3. Choose prediction strategy (Chronos or Technical)
+ 4. Set prediction days and lookback period
+ 5. Click "Analyze Stock"
+
+ ### Advanced Settings
+ - **Ensemble Methods**: Enable/disable multi-model combination
+ - **Regime Detection**: Enable/disable market regime analysis
+ - **Stress Testing**: Enable/disable scenario analysis
+ - **Risk-free Rate**: Set annual risk-free rate for calculations
+ - **Market Index**: Choose correlation index (S&P 500, Dow Jones, NASDAQ, Russell 2000)
+ - **Ensemble Weights**: Adjust weights for Chronos, Technical, and Statistical models
+
+ ### Output Sections
+
+ #### Daily Analysis
+ - **Structured Product Metrics**: Market cap, sector, financial ratios
+ - **Advanced Risk Analysis**: Comprehensive risk metrics with market correlation
+ - **Market Regime Analysis**: Current regime and transition probabilities
+ - **Trading Signals**: Advanced signals with confidence levels
+ - **Stress Test Results**: Scenario analysis outcomes
+ - **Ensemble Analysis**: Multi-model combination details
+
+ #### Hourly/15-minute Analysis
+ - **Intraday Metrics**: High-frequency volatility and momentum indicators
+ - **Volume Analysis**: Volume-price trends and momentum
+ - **Real-time Indicators**: Pre/post market data analysis
+
+ ## Technical Details
+
+ ### Regime Detection
+ - Uses Hidden Markov Models with 3 states (low volatility, normal, high volatility)
+ - Falls back to volatility-based detection if HMM is unavailable
+ - Regime probabilities influence trading signal thresholds
+
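The volatility-based fallback can be sketched roughly as follows — an illustrative simplification using 0.8/1.2-style thresholds, not the app's exact code (`classify_regime` is a hypothetical helper name):

```python
import numpy as np
import pandas as pd

def classify_regime(returns: pd.Series, window: int = 20) -> int:
    """Simplified volatility-based regime id: 0 = low vol, 1 = normal, 2 = high vol."""
    vol = returns.rolling(window).std().dropna()
    # Compare the latest rolling volatility to its own 80th percentile
    ratio = vol.iloc[-1] / vol.quantile(0.8)
    if ratio > 1.2:
        return 2
    if ratio < 0.8:
        return 0
    return 1

rng = np.random.default_rng(0)
calm = pd.Series(rng.normal(0, 0.005, 300))
spiky = pd.Series(np.concatenate([rng.normal(0, 0.005, 300),
                                  rng.normal(0, 0.05, 40)]))
print(classify_regime(calm), classify_regime(spiky))
```

A full HMM-based version would replace the threshold rule with per-state probabilities, but the classification it produces feeds the same signal-threshold adjustment.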
+ ### Ensemble Methods
+ - **Chronos**: Primary deep learning model (60% default weight)
+ - **Technical**: Traditional indicators with mean reversion (20% default weight)
+ - **Statistical**: ARIMA-like models with momentum (20% default weight)
+
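The default 60/20/20 weighting amounts to a renormalized weighted average of each model's forecast path. A minimal sketch, with hypothetical placeholder arrays standing in for the three models' outputs:

```python
import numpy as np

weights = {"chronos": 0.6, "technical": 0.2, "statistical": 0.2}
forecasts = {
    "chronos":     np.array([101.0, 102.0, 103.0]),  # placeholder forecast paths
    "technical":   np.array([100.0, 100.5, 101.0]),
    "statistical": np.array([100.5, 101.0, 101.5]),
}

# Renormalize over the models that actually produced a forecast,
# so a missing model does not shrink the ensemble toward zero
total = sum(weights[m] for m in forecasts)
ensemble = sum(weights[m] / total * forecasts[m] for m in forecasts)
print(ensemble)  # ≈ [100.7, 101.5, 102.3]
```

Per-model uncertainties can be combined with the same weights to get an ensemble uncertainty band.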
+ ### Stress Testing Scenarios
+ - **Market Crash**: 3x volatility, -15% return shock
+ - **High Volatility**: 2x volatility, -5% return shock
+ - **Low Volatility**: 0.5x volatility, +2% return shock
+ - **Bull Market**: 1.2x volatility, +10% return shock
+ - **Interest Rate Shock**: 1.5x volatility, -8% return shock
+
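Each scenario above boils down to a (volatility multiplier, return shock) pair applied to a baseline forecast. A rough sketch — the scenario numbers mirror the table, while `apply_scenario` and the baseline path are illustrative:

```python
import numpy as np

SCENARIOS = {
    "market_crash":        {"vol_mult": 3.0, "return_shock": -0.15},
    "high_volatility":     {"vol_mult": 2.0, "return_shock": -0.05},
    "low_volatility":      {"vol_mult": 0.5, "return_shock": 0.02},
    "bull_market":         {"vol_mult": 1.2, "return_shock": 0.10},
    "interest_rate_shock": {"vol_mult": 1.5, "return_shock": -0.08},
}

def apply_scenario(baseline: np.ndarray, vol_mult: float, return_shock: float) -> np.ndarray:
    """Shift the forecast path by the return shock; the vol multiplier
    would widen the confidence band around the shifted path."""
    return baseline * (1.0 + return_shock)

baseline = np.array([100.0, 101.0, 102.0])
stressed = {name: apply_scenario(baseline, **p) for name, p in SCENARIOS.items()}
print(stressed["market_crash"])  # ≈ [85.0, 85.85, 86.7]
```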
+ ### Uncertainty Quantification
+ - Skewness-adjusted confidence intervals
+ - Adaptive smoothing based on prediction uncertainty
+ - Time-varying volatility modeling
+
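One way to make an interval skewness-aware — shown here as a sketch rather than the app's exact formula — is to widen the tail on the side the return distribution leans toward:

```python
def skew_adjusted_interval(point: float, sigma: float, skew: float, z: float = 1.96):
    """Asymmetric confidence interval: the tail in the direction of the
    skew is widened by a crude factor proportional to |skew|."""
    tilt = 0.5 * abs(skew)
    if skew < 0:  # left-skewed returns: fatter downside tail
        return point - z * sigma * (1 + tilt), point + z * sigma
    return point - z * sigma, point + z * sigma * (1 + tilt)

# Hypothetical forecast of 100 with sigma 2 and a left-skewed return history
lo, hi = skew_adjusted_interval(100.0, 2.0, skew=-0.8)
print(round(lo, 2), round(hi, 2))  # 94.51 103.92
```

With zero skew this collapses to the usual symmetric ±z·sigma band.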
+ ## Dependencies
+
+ ### Core
+ - `torch>=2.1.2`: PyTorch for deep learning
+ - `chronos-forecasting>=1.0.0`: Amazon's Chronos model
+ - `yfinance>=0.2.0`: Yahoo Finance data
+ - `gradio>=4.0.0`: Web interface
+
+ ### Advanced Features
+ - `hmmlearn>=0.3.0`: Hidden Markov Models for regime detection
+ - `scipy>=1.10.0`: Scientific computing and statistics
+ - `scikit-learn>=1.0.0`: Machine learning utilities
+ - `plotly>=5.0.0`: Interactive visualizations
+
+ ## Limitations
+
+ 1. **Market Hours**: Intraday data (hourly/15-minute) only available during market hours
+ 2. **Data Quality**: Dependent on yfinance data availability and quality
+ 3. **Model Complexity**: Advanced features may increase computation time
+ 4. **GPU Requirements**: Chronos model requires CUDA-capable GPU for optimal performance
+
+ ## Disclaimer
+
+ This tool is for educational and research purposes only. Stock predictions are inherently uncertain and should not be used as the sole basis for investment decisions. Always conduct thorough research and consider consulting with financial professionals before making investment decisions.

  ## Contributing

+ Contributions are welcome! Please feel free to submit pull requests or open issues for bugs and feature requests.

  ## License

app.py CHANGED
@@ -16,12 +16,37 @@ import gc
  import pytz
  import time
  import random

  # Initialize global variables
  pipeline = None
  scaler = MinMaxScaler(feature_range=(-1, 1))
  scaler.fit_transform([[-1, 1]])

  def retry_yfinance_request(func, max_retries=3, initial_delay=1):
      """
      Retry mechanism for yfinance requests with exponential backoff.
@@ -342,15 +367,24 @@ def calculate_bollinger_bands(prices: pd.Series, period: int = 20, std_dev: int
      return upper_band, middle_band, lower_band

  @spaces.GPU(duration=180)
- def make_prediction(symbol: str, timeframe: str = "1d", prediction_days: int = 5, strategy: str = "chronos") -> Tuple[Dict, go.Figure]:
      """
-     Make prediction using selected strategy with ZeroGPU.

      Args:
          symbol (str): Stock symbol
          timeframe (str): Data timeframe ('1d', '1h', '15m')
          prediction_days (int): Number of days to predict
          strategy (str): Prediction strategy to use

      Returns:
          Tuple[Dict, go.Figure]: Trading signals and visualization plot
@@ -976,11 +1010,590 @@ def calculate_trading_signals(df: pd.DataFrame) -> Dict:

      return signals

  def create_interface():
      """Create the Gradio interface with separate tabs for different timeframes"""
-     with gr.Blocks(title="Structured Product Analysis") as demo:
-         gr.Markdown("# Structured Product Analysis")
-         gr.Markdown("Analyze stocks for inclusion in structured financial products with extended time horizons.")

          # Add market status message
          market_status = "Market is currently closed" if not is_market_open() else "Market is currently open"
@@ -990,6 +1603,50 @@ def create_interface():
          Next trading day: {next_trading_day.strftime('%Y-%m-%d')}
          """)

          with gr.Tabs() as tabs:
              # Daily Analysis Tab
              with gr.TabItem("Daily Analysis"):
@@ -1022,18 +1679,24 @@ def create_interface():

                  with gr.Row():
                      with gr.Column():
-
                          gr.Markdown("### Structured Product Metrics")
                          daily_metrics = gr.JSON(label="Product Metrics")

-                         gr.Markdown("### Risk Analysis")
                          daily_risk_metrics = gr.JSON(label="Risk Metrics")

-                         gr.Markdown("### Sector Analysis")
-                         daily_sector_metrics = gr.JSON(label="Sector Metrics")
-
                          gr.Markdown("### Trading Signals")
                          daily_signals = gr.JSON(label="Trading Signals")

              # Hourly Analysis Tab
              with gr.TabItem("Hourly Analysis"):
@@ -1138,9 +1801,34 @@ def create_interface():
                          gr.Markdown("### Sector & Financial Analysis")
                          min15_sector_metrics = gr.JSON(label="Sector Metrics")

-         def analyze_stock(symbol, timeframe, prediction_days, lookback_days, strategy):
              try:
-                 signals, fig = make_prediction(symbol, timeframe, prediction_days, strategy)

                  # Get historical data for additional metrics
                  df = get_historical_data(symbol, timeframe, lookback_days)
@@ -1161,19 +1849,8 @@ def create_interface():
                      "Price_to_Sales": df['Price_to_Sales'].iloc[-1]
                  }

-                 # Calculate risk metrics
-                 risk_metrics = {
-                     "Annualized_Volatility": df['Annualized_Vol'].iloc[-1],
-                     "Max_Drawdown": df['Max_Drawdown'].iloc[-1],
-                     "Current_Drawdown": df['Drawdown'].iloc[-1],
-                     "Sharpe_Ratio": (df['Returns'].mean() * 252) / (df['Returns'].std() * np.sqrt(252)),
-                     "Sortino_Ratio": (df['Returns'].mean() * 252) / (df['Returns'][df['Returns'] < 0].std() * np.sqrt(252)),
-                     "Return_on_Equity": df['Return_on_Equity'].iloc[-1],
-                     "Return_on_Assets": df['Return_on_Assets'].iloc[-1],
-                     "Debt_to_Equity": df['Debt_to_Equity'].iloc[-1],
-                     "Current_Ratio": df['Current_Ratio'].iloc[-1],
-                     "Quick_Ratio": df['Quick_Ratio'].iloc[-1]
-                 }

                  # Calculate sector metrics
                  sector_metrics = {
@@ -1197,7 +1874,15 @@ def create_interface():
                  }
                  product_metrics.update(intraday_metrics)

-                 return signals, fig, product_metrics, risk_metrics, sector_metrics
              except Exception as e:
                  error_message = str(e)
                  if "Market is currently closed" in error_message:
@@ -1209,40 +1894,44 @@ def create_interface():
                      raise gr.Error(error_message)

          # Daily analysis button click
-         def daily_analysis(s: str, pd: int, ld: int, st: str) -> Tuple[Dict, go.Figure, Dict, Dict, Dict]:
              """
-             Process daily timeframe stock analysis and generate predictions.

              Args:
                  s (str): Stock symbol (e.g., "AAPL", "MSFT", "GOOGL")
                  pd (int): Number of days to predict (1-365)
                  ld (int): Historical lookback period in days (1-3650)
                  st (str): Prediction strategy to use ("chronos" or "technical")

              Returns:
-                 Tuple[Dict, go.Figure, Dict, Dict, Dict]: A tuple containing:
-                     - Trading signals dictionary
-                     - Plotly figure with price and technical analysis
-                     - Product metrics dictionary
-                     - Risk metrics dictionary
-                     - Sector metrics dictionary
-
-             Example:
-                 >>> daily_analysis("AAPL", 30, 365, "chronos")
-                 ({'RSI': 'Neutral', 'MACD': 'Buy', ...}, <Figure>, {...}, {...}, {...})
              """
-             return analyze_stock(s, "1d", pd, ld, st)

          daily_predict_btn.click(
              fn=daily_analysis,
-             inputs=[daily_symbol, daily_prediction_days, daily_lookback_days, daily_strategy],
-             outputs=[daily_signals, daily_plot, daily_metrics, daily_risk_metrics, daily_sector_metrics]
          )

          # Hourly analysis button click
-         def hourly_analysis(s: str, pd: int, ld: int, st: str) -> Tuple[Dict, go.Figure, Dict, Dict, Dict]:
              """
-             Process hourly timeframe stock analysis and generate predictions.

              Args:
                  s (str): Stock symbol (e.g., "AAPL", "MSFT", "GOOGL")
 
  import pytz
  import time
  import random
+ from scipy import stats
+ from scipy.optimize import minimize
+ import warnings
+ warnings.filterwarnings('ignore')
+
+ # Additional imports for advanced features
+ try:
+     from hmmlearn import hmm
+     HMM_AVAILABLE = True
+ except ImportError:
+     HMM_AVAILABLE = False
+     print("Warning: hmmlearn not available. Regime detection will use simplified methods.")
+
+ try:
+     from sklearn.ensemble import RandomForestRegressor
+     from sklearn.linear_model import LinearRegression
+     ENSEMBLE_AVAILABLE = True
+ except ImportError:
+     ENSEMBLE_AVAILABLE = False
+     print("Warning: scikit-learn not available. Ensemble methods will be simplified.")

  # Initialize global variables
  pipeline = None
  scaler = MinMaxScaler(feature_range=(-1, 1))
  scaler.fit_transform([[-1, 1]])

+ # Global market data cache
+ market_data_cache = {}
+ cache_expiry = {}
+ CACHE_DURATION = 3600  # 1 hour cache
+
  def retry_yfinance_request(func, max_retries=3, initial_delay=1):
      """
      Retry mechanism for yfinance requests with exponential backoff.
 
      return upper_band, middle_band, lower_band

  @spaces.GPU(duration=180)
+ def make_prediction(symbol: str, timeframe: str = "1d", prediction_days: int = 5, strategy: str = "chronos",
+                     use_ensemble: bool = True, use_regime_detection: bool = True, use_stress_testing: bool = True,
+                     risk_free_rate: float = 0.02, ensemble_weights: Dict = None,
+                     market_index: str = "^GSPC") -> Tuple[Dict, go.Figure]:
      """
+     Make prediction using selected strategy with advanced features.

      Args:
          symbol (str): Stock symbol
          timeframe (str): Data timeframe ('1d', '1h', '15m')
          prediction_days (int): Number of days to predict
          strategy (str): Prediction strategy to use
+         use_ensemble (bool): Whether to use ensemble methods
+         use_regime_detection (bool): Whether to use regime detection
+         use_stress_testing (bool): Whether to perform stress testing
+         risk_free_rate (float): Risk-free rate for calculations
+         ensemble_weights (Dict): Weights for ensemble models
+         market_index (str): Market index for correlation analysis

      Returns:
          Tuple[Dict, go.Figure]: Trading signals and visualization plot
 

      return signals

+ def get_market_data(symbol: str = "^GSPC", lookback_days: int = 365) -> pd.DataFrame:
+     """
+     Fetch market data (S&P 500 by default) for correlation analysis and regime detection.
+
+     Args:
+         symbol (str): Market index symbol (default: ^GSPC for S&P 500)
+         lookback_days (int): Number of days to look back
+
+     Returns:
+         pd.DataFrame: Market data with returns
+     """
+     cache_key = f"{symbol}_{lookback_days}"
+     current_time = time.time()
+
+     # Check cache
+     if cache_key in market_data_cache and current_time < cache_expiry.get(cache_key, 0):
+         return market_data_cache[cache_key]
+
+     try:
+         ticker = yf.Ticker(symbol)
+         end_date = datetime.now()
+         start_date = end_date - timedelta(days=lookback_days)
+
+         def fetch_market_history():
+             return ticker.history(
+                 start=start_date,
+                 end=end_date,
+                 interval="1d",
+                 prepost=False,
+                 actions=False,
+                 auto_adjust=True
+             )
+
+         df = retry_yfinance_request(fetch_market_history)
+
+         if not df.empty:
+             df['Returns'] = df['Close'].pct_change()
+             df['Volatility'] = df['Returns'].rolling(window=20).std()
+
+         # Cache the data
+         market_data_cache[cache_key] = df
+         cache_expiry[cache_key] = current_time + CACHE_DURATION
+
+         return df
+     except Exception as e:
+         print(f"Warning: Could not fetch market data for {symbol}: {str(e)}")
+         return pd.DataFrame()
+
+ def detect_market_regime(returns: pd.Series, n_regimes: int = 3) -> Dict:
+     """
+     Detect market regime using Hidden Markov Model or simplified methods.
+
+     Args:
+         returns (pd.Series): Price returns
+         n_regimes (int): Number of regimes to detect
+
+     Returns:
+         Dict: Regime information including probabilities and characteristics
+     """
+     if len(returns) < 50:
+         return {"regime": 1, "probabilities": [1.0], "volatility": returns.std()}
+
+     try:
+         if HMM_AVAILABLE:
+             # Use HMM for regime detection (hmmlearn expects a 2-D array,
+             # so convert the Series to values before reshaping)
+             observations = returns.dropna().values.reshape(-1, 1)
+             model = hmm.GaussianHMM(n_components=n_regimes, random_state=42, covariance_type="full")
+             model.fit(observations)
+
+             # Get regime probabilities for the last observation
+             regime_probs = model.predict_proba(observations)
+             current_regime = model.predict(observations)[-1]
+
+             # Calculate regime characteristics
+             regime_means = model.means_.flatten()
+             regime_vols = np.sqrt(model.covars_.diagonal(axis1=1, axis2=2))
+
+             return {
+                 "regime": int(current_regime),
+                 "probabilities": regime_probs[-1].tolist(),
+                 "means": regime_means.tolist(),
+                 "volatilities": regime_vols.tolist(),
+                 "method": "HMM"
+             }
+         else:
+             # Simplified regime detection using volatility clustering
+             volatility = returns.rolling(window=20).std().dropna()
+             vol_percentile = volatility.iloc[-1] / volatility.quantile(0.8)
+
+             if vol_percentile > 1.2:
+                 regime = 2  # High volatility regime
+             elif vol_percentile < 0.8:
+                 regime = 0  # Low volatility regime
+             else:
+                 regime = 1  # Normal regime
+
+             return {
+                 "regime": regime,
+                 "probabilities": [0.1, 0.8, 0.1] if regime == 1 else [0.8, 0.1, 0.1] if regime == 0 else [0.1, 0.1, 0.8],
+                 "volatility": volatility.iloc[-1],
+                 "method": "Volatility-based"
+             }
+     except Exception as e:
+         print(f"Warning: Regime detection failed: {str(e)}")
+         return {"regime": 1, "probabilities": [1.0], "volatility": returns.std(), "method": "Fallback"}
+
+ def calculate_advanced_risk_metrics(df: pd.DataFrame, market_returns: pd.Series = None,
+                                     risk_free_rate: float = 0.02) -> Dict:
+     """
+     Calculate advanced risk metrics including tail risk and market correlation.
+
+     Args:
+         df (pd.DataFrame): Stock data
+         market_returns (pd.Series): Market returns for correlation analysis
+         risk_free_rate (float): Annual risk-free rate
+
+     Returns:
+         Dict: Advanced risk metrics
+     """
+     returns = df['Returns'].dropna()
+
+     if len(returns) < 30:
+         return {"error": "Insufficient data for risk calculation"}
+
+     # Basic metrics
+     annual_return = returns.mean() * 252
+     annual_vol = returns.std() * np.sqrt(252)
+
+     # Market-adjusted metrics
+     if market_returns is not None and len(market_returns) > 0:
+         # Align dates
+         aligned_returns = returns.reindex(market_returns.index).dropna()
+         aligned_market = market_returns.reindex(aligned_returns.index).dropna()
+
+         if len(aligned_returns) > 10:
+             beta = np.cov(aligned_returns, aligned_market)[0, 1] / np.var(aligned_market)
+             alpha = aligned_returns.mean() - beta * aligned_market.mean()
+             correlation = np.corrcoef(aligned_returns, aligned_market)[0, 1]
+         else:
+             beta = 1.0
+             alpha = 0.0
+             correlation = 0.0
+     else:
+         beta = 1.0
+         alpha = 0.0
+         correlation = 0.0
+
+     # Tail risk metrics
+     var_95 = np.percentile(returns, 5)
+     var_99 = np.percentile(returns, 1)
+     cvar_95 = returns[returns <= var_95].mean()
+     cvar_99 = returns[returns <= var_99].mean()
+
+     # Maximum drawdown
+     cumulative_returns = (1 + returns).cumprod()
+     rolling_max = cumulative_returns.expanding().max()
+     drawdown = (cumulative_returns - rolling_max) / rolling_max
+     max_drawdown = drawdown.min()
+
+     # Skewness and kurtosis
+     skewness = stats.skew(returns)
+     kurtosis = stats.kurtosis(returns)
+
+     # Risk-adjusted returns
+     sharpe_ratio = (annual_return - risk_free_rate) / annual_vol if annual_vol > 0 else 0
+     sortino_ratio = (annual_return - risk_free_rate) / (returns[returns < 0].std() * np.sqrt(252)) if returns[returns < 0].std() > 0 else 0
+     calmar_ratio = annual_return / abs(max_drawdown) if max_drawdown != 0 else 0
+
+     # Information ratio (if market data available)
+     if market_returns is not None and len(market_returns) > 0:
+         excess_returns = aligned_returns - aligned_market
+         information_ratio = excess_returns.mean() / excess_returns.std() if excess_returns.std() > 0 else 0
+     else:
+         information_ratio = 0
+
+     return {
+         "Annual_Return": annual_return,
+         "Annual_Volatility": annual_vol,
+         "Sharpe_Ratio": sharpe_ratio,
+         "Sortino_Ratio": sortino_ratio,
+         "Calmar_Ratio": calmar_ratio,
+         "Information_Ratio": information_ratio,
+         "Beta": beta,
+         "Alpha": alpha * 252,
+         "Correlation_with_Market": correlation,
+         "VaR_95": var_95,
+         "VaR_99": var_99,
+         "CVaR_95": cvar_95,
+         "CVaR_99": cvar_99,
+         "Max_Drawdown": max_drawdown,
+         "Skewness": skewness,
+         "Kurtosis": kurtosis,
+         "Risk_Free_Rate": risk_free_rate
+     }
+
1207
+ def create_ensemble_prediction(df: pd.DataFrame, prediction_days: int,
1208
+ ensemble_weights: Dict = None) -> Tuple[np.ndarray, np.ndarray]:
1209
+ """
1210
+ Create ensemble prediction combining multiple models.
1211
+
1212
+ Args:
1213
+ df (pd.DataFrame): Historical data
1214
+ prediction_days (int): Number of days to predict
1215
+ ensemble_weights (Dict): Weights for different models
1216
+
1217
+ Returns:
1218
+ Tuple[np.ndarray, np.ndarray]: Mean and uncertainty predictions
1219
+ """
1220
+ if ensemble_weights is None:
1221
+ ensemble_weights = {"chronos": 0.6, "technical": 0.2, "statistical": 0.2}
1222
+
1223
+ predictions = {}
1224
+ uncertainties = {}
1225
+
1226
+ # Chronos prediction (placeholder - will be filled by main prediction function)
1227
+ predictions["chronos"] = np.array([])
1228
+ uncertainties["chronos"] = np.array([])
1229
+
1230
+ # Technical prediction
1231
+ if ensemble_weights.get("technical", 0) > 0:
1232
+ try:
1233
+ last_price = df['Close'].iloc[-1]
1234
+ rsi = df['RSI'].iloc[-1]
1235
+ macd = df['MACD'].iloc[-1]
1236
+ macd_signal = df['MACD_Signal'].iloc[-1]
1237
+ volatility = df['Volatility'].iloc[-1]
1238
+
1239
+ # Enhanced technical prediction
1240
+ trend = 1 if (rsi > 50 and macd > macd_signal) else -1
1241
+ mean_reversion = (df['SMA_200'].iloc[-1] - last_price) / last_price if 'SMA_200' in df.columns else 0
1242
+
1243
+ tech_pred = []
1244
+ for i in range(1, prediction_days + 1):
1245
+ # Combine trend and mean reversion
1246
+ prediction = last_price * (1 + trend * volatility * 0.3 + mean_reversion * 0.1 * i)
1247
+ tech_pred.append(prediction)
1248
+
1249
+ predictions["technical"] = np.array(tech_pred)
1250
+ uncertainties["technical"] = np.array([volatility * last_price * i for i in range(1, prediction_days + 1)])
1251
+ except Exception as e:
1252
+ print(f"Technical prediction error: {str(e)}")
1253
+ predictions["technical"] = np.array([])
1254
+ uncertainties["technical"] = np.array([])
1255
+
1256
+    # Statistical prediction (moving-average momentum with decay)
+    if ensemble_weights.get("statistical", 0) > 0:
+        try:
+            returns = df['Returns'].dropna()
+            if len(returns) > 10:
+                # Short/long moving averages with momentum
+                ma_short = df['Close'].rolling(window=10).mean().iloc[-1]
+                ma_long = df['Close'].rolling(window=30).mean().iloc[-1]
+                momentum = (ma_short - ma_long) / ma_long
+
+                last_price = df['Close'].iloc[-1]
+                stat_pred = []
+                for i in range(1, prediction_days + 1):
+                    # Mean reversion with momentum and a small decay factor
+                    prediction = last_price * (1 + momentum * 0.5 - 0.001 * i)
+                    stat_pred.append(prediction)
+
+                predictions["statistical"] = np.array(stat_pred)
+                uncertainties["statistical"] = np.array([returns.std() * last_price * np.sqrt(i) for i in range(1, prediction_days + 1)])
+            else:
+                predictions["statistical"] = np.array([])
+                uncertainties["statistical"] = np.array([])
+        except Exception as e:
+            print(f"Statistical prediction error: {str(e)}")
+            predictions["statistical"] = np.array([])
+            uncertainties["statistical"] = np.array([])
+
+    # Combine predictions
+    valid_predictions = {k: v for k, v in predictions.items() if len(v) > 0}
+    valid_uncertainties = {k: v for k, v in uncertainties.items() if len(v) > 0}
+
+    if not valid_predictions:
+        return np.array([]), np.array([])
+
+    # Weighted ensemble
+    total_weight = sum(ensemble_weights.get(k, 0) for k in valid_predictions.keys())
+    if total_weight == 0:
+        return np.array([]), np.array([])
+
+    # Normalize weights
+    normalized_weights = {k: ensemble_weights.get(k, 0) / total_weight for k in valid_predictions.keys()}
+
+    # Calculate weighted mean and uncertainty
+    max_length = max(len(v) for v in valid_predictions.values())
+    ensemble_mean = np.zeros(max_length)
+    ensemble_uncertainty = np.zeros(max_length)
+
+    for model, pred in valid_predictions.items():
+        weight = normalized_weights[model]
+        if len(pred) < max_length:
+            # Extend prediction using its last value
+            extended_pred = np.concatenate([pred, np.full(max_length - len(pred), pred[-1])])
+            extended_unc = np.concatenate([valid_uncertainties[model], np.full(max_length - len(pred), valid_uncertainties[model][-1])])
+        else:
+            extended_pred = pred[:max_length]
+            extended_unc = valid_uncertainties[model][:max_length]
+
+        ensemble_mean += weight * extended_pred
+        ensemble_uncertainty += weight * extended_unc
+
+    return ensemble_mean, ensemble_uncertainty
+
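The combination step above can be exercised on its own. The sketch below (a hypothetical `combine_weighted` helper, not part of app.py) mirrors the same logic: drop empty forecasts, normalize the surviving weights, pad shorter forecasts with their last value, and take the weighted sum of both means and uncertainties.

```python
import numpy as np

def combine_weighted(predictions, uncertainties, weights):
    """Weighted ensemble of variable-length forecasts, padding with last values."""
    valid = {k: v for k, v in predictions.items() if len(v) > 0}
    total = sum(weights.get(k, 0) for k in valid)
    if not valid or total == 0:
        return np.array([]), np.array([])
    norm = {k: weights.get(k, 0) / total for k in valid}
    n = max(len(v) for v in valid.values())
    mean, unc = np.zeros(n), np.zeros(n)
    for k, pred in valid.items():
        pad = n - len(pred)
        # Extend shorter forecasts by repeating their final value
        p = np.concatenate([pred, np.full(pad, pred[-1])]) if pad else pred[:n]
        u = uncertainties[k]
        u = np.concatenate([u, np.full(pad, u[-1])]) if pad else u[:n]
        mean += norm[k] * p
        unc += norm[k] * u
    return mean, unc
```

With equal weights, a two-step forecast of [1, 1] and a one-step forecast of [3] combine to [2, 2], since the shorter forecast is held flat at 3.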
+def stress_test_scenarios(df: pd.DataFrame, prediction: np.ndarray,
+                          scenarios: Dict = None) -> Dict:
+    """
+    Perform stress testing under various market scenarios.
+
+    Args:
+        df (pd.DataFrame): Historical data
+        prediction (np.ndarray): Base prediction
+        scenarios (Dict): Stress test scenarios
+
+    Returns:
+        Dict: Stress test results
+    """
+    if scenarios is None:
+        scenarios = {
+            "market_crash": {"volatility_multiplier": 3.0, "return_shock": -0.15},
+            "high_volatility": {"volatility_multiplier": 2.0, "return_shock": -0.05},
+            "low_volatility": {"volatility_multiplier": 0.5, "return_shock": 0.02},
+            "bull_market": {"volatility_multiplier": 1.2, "return_shock": 0.10},
+            "interest_rate_shock": {"volatility_multiplier": 1.5, "return_shock": -0.08}
+        }
+
+    base_volatility = df['Volatility'].iloc[-1]
+    base_return = df['Returns'].mean()
+    last_price = df['Close'].iloc[-1]
+
+    stress_results = {}
+
+    for scenario_name, params in scenarios.items():
+        try:
+            # Calculate stressed parameters
+            stressed_vol = base_volatility * params["volatility_multiplier"]
+            stressed_return = base_return + params["return_shock"]
+
+            # Generate stressed prediction
+            stressed_pred = []
+            for i, pred in enumerate(prediction):
+                # Apply stress factors
+                stress_factor = 1 + stressed_return * (i + 1) / 252
+                volatility_impact = np.random.normal(0, stressed_vol * np.sqrt((i + 1) / 252))
+                stressed_price = pred * stress_factor * (1 + volatility_impact)
+                stressed_pred.append(stressed_price)
+
+            # Calculate stress metrics
+            stress_results[scenario_name] = {
+                "prediction": np.array(stressed_pred),
+                "max_loss": min(stressed_pred) / last_price - 1,
+                "volatility": stressed_vol,
+                "expected_return": stressed_return,
+                "var_95": np.percentile([p / last_price - 1 for p in stressed_pred], 5)
+            }
+        except Exception as e:
+            print(f"Stress test error for {scenario_name}: {str(e)}")
+            stress_results[scenario_name] = {"error": str(e)}
+
+    return stress_results
+
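Each scenario above scales volatility and shifts returns before re-pricing the base path. A minimal standalone illustration of that per-step transformation (hypothetical `apply_scenario` helper; a seeded generator replaces the bare `np.random.normal` call so runs are reproducible):

```python
import numpy as np

def apply_scenario(base_pred, base_vol, return_shock, vol_mult, seed=0):
    """Re-price a base forecast under a stressed return and volatility regime."""
    rng = np.random.default_rng(seed)
    out = []
    for i, p in enumerate(base_pred):
        # Deterministic drift from the return shock, annualized over 252 days
        stress_factor = 1 + return_shock * (i + 1) / 252
        # Random shock whose scale grows with the stressed volatility and horizon
        vol_impact = rng.normal(0, base_vol * vol_mult * np.sqrt((i + 1) / 252))
        out.append(p * stress_factor * (1 + vol_impact))
    return np.array(out)
```

With `base_vol=0` the random term vanishes and only the drift remains, which makes the drift component easy to verify in isolation.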
+def calculate_skewed_uncertainty(quantiles: np.ndarray, confidence_level: float = 0.9) -> np.ndarray:
+    """
+    Calculate uncertainty accounting for skewness in return distributions.
+
+    Args:
+        quantiles (np.ndarray): Quantile predictions from Chronos
+        confidence_level (float): Confidence level for uncertainty calculation
+
+    Returns:
+        np.ndarray: Uncertainty estimates
+    """
+    try:
+        lower = quantiles[0, :, 0]
+        median = quantiles[0, :, 1]
+        upper = quantiles[0, :, 2]
+
+        # Calculate skewness for each prediction point
+        uncertainties = []
+        for i in range(len(lower)):
+            # Calculate skewness
+            if upper[i] != median[i] and median[i] != lower[i]:
+                skewness = (median[i] - lower[i]) / (upper[i] - median[i])
+            else:
+                skewness = 1.0
+
+            # Adjust z-score based on skewness
+            if skewness > 1.2:  # Right-skewed
+                z_score = stats.norm.ppf(confidence_level) * (1 + 0.1 * skewness)
+            elif skewness < 0.8:  # Left-skewed
+                z_score = stats.norm.ppf(confidence_level) * (1 - 0.1 * abs(skewness))
+            else:
+                z_score = stats.norm.ppf(confidence_level)
+
+            # Calculate uncertainty
+            uncertainty = (upper[i] - lower[i]) / (2 * z_score)
+            uncertainties.append(uncertainty)
+
+        return np.array(uncertainties)
+    except Exception as e:
+        print(f"Skewed uncertainty calculation error: {str(e)}")
+        # Fallback to simple calculation
+        return (quantiles[0, :, 2] - quantiles[0, :, 0]) / (2 * 1.645)
+
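The skew adjustment reduces to picking a z-score per step from the (lower, median, upper) triple. A standalone sketch of that scalar calculation (hypothetical `skew_adjusted_uncertainty` helper; the stdlib `NormalDist` stands in for the app's `scipy.stats.norm.ppf`):

```python
from statistics import NormalDist

def skew_adjusted_uncertainty(lower, median, upper, confidence=0.9):
    """Convert a quantile triple into an uncertainty, widening/narrowing z by skew."""
    z = NormalDist().inv_cdf(confidence)
    # Ratio of the lower half-interval to the upper one; 1.0 means symmetric
    if upper != median and median != lower:
        skew = (median - lower) / (upper - median)
    else:
        skew = 1.0
    if skew > 1.2:        # right-skewed
        z *= 1 + 0.1 * skew
    elif skew < 0.8:      # left-skewed
        z *= 1 - 0.1 * abs(skew)
    return (upper - lower) / (2 * z)
```

For a symmetric triple like (9, 10, 11) the skew ratio is exactly 1, so the result is just the interval width over twice the unadjusted z-score.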
+def adaptive_smoothing(new_pred: np.ndarray, historical_pred: np.ndarray,
+                       prediction_uncertainty: np.ndarray) -> np.ndarray:
+    """
+    Apply adaptive smoothing based on prediction uncertainty.
+
+    Args:
+        new_pred (np.ndarray): New predictions
+        historical_pred (np.ndarray): Historical predictions
+        prediction_uncertainty (np.ndarray): Prediction uncertainty
+
+    Returns:
+        np.ndarray: Smoothed predictions
+    """
+    try:
+        if len(historical_pred) == 0:
+            return new_pred
+
+        # Calculate adaptive alpha from the average uncertainty relative to price scale
+        # (reduced to a scalar so the comparisons below are well-defined)
+        uncertainty_ratio = np.mean(prediction_uncertainty) / np.mean(np.abs(historical_pred))
+
+        if uncertainty_ratio > 0.1:  # High uncertainty
+            alpha = 0.1  # More smoothing
+        elif uncertainty_ratio < 0.05:  # Low uncertainty
+            alpha = 0.5  # Less smoothing
+        else:
+            alpha = 0.3  # Default
+
+        # Blend new predictions with the tail of the historical ones
+        smoothed = alpha * new_pred + (1 - alpha) * historical_pred[-len(new_pred):]
+        return smoothed
+    except Exception as e:
+        print(f"Adaptive smoothing error: {str(e)}")
+        return new_pred
+
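The alpha schedule above is easy to test in isolation. A sketch with a hypothetical `smooth` helper that takes the (scalar) uncertainty ratio directly:

```python
import numpy as np

def smooth(new_pred, hist_pred, uncertainty_ratio):
    """Blend new and historical predictions; higher uncertainty -> more smoothing."""
    if len(hist_pred) == 0:
        return new_pred
    if uncertainty_ratio > 0.1:      # high uncertainty: trust history more
        alpha = 0.1
    elif uncertainty_ratio < 0.05:   # low uncertainty: trust the new forecast
        alpha = 0.5
    else:
        alpha = 0.3
    # Exponential-style blend against the tail of the historical forecast
    return alpha * new_pred + (1 - alpha) * hist_pred[-len(new_pred):]
```

For example, with high uncertainty (ratio 0.2), a new forecast of 10 against a historical 20 blends to 0.1 * 10 + 0.9 * 20 = 19.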
+def advanced_trading_signals(df: pd.DataFrame, regime_info: Dict = None) -> Dict:
+    """
+    Generate advanced trading signals with confidence levels and regime awareness.
+
+    Args:
+        df (pd.DataFrame): Stock data
+        regime_info (Dict): Market regime information
+
+    Returns:
+        Dict: Advanced trading signals
+    """
+    try:
+        # Calculate signal strength and confidence
+        rsi = df['RSI'].iloc[-1]
+        macd = df['MACD'].iloc[-1]
+        macd_signal = df['MACD_Signal'].iloc[-1]
+
+        rsi_strength = abs(rsi - 50) / 50  # 0-1 scale
+        macd_strength = abs(macd - macd_signal) / df['Close'].iloc[-1]
+
+        # Regime-adjusted thresholds
+        if regime_info and "volatilities" in regime_info:
+            volatility_regime = df['Volatility'].iloc[-1] / np.mean(regime_info["volatilities"])
+        else:
+            volatility_regime = 1.0
+
+        # Adjust RSI thresholds based on volatility
+        rsi_oversold = 30 + (volatility_regime - 1) * 10
+        rsi_overbought = 70 - (volatility_regime - 1) * 10
+
+        # Calculate signals with confidence
+        signals = {}
+
+        # RSI signal
+        if rsi < rsi_oversold:
+            rsi_label = "Oversold"
+            rsi_confidence = min(0.9, 0.5 + rsi_strength * 0.4)
+        elif rsi > rsi_overbought:
+            rsi_label = "Overbought"
+            rsi_confidence = min(0.9, 0.5 + rsi_strength * 0.4)
+        else:
+            rsi_label = "Neutral"
+            rsi_confidence = 0.3
+
+        signals["RSI"] = {
+            "signal": rsi_label,
+            "strength": rsi_strength,
+            "confidence": rsi_confidence,
+            "value": rsi
+        }
+
+        # MACD signal (use a separate label so the numeric macd_signal is not shadowed)
+        macd_label = "Buy" if macd > macd_signal else "Sell"
+        macd_confidence = min(0.8, 0.4 + macd_strength * 40)
+
+        signals["MACD"] = {
+            "signal": macd_label,
+            "strength": macd_strength,
+            "confidence": macd_confidence,
+            "value": macd
+        }
+
+        # Bollinger Bands signal
+        if 'BB_Upper' in df.columns and 'BB_Lower' in df.columns:
+            current_price = df['Close'].iloc[-1]
+            bb_upper = df['BB_Upper'].iloc[-1]
+            bb_lower = df['BB_Lower'].iloc[-1]
+
+            if current_price < bb_lower:
+                bb_signal = "Buy"
+                bb_confidence = 0.7
+            elif current_price > bb_upper:
+                bb_signal = "Sell"
+                bb_confidence = 0.7
+            else:
+                bb_signal = "Hold"
+                bb_confidence = 0.5
+
+            signals["Bollinger"] = {
+                "signal": bb_signal,
+                "confidence": bb_confidence,
+                "position": (current_price - bb_lower) / (bb_upper - bb_lower) if bb_upper != bb_lower else 0.5
+            }
+
+        # SMA signal
+        if 'SMA_20' in df.columns and 'SMA_50' in df.columns:
+            sma_20 = df['SMA_20'].iloc[-1]
+            sma_50 = df['SMA_50'].iloc[-1]
+
+            if sma_20 > sma_50:
+                sma_signal = "Buy"
+                sma_confidence = 0.6
+            else:
+                sma_signal = "Sell"
+                sma_confidence = 0.6
+
+            signals["SMA"] = {
+                "signal": sma_signal,
+                "confidence": sma_confidence,
+                "ratio": sma_20 / sma_50 if sma_50 != 0 else 1.0
+            }
+
+        # Calculate weighted overall signal; count Oversold as a buy vote and
+        # Overbought as a sell vote, and default strength to 1.0 for signals
+        # (Bollinger, SMA) that do not report one
+        buy_signals = []
+        sell_signals = []
+
+        for signal_name, signal_data in signals.items():
+            vote = signal_data.get("strength", 1.0) * signal_data["confidence"]
+            if signal_data["signal"] in ("Buy", "Oversold"):
+                buy_signals.append(vote)
+            elif signal_data["signal"] in ("Sell", "Overbought"):
+                sell_signals.append(vote)
+
+        weighted_buy = sum(buy_signals) if buy_signals else 0
+        weighted_sell = sum(sell_signals) if sell_signals else 0
+
+        if weighted_buy > weighted_sell:
+            overall_signal = "Buy"
+            overall_confidence = weighted_buy / (weighted_buy + weighted_sell) if (weighted_buy + weighted_sell) > 0 else 0
+        elif weighted_sell > weighted_buy:
+            overall_signal = "Sell"
+            overall_confidence = weighted_sell / (weighted_buy + weighted_sell) if (weighted_buy + weighted_sell) > 0 else 0
+        else:
+            overall_signal = "Hold"
+            overall_confidence = 0.5
+
+        return {
+            "signals": signals,
+            "overall_signal": overall_signal,
+            "confidence": overall_confidence,
+            "regime_adjusted": regime_info is not None
+        }
+
+    except Exception as e:
+        print(f"Advanced trading signals error: {str(e)}")
+        return {"error": str(e)}
+
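The vote aggregation can be factored out and tested on its own. The sketch below (a hypothetical `aggregate` helper, not part of app.py) defaults strength to 1.0 for indicators that do not report one and counts RSI's Oversold/Overbought labels as buy/sell votes:

```python
import math

def aggregate(signals):
    """Confidence-weighted vote over per-indicator signal dicts."""
    buy = sum(s.get("strength", 1.0) * s["confidence"]
              for s in signals.values() if s["signal"] in ("Buy", "Oversold"))
    sell = sum(s.get("strength", 1.0) * s["confidence"]
               for s in signals.values() if s["signal"] in ("Sell", "Overbought"))
    total = buy + sell
    if buy > sell:
        return "Buy", buy / total
    if sell > buy:
        return "Sell", sell / total
    return "Hold", 0.5
```

For instance, a weak Buy from MACD (strength 0.5, confidence 0.8) against a full-strength Sell from SMA (confidence 0.6) yields an overall Sell with confidence 0.6.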
 def create_interface():
     """Create the Gradio interface with separate tabs for different timeframes"""
+    with gr.Blocks(title="Advanced Stock Prediction Analysis") as demo:
+        gr.Markdown("# Advanced Stock Prediction Analysis")
+        gr.Markdown("Analyze stocks with advanced features including regime detection, ensemble methods, and stress testing.")

         # Add market status message
         market_status = "Market is currently closed" if not is_market_open() else "Market is currently open"

         Next trading day: {next_trading_day.strftime('%Y-%m-%d')}
         """)

+        # Advanced Settings Accordion
+        with gr.Accordion("Advanced Settings", open=False):
+            with gr.Row():
+                with gr.Column():
+                    use_ensemble = gr.Checkbox(label="Use Ensemble Methods", value=True)
+                    use_regime_detection = gr.Checkbox(label="Use Regime Detection", value=True)
+                    use_stress_testing = gr.Checkbox(label="Use Stress Testing", value=True)
+                    risk_free_rate = gr.Slider(
+                        minimum=0.0,
+                        maximum=0.1,
+                        value=0.02,
+                        step=0.001,
+                        label="Risk-Free Rate (Annual)"
+                    )
+                    market_index = gr.Dropdown(
+                        choices=["^GSPC", "^DJI", "^IXIC", "^RUT"],
+                        label="Market Index for Correlation",
+                        value="^GSPC"
+                    )
+
+                with gr.Column():
+                    gr.Markdown("### Ensemble Weights")
+                    chronos_weight = gr.Slider(
+                        minimum=0.0,
+                        maximum=1.0,
+                        value=0.6,
+                        step=0.1,
+                        label="Chronos Weight"
+                    )
+                    technical_weight = gr.Slider(
+                        minimum=0.0,
+                        maximum=1.0,
+                        value=0.2,
+                        step=0.1,
+                        label="Technical Weight"
+                    )
+                    statistical_weight = gr.Slider(
+                        minimum=0.0,
+                        maximum=1.0,
+                        value=0.2,
+                        step=0.1,
+                        label="Statistical Weight"
+                    )
+
         with gr.Tabs() as tabs:
             # Daily Analysis Tab
             with gr.TabItem("Daily Analysis"):

                 with gr.Row():
                     with gr.Column():

                         gr.Markdown("### Structured Product Metrics")
                         daily_metrics = gr.JSON(label="Product Metrics")

+                        gr.Markdown("### Advanced Risk Analysis")
                         daily_risk_metrics = gr.JSON(label="Risk Metrics")

+                        gr.Markdown("### Market Regime Analysis")
+                        daily_regime_metrics = gr.JSON(label="Regime Metrics")
+
                         gr.Markdown("### Trading Signals")
                         daily_signals = gr.JSON(label="Trading Signals")
+
+                    with gr.Column():
+                        gr.Markdown("### Stress Test Results")
+                        daily_stress_results = gr.JSON(label="Stress Test Results")
+
+                        gr.Markdown("### Ensemble Analysis")
+                        daily_ensemble_metrics = gr.JSON(label="Ensemble Metrics")

             # Hourly Analysis Tab
             with gr.TabItem("Hourly Analysis"):

                 gr.Markdown("### Sector & Financial Analysis")
                 min15_sector_metrics = gr.JSON(label="Sector Metrics")

+        def analyze_stock(symbol, timeframe, prediction_days, lookback_days, strategy,
+                          use_ensemble, use_regime_detection, use_stress_testing,
+                          risk_free_rate, market_index, chronos_weight, technical_weight, statistical_weight):
             try:
+                # Create ensemble weights
+                ensemble_weights = {
+                    "chronos": chronos_weight,
+                    "technical": technical_weight,
+                    "statistical": statistical_weight
+                }
+
+                # Get market data for correlation analysis
+                market_df = get_market_data(market_index, lookback_days)
+                market_returns = market_df['Returns'] if not market_df.empty else None
+
+                # Make prediction with advanced features
+                signals, fig = make_prediction(
+                    symbol=symbol,
+                    timeframe=timeframe,
+                    prediction_days=prediction_days,
+                    strategy=strategy,
+                    use_ensemble=use_ensemble,
+                    use_regime_detection=use_regime_detection,
+                    use_stress_testing=use_stress_testing,
+                    risk_free_rate=risk_free_rate,
+                    ensemble_weights=ensemble_weights,
+                    market_index=market_index
+                )

                 # Get historical data for additional metrics
                 df = get_historical_data(symbol, timeframe, lookback_days)

                     "Price_to_Sales": df['Price_to_Sales'].iloc[-1]
                 }

+                # Calculate advanced risk metrics
+                risk_metrics = calculate_advanced_risk_metrics(df, market_returns, risk_free_rate)

                 # Calculate sector metrics
                 sector_metrics = {

                 }
                 product_metrics.update(intraday_metrics)

+                # Extract regime and stress test information
+                regime_metrics = signals.get("regime_info", {})
+                stress_results = signals.get("stress_test_results", {})
+                ensemble_metrics = {
+                    "ensemble_used": signals.get("ensemble_used", False),
+                    "ensemble_weights": ensemble_weights
+                }
+
+                return signals, fig, product_metrics, risk_metrics, sector_metrics, regime_metrics, stress_results, ensemble_metrics
             except Exception as e:
                 error_message = str(e)
                 if "Market is currently closed" in error_message:

                 raise gr.Error(error_message)

         # Daily analysis button click
+        def daily_analysis(s: str, pd: int, ld: int, st: str, ue: bool, urd: bool, ust: bool,
+                           rfr: float, mi: str, cw: float, tw: float, sw: float) -> Tuple[Dict, go.Figure, Dict, Dict, Dict, Dict, Dict, Dict]:
             """
+            Process daily timeframe stock analysis with advanced features.

             Args:
                 s (str): Stock symbol (e.g., "AAPL", "MSFT", "GOOGL")
                 pd (int): Number of days to predict (1-365)
                 ld (int): Historical lookback period in days (1-3650)
                 st (str): Prediction strategy to use ("chronos" or "technical")
+                ue (bool): Use ensemble methods
+                urd (bool): Use regime detection
+                ust (bool): Use stress testing
+                rfr (float): Risk-free rate
+                mi (str): Market index
+                cw (float): Chronos weight
+                tw (float): Technical weight
+                sw (float): Statistical weight

             Returns:
+                Tuple containing all analysis results
             """
+            return analyze_stock(s, "1d", pd, ld, st, ue, urd, ust, rfr, mi, cw, tw, sw)

         daily_predict_btn.click(
             fn=daily_analysis,
+            inputs=[daily_symbol, daily_prediction_days, daily_lookback_days, daily_strategy,
+                    use_ensemble, use_regime_detection, use_stress_testing, risk_free_rate, market_index,
+                    chronos_weight, technical_weight, statistical_weight],
+            outputs=[daily_signals, daily_plot, daily_metrics, daily_risk_metrics, daily_sector_metrics,
+                     daily_regime_metrics, daily_stress_results, daily_ensemble_metrics]
         )

         # Hourly analysis button click
+        def hourly_analysis(s: str, pd: int, ld: int, st: str, ue: bool, urd: bool, ust: bool,
+                            rfr: float, mi: str, cw: float, tw: float, sw: float) -> Tuple[Dict, go.Figure, Dict, Dict, Dict]:
             """
+            Process hourly timeframe stock analysis with advanced features.

             Args:
                 s (str): Stock symbol (e.g., "AAPL", "MSFT", "GOOGL")
requirements.txt CHANGED
@@ -92,4 +92,8 @@ typer
 diskcache
 anthropic
 gradio>=4.0.0
-chronos-forecasting>=1.0.0
+chronos-forecasting>=1.0.0
+
+# Advanced features dependencies
+hmmlearn>=0.3.0
+scipy>=1.10.0