Spaces:

QuantumLearner
/

Space21

Sleeping

App Files Files Community

QuantumLearner commited on Jul 20, 2024

Commit

37fd55b

verified ·

1 Parent(s): 510dfb7

Update app.py

Browse files

Files changed (1) hide show

app.py +44 -57

app.py CHANGED Viewed

@@ -9,21 +9,21 @@ import networkx as nx
 # Streamlit app setup
 st.set_page_config(layout="wide")
-st.title("Herding Behaviour Analysis in Cryptocurrency Markets")
 st.markdown(
     """
-    This app analyzes herding behavior in cryptocurrency markets by examining price movements and correlations.
-    You can specify the cryptocurrency pairs, time period, and other parameters for the analysis.
     The app includes various analyses such as price reindexing, Kalman Filter estimation of a common factor,
-    CSSD and CSAD calculations, rolling CSSD and CSAD, and network visualization of correlations.
     """
 )
 st.sidebar.header("How to use")
 st.sidebar.write(
     """
-    1. Select the cryptocurrency pairs.
     2. Choose the time period.
     3. Set additional parameters for the analyses.
     4. Click 'Run Analysis' to see the results.
@@ -35,6 +35,7 @@ tickers = st.sidebar.text_area("Asset Symbol (Crypto-Pair or Stock Ticker) (comm
 start_date = st.sidebar.date_input("Start Date", value=pd.to_datetime("2020-01-01"))
 end_date = st.sidebar.date_input("End Date", value=pd.to_datetime("2025-01-01"))
 market_index = st.sidebar.text_input("Market Index Ticker", value="BTC-USD")
 run_button = st.sidebar.button("Run Analysis")
 if run_button:
@@ -42,7 +43,7 @@ if run_button:
     if market_index not in tickers:
         tickers.append(market_index)
-    # Fetching cryptocurrency data
     data = yf.download(tickers, start=start_date, end=end_date)['Close']
     # Clean the data by filling or dropping NaN and infinite values
@@ -50,8 +51,8 @@ if run_button:
     data = data.replace([np.inf, -np.inf], np.nan).dropna()
     # Reindexing prices to start at 0
-    st.markdown("#### Cryptocurrency Prices Reindexed to Start at 0")
-    st.markdown("This analysis reindexes cryptocurrency prices to start at 0, making it easier to compare their relative movements over time.")
     data_reindexed = data.apply(lambda x: x / x.iloc[0])
     fig = go.Figure()
@@ -60,7 +61,7 @@ if run_button:
         fig.add_trace(go.Scatter(x=data_reindexed.index, y=data_reindexed[ticker], mode='lines', name=ticker))
     fig.update_layout(
-        title="Cryptocurrency Prices Reindexed to Start at 0",
         xaxis_title="Date",
         yaxis_title="Reindexed Price",
         template="plotly_dark"
@@ -74,8 +75,8 @@ if run_button:
     returns = returns.replace([np.inf, -np.inf], np.nan).dropna()
     # Kalman Filter: Estimating a common factor
-    st.markdown("#### Kalman Filter: Estimated Common Factor and Cryptocurrency Returns")
-    st.markdown("This analysis uses the Kalman Filter to estimate a common factor influencing all cryptocurrency returns. It compares individual cryptocurrency returns to the estimated common factor.")
     st.markdown("The Kalman Filter operates based on the following state-space model:")
@@ -89,17 +90,15 @@ if run_button:
     st.markdown("Where:")
     st.markdown(r"""
-    \begin{itemize}
-    \item \(\mathbf{x}_t\) is the state vector (the common factor we are estimating).
-    \item \(\mathbf{A}\) is the state transition matrix (set to the identity matrix \(\mathbf{I}\)).
-    \item \(\mathbf{w}_t\) is the process noise (with covariance \(\mathbf{Q}\)).
-    \item \(\mathbf{y}_t\) is the observation vector (cryptocurrency returns).
-    \item \(\mathbf{H}\) is the observation matrix (set to a vector of ones).
-    \item \(\mathbf{v}_t\) is the observation noise (with covariance \(\mathbf{R}\)).
-    \end{itemize}
     """)
-    st.markdown("The Kalman Filter recursively estimates the state vector \(\mathbf{x}_t\) using the observed cryptocurrency returns \(\mathbf{y}_t\). The estimated common factor is then compared to individual cryptocurrency returns.")
     st.markdown("""
     The steps to derive the common factor are:
@@ -107,17 +106,15 @@ if run_button:
     2. **Prediction:** Use the state equation to predict the state vector at the next time step.
     3. **Update:** Use the observation equation and the actual observed returns to update the estimate of the state vector.
-    This process repeats for each time step, producing an estimated common factor that influences all cryptocurrency returns.
     """)
     st.markdown("""
     **How to Interpret the Results:**
-    - **Estimated Common Factor:** This represents the underlying factor that influences all the cryptocurrency returns. If the common factor is high, it indicates that most cryptocurrencies are experiencing high returns. Conversely, if the common factor is low, it indicates that most cryptocurrencies are experiencing low returns.
-    - **Individual Cryptocurrency Returns vs. Common Factor:** By comparing the individual cryptocurrency returns to the estimated common factor, you can identify which cryptocurrencies are moving with the market trend and which are moving independently.
-    - **Deviation from the Common Factor:** Cryptocurrencies that deviate significantly from the common factor may be influenced by specific news or events, whereas cryptocurrencies that closely follow the common factor are more influenced by market-wide factors.
-    By observing these comparisons, you can gain insights into the behavior of individual cryptocurrencies relative to the market and identify potential outliers or trend-followers.
     """)
     observations = returns.values
@@ -138,7 +135,7 @@ if run_button:
     fig.add_trace(go.Scatter(x=returns.index, y=state_means[:, 0], mode='lines', name='Estimated Common Factor', line=dict(color='red', width=4)))
     fig.update_layout(
-        title='Kalman Filter: Estimated Common Factor and Cryptocurrency Returns',
         xaxis_title='Date',
         yaxis_title='Returns',
         template='plotly_dark'
@@ -147,39 +144,29 @@ if run_button:
     # CSSD and CSAD calculations
     st.markdown("#### CSSD and CSAD Calculations")
-    st.markdown("This analysis calculates the Cross-Sectional Standard Deviation (CSSD) and Cross-Sectional Absolute Deviation (CSAD) of cryptocurrency returns. These metrics help to identify herding behavior in the market.")
     st.markdown("The formulas for CSSD and CSAD are as follows:")
     st.markdown("**CSSD (Cross-Sectional Standard Deviation):**")
     st.latex(r"\text{CSSD}_t = \sqrt{\frac{\sum_{i=1}^{N} (R_{i,t} - \overline{R}_t)^2}{N - 1}}")
-    st.markdown("""
-    Where:
-    - \(R_{i,t}\) is the return of cryptocurrency \(i\) at time \(t\).
-    - \(\overline{R}_t\) is the average return of all cryptocurrencies at time \(t\).
-    - \(N\) is the number of cryptocurrencies.
-    """)
     st.markdown("**CSAD (Cross-Sectional Absolute Deviation):**")
     st.latex(r"\text{CSAD}_t = \frac{\sum_{i=1}^{N} |R_{i,t} - \overline{R}_t|}{N}")
     st.markdown("""
     Where:
-    - \(R_{i,t}\) is the return of cryptocurrency \(i\) at time \(t\).
-    - \(\overline{R}_t\) is the average return of all cryptocurrencies at time \(t\).
-    - \(N\) is the number of cryptocurrencies.
     """)
-    st.markdown("These metrics help to identify herding behavior by measuring the dispersion of individual cryptocurrency returns around the market return.")
     st.markdown("""
     **How to Interpret the Results:**
-    - **CSSD (Cross-Sectional Standard Deviation):** A higher CSSD indicates greater dispersion of individual cryptocurrency returns around the market return, suggesting less herding behavior. Conversely, a lower CSSD indicates that cryptocurrency returns are more closely clustered around the market return, suggesting more herding behavior.
-    - **CSAD (Cross-Sectional Absolute Deviation):** A higher CSAD also indicates greater dispersion of individual cryptocurrency returns around the market return, suggesting less herding behavior. A lower CSAD suggests more herding behavior as cryptocurrency returns are more closely clustered around the market return.
-    By observing the trends in CSSD and CSAD over time, you can identify periods of increased or decreased herding behavior in the market.
     """)
     market_return = returns[market_index]
@@ -235,31 +222,31 @@ if run_button:
     )
     st.plotly_chart(fig, use_container_width=True)
-    # Network visualization of cryptocurrency correlations
-    st.markdown("#### Network Visualization of Cryptocurrency Correlations")
-    st.markdown("This analysis visualizes the correlations between cryptocurrencies as a network. Cryptocurrencies are connected by edges if their correlation exceeds a threshold.")
     st.markdown("""
     **How to Interpret the Results:**
-    - **Nodes:** Each node represents a cryptocurrency. The position of the nodes is determined by a spring layout algorithm, which places highly connected nodes closer together.
-    - **Edges:** An edge (or line) between two nodes indicates that the correlation between the two cryptocurrencies exceeds the specified threshold (0.5 in this case).
     - **Edge Thickness:** The thickness of the edge represents the strength of the correlation. Thicker edges indicate higher correlations.
-    - **Cluster Formation:** Groups of nodes that are densely connected to each other represent clusters of cryptocurrencies that move together. This can indicate sector-specific movements or broader market trends.
-    - **Isolated Nodes:** Nodes that are not connected to others suggest that those cryptocurrencies do not have strong correlations with the rest of the market within the given threshold.
-    By examining this network, you can identify groups of cryptocurrencies that tend to move together, which may reflect sector-specific behavior or broader market dynamics. This can provide insights into the structure of the market and potential areas of risk or opportunity.
     """)
     years = data.index.year.unique()
-    def plot_network_for_year(data_for_year, year):
         corr_matrix = data_for_year.corr()
         G = nx.Graph()
         for ticker in tickers:
             G.add_node(ticker)
-        threshold = 0.5
         for i in range(len(tickers)):
             for j in range(i+1, len(tickers)):
                 if abs(corr_matrix.iloc[i, j]) > threshold:
@@ -294,7 +281,7 @@ if run_button:
         )
         layout = go.Layout(
-            title=f'Cryptocurrency Correlation Network for {year}',
             showlegend=False,
             hovermode='closest',
             margin=dict(b=20, l=5, r=5, t=40),
@@ -307,7 +294,7 @@ if run_button:
     for year in years:
         data_for_year = data[data.index.year == year]
-        plot_network_for_year(data_for_year, year)
 hide_streamlit_style = """
 <style>
@@ -315,4 +302,4 @@ hide_streamlit_style = """
 footer {visibility: hidden;}
 </style>
 """
-st.markdown(hide_streamlit_style, unsafe_allow_html=True)

 # Streamlit app setup
 st.set_page_config(layout="wide")
+st.title("Herding Behaviour Analysis in Financials Markets")
 st.markdown(
     """
+    This app analyzes herding behavior in financial markets by examining price movements and correlations.
+    You can specify the stock ticker or Asset pairs, time period, and other parameters for the analysis.
     The app includes various analyses such as price reindexing, Kalman Filter estimation of a common factor,
+    CSSD and CSAD calculations, and network visualization of correlations over time.
     """
 )
 st.sidebar.header("How to use")
 st.sidebar.write(
     """
+    1. Select the stock ticker Asset pairs.
     2. Choose the time period.
     3. Set additional parameters for the analyses.
     4. Click 'Run Analysis' to see the results.
 start_date = st.sidebar.date_input("Start Date", value=pd.to_datetime("2020-01-01"))
 end_date = st.sidebar.date_input("End Date", value=pd.to_datetime("2025-01-01"))
 market_index = st.sidebar.text_input("Market Index Ticker", value="BTC-USD")
+correlation_threshold = st.sidebar.slider("Correlation Threshold (for Network Analysis)", min_value=0.0, max_value=1.0, value=0.75, step=0.05)
 run_button = st.sidebar.button("Run Analysis")
 if run_button:
     if market_index not in tickers:
         tickers.append(market_index)
+    # Fetching Asset data
     data = yf.download(tickers, start=start_date, end=end_date)['Close']
     # Clean the data by filling or dropping NaN and infinite values
     data = data.replace([np.inf, -np.inf], np.nan).dropna()
     # Reindexing prices to start at 0
+    st.markdown("#### Asset Prices Reindexed to Start at 0")
+    st.markdown("This analysis reindexes asset prices to start at 0, making it easier to compare their relative movements over time.")
     data_reindexed = data.apply(lambda x: x / x.iloc[0])
     fig = go.Figure()
         fig.add_trace(go.Scatter(x=data_reindexed.index, y=data_reindexed[ticker], mode='lines', name=ticker))
     fig.update_layout(
+        title="Asset Prices Reindexed to Start at 0",
         xaxis_title="Date",
         yaxis_title="Reindexed Price",
         template="plotly_dark"
     returns = returns.replace([np.inf, -np.inf], np.nan).dropna()
     # Kalman Filter: Estimating a common factor
+    st.markdown("#### Kalman Filter: Estimated Common Factor and Asset Returns")
+    st.markdown("This analysis uses the Kalman Filter to estimate a common factor influencing all asset returns. It compares individual asset returns to the estimated common factor.")
     st.markdown("The Kalman Filter operates based on the following state-space model:")
     st.markdown("Where:")
     st.markdown(r"""
+    - \(xt\) is the state vector (the common factor we are estimating).
+    - \(A\) is the state transition matrix (set to the identity matrix \(I\)).
+    - \(wt\) is the process noise (with covariance \(Q\)).
+    - \(yt\) is the observation vector (asset returns).
+    - \(H\) is the observation matrix (set to a vector of ones).
+    - \(vt\) is the observation noise (with covariance \(R\)).
     """)
+    st.markdown("The Kalman Filter recursively estimates the state vector \(xt\) using the observed asset returns \( yt \). The estimated common factor is then compared to individual asset returns.")
     st.markdown("""
     The steps to derive the common factor are:
     2. **Prediction:** Use the state equation to predict the state vector at the next time step.
     3. **Update:** Use the observation equation and the actual observed returns to update the estimate of the state vector.
+    This process repeats for each time step, producing an estimated common factor that influences all asset returns.
     """)
     st.markdown("""
     **How to Interpret the Results:**
+    - **Estimated Common Factor:** This represents the underlying factor that influences all the asset returns. If the common factor is high, it indicates that most assets are experiencing high returns. Conversely, if the common factor is low, it indicates that most asset are experiencing low returns.
+    - **Individual Asset Returns vs. Common Factor:** By comparing the individual asset returns to the estimated common factor, you can identify which asset are moving with the market trend and which are moving independently.
+    - **Deviation from the Common Factor:** Assets that deviate significantly from the common factor may be influenced by specific news or events, whereas assets that closely follow the common factor are more influenced by market-wide factors.
     """)
     observations = returns.values
     fig.add_trace(go.Scatter(x=returns.index, y=state_means[:, 0], mode='lines', name='Estimated Common Factor', line=dict(color='red', width=4)))
     fig.update_layout(
+        title='Kalman Filter: Estimated Common Factor and Asset Returns',
         xaxis_title='Date',
         yaxis_title='Returns',
         template='plotly_dark'
     # CSSD and CSAD calculations
     st.markdown("#### CSSD and CSAD Calculations")
+    st.markdown("This analysis calculates the Cross-Sectional Standard Deviation (CSSD) and Cross-Sectional Absolute Deviation (CSAD) of Asset returns.")
     st.markdown("The formulas for CSSD and CSAD are as follows:")
     st.markdown("**CSSD (Cross-Sectional Standard Deviation):**")
     st.latex(r"\text{CSSD}_t = \sqrt{\frac{\sum_{i=1}^{N} (R_{i,t} - \overline{R}_t)^2}{N - 1}}")
     st.markdown("**CSAD (Cross-Sectional Absolute Deviation):**")
     st.latex(r"\text{CSAD}_t = \frac{\sum_{i=1}^{N} |R_{i,t} - \overline{R}_t|}{N}")
     st.markdown("""
     Where:
+    - Rit is the return of Asset i at time t.
+    - Rt is the average return of all Assets at time t.
+    - N is the number of Assets.
     """)
+    st.markdown("These metrics help to identify the dispersion of individual asset returns around the market return.")
     st.markdown("""
     **How to Interpret the Results:**
+    - **CSSD (Cross-Sectional Standard Deviation) and CSAD (Cross-Sectional Absolute Deviation):** Higher values indicate greater dispersion of asset returns around the market return, suggesting less herding behavior. Lower values indicate more clustering, suggesting more herding behavior.
     """)
     market_return = returns[market_index]
     )
     st.plotly_chart(fig, use_container_width=True)
+    # Network visualization of Asset correlations
+    st.markdown("#### Network Visualization of Asset Correlations")
+    st.markdown("This analysis visualizes the correlations between Assets as a network. Assets are connected by edges if their correlation exceeds a threshold.")
     st.markdown("""
     **How to Interpret the Results:**
+    - **Nodes:** Each node represents a Asset. The position of the nodes is determined by a spring layout algorithm, which places highly connected nodes closer together.
+    - **Edges:** An edge (or line) between two nodes indicates that the correlation between the two Assets exceeds the specified threshold (0.5 in this case).
     - **Edge Thickness:** The thickness of the edge represents the strength of the correlation. Thicker edges indicate higher correlations.
+    - **Cluster Formation:** Groups of nodes that are densely connected to each other represent clusters of Assets that move together. This can indicate sector-specific movements or broader market trends.
+    - **Isolated Nodes:** Nodes that are not connected to others suggest that those Assets do not have strong correlations with the rest of the market within the given threshold.
+    By examining this network, you can identify groups of Assets that tend to move together, which may reflect sector-specific behavior or broader market dynamics. This can provide insights into the structure of the market and potential areas of risk or opportunity.
     """)
     years = data.index.year.unique()
+    def plot_network_for_year(data_for_year, year, threshold):
         corr_matrix = data_for_year.corr()
         G = nx.Graph()
         for ticker in tickers:
             G.add_node(ticker)
+        #threshold = 0.5
         for i in range(len(tickers)):
             for j in range(i+1, len(tickers)):
                 if abs(corr_matrix.iloc[i, j]) > threshold:
         )
         layout = go.Layout(
+            title=f'Asset Correlation Network for {year}',
             showlegend=False,
             hovermode='closest',
             margin=dict(b=20, l=5, r=5, t=40),
     for year in years:
         data_for_year = data[data.index.year == year]
+        plot_network_for_year(data_for_year, year, correlation_threshold)
 hide_streamlit_style = """
 <style>
 footer {visibility: hidden;}
 </style>
 """
+st.markdown(hide_streamlit_style, unsafe_allow_html=True)