Commit 351e529 · ahan bose committed · 0 parents

Initial commit: Ahan Bose AI Twin
.gitignore ADDED
@@ -0,0 +1,3 @@
+ .env
+ __pycache__/
+ .streamlit/
app.py ADDED
@@ -0,0 +1,83 @@
+ import os
+ import streamlit as st
+ from dotenv import load_dotenv
+
+ # 1. NEW MODULAR IMPORTS (no 'langchain.chains' needed)
+ from langchain_community.document_loaders import TextLoader, DirectoryLoader
+ from langchain_text_splitters import RecursiveCharacterTextSplitter
+ from langchain_community.vectorstores import FAISS
+ from langchain_huggingface import HuggingFaceEmbeddings, HuggingFaceEndpoint, ChatHuggingFace
+ from langchain_core.prompts import ChatPromptTemplate
+ from langchain_core.runnables import RunnablePassthrough
+ from langchain_core.output_parsers import StrOutputParser
+ from sidebar import show_profile, generate_ai_summary
+
+ load_dotenv()
+
+ st.set_page_config(page_title="Ahan Bose - AI Twin", layout="wide")
+ st.title("🤖 Ahan Bose: AI Digital Twin")
+ # Render the sidebar defined in sidebar.py
+ show_profile()
+
+
+ hf_token = os.getenv("HUGGINGFACEHUB_API_TOKEN")
+
+ @st.cache_resource
+ def setup_vector_db():
+     # Load and split the knowledge-base documents
+     loader = DirectoryLoader('./knowledge_base/', glob="./*.txt", loader_cls=TextLoader)
+     docs = loader.load()
+     splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=200)
+     splits = splitter.split_documents(docs)
+
+     # Embed the chunks and build the FAISS vector store
+     embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2")
+     vectorstore = FAISS.from_documents(splits, embeddings)
+     return vectorstore.as_retriever()
+
+ if not hf_token:
+     st.error("HUGGINGFACEHUB_API_TOKEN is missing. Add it to your .env file.")
+ else:
+     try:
+         retriever = setup_vector_db()
+
+         # 2. SET UP THE LLM
+         llm_endpoint = HuggingFaceEndpoint(
+             repo_id="mistralai/Mistral-7B-Instruct-v0.2",
+             task="conversational",
+             huggingfacehub_api_token=hf_token,
+             temperature=0.5
+         )
+         llm = ChatHuggingFace(llm=llm_endpoint)
+
+         # 3. DEFINE THE PROMPT TEMPLATE
+         template = """You are Ahan Bose's AI Twin. Answer based only on the context provided:
+ {context}
+
+ Question: {question}
+ """
+         prompt = ChatPromptTemplate.from_template(template)
+
+         # 4. THE LCEL PIPE CHAIN (the modern replacement for RetrievalQA)
+         # This builds the chain without needing the legacy 'langchain.chains' module
+         rag_chain = (
+             {"context": retriever, "question": RunnablePassthrough()}
+             | prompt
+             | llm
+             | StrOutputParser()
+         )
+         # AI SUMMARY
+         ai_summary = generate_ai_summary(llm)
+         st.write(ai_summary)
+
+         # 5. UI
+         query = st.text_input("Ask me something:")
+         if st.button("Submit") and query:
+             with st.spinner("Processing..."):
+                 # Simply call invoke on the pipe
+                 response = rag_chain.invoke(query)
+                 st.markdown("### Answer:")
+                 st.write(response)
+
+     except Exception as e:
+         st.error(f"Error: {e}")
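The LCEL chain in app.py composes the retriever, prompt, LLM, and output parser with the `|` operator. As a minimal sketch of what that operator does, here are plain-Python stand-ins (not LangChain's actual `Runnable` implementation; all stage functions below are toy assumptions) where each stage is a callable whose output feeds the next:

```python
# Toy model of LCEL-style composition: `a | b` yields a runnable that
# invokes `a`, then pipes its result into `b`.

class Runnable:
    def __init__(self, fn):
        self.fn = fn

    def invoke(self, value):
        return self.fn(value)

    def __or__(self, other):
        # Compose: run self first, then feed its output into `other`.
        return Runnable(lambda value: other.invoke(self.invoke(value)))

# Toy stages standing in for the real retriever / prompt / llm / parser.
retrieve = Runnable(lambda q: {"context": "Ahan works at Deloitte.", "question": q})
prompt = Runnable(lambda d: f"Context: {d['context']}\nQuestion: {d['question']}")
llm = Runnable(lambda p: f"ANSWER({p!r})")
parse = Runnable(lambda s: s.strip())

chain = retrieve | prompt | llm | parse
print(chain.invoke("Where does Ahan work?"))
```

This is why `rag_chain.invoke(query)` is all the UI needs to call: the dict-of-runnables, prompt, model, and parser are already wired into one pipeline.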
knowledge_base/achievements.txt ADDED
@@ -0,0 +1,4 @@
+ Achievement: CAT 2024 - 98.11 percentile.
+ Achievement: Deloitte - Outstanding Performance & Applause Awards.
+ Achievement: Academic - CNR Rao & MRD Merit Scholarships.
+ Achievement: Extra-Curricular - Best Delegate (VIT MUN) and 3rd in WB Swimming Championship.
knowledge_base/experience.txt ADDED
@@ -0,0 +1,9 @@
+ Experience: Deloitte USI (Consultant)
+ Duration: 35 Months (Jun 2022 - Jun 2025)
+ Key Impact: Promoted for technical leadership; ranked in top 1% of practitioners.
+
+ Experience: Continental Automotive (Intern)
+ Key Impact: Cut test time by 60% via automation of 15+ hardware test scripts.
+
+ Experience: PES MUN Society (Vice President)
+ Key Impact: Led National MUN with 200+ delegates and managed INR 1.14 Lakh budget.
knowledge_base/goals.txt ADDED
@@ -0,0 +1,3 @@
+ Career Goal: Lead enterprise-wide digital transformations.
+ Career Goal: Specialize in finance data engineering and cloud automation.
+ Career Goal: Improve enterprise forecast precision and operational agility through AI.
knowledge_base/projects.txt ADDED
@@ -0,0 +1,11 @@
+ Project: Full-stack Cloud Data Pipelines
+ Role: Consultant
+ Details: Built pipelines for real-time P&L reporting; slashed TAT by 80%.
+
+ Project: Enterprise Profitability Planning
+ Role: Consultant
+ Details: Standardized 50+ master data elements for enterprise-wide planning.
+
+ Project: Diplomat Wars
+ Role: Founder/Vice President
+ Details: Launched intra-collegiate contest for 100+ participants.
knowledge_base/skills.txt ADDED
@@ -0,0 +1,15 @@
+ Skill: Data Engineering & Cloud Pipelines
+ Level: Advanced
+ Used In: Deloitte USI - P&L reporting and financial data flows.
+
+ Skill: Automation (Logic-based)
+ Level: Expert
+ Used In: Automating 10K+ folders and cross-cloud financial data.
+
+ Skill: Financial Analytics
+ Level: Advanced
+ Used In: Inventory cost models, CAPEX visibility, and predictive analytics.
+
+ Skill: Technical Testing (CAN/Hardware)
+ Level: Intermediate
+ Used In: Continental Automotive - Airbag unit testing and script automation.
profile.txt ADDED
@@ -0,0 +1,37 @@
+ Name: Ahan Bose
+ Email: pgp25.ahan@spjimr.org
+
+ Education:
+ - PGDM: S.P. Jain Institute of Management & Research (SPJIMR), Mumbai (Class of 2027)
+ - Bachelor of Technology (B.Tech): PES University (CGPA: 8.61/10)
+ - Class XII: FIITJEE PU College, Karnataka PU Board (87.67%)
+ - Class X: The Frank Anthony Public School, ICSE (93.67%)
+
+ Skills:
+ - Finance & Data Engineering: Full-stack cloud data pipelines, real-time P&L reporting, Power BI, finance planning systems integration
+ - Automation & Analytics: Logic-based automation, predictive analytics, cross-cloud financial data flows, CAPEX visibility
+ - Technical & Testing: SQL, Python, CAN protocol testing, hardware test script automation, RCA diagnostics
+ - Leadership: Stakeholder alignment, mentoring (8+ new hires), budget management (INR 1.14 Lakh), content strategy
+
+ Interests:
+ - FinTech and Data Architecture
+ - Model United Nations (MUN) and Debating
+ - Competitive Swimming
+
+ Projects:
+ - Full-stack Cloud Data Pipelines: Enabled real-time P&L reporting and automated financial data flows at Deloitte USI
+ - Enterprise Profitability Planning: Standardized 50+ master data elements for segment-wide planning
+ - Hardware Test Automation: Automated 15+ test scripts, reducing testing time by 60%
+ - Diplomat Wars: Launched an intra-collegiate contest engaging 100+ participants
+
+ Career Goals:
+ To leverage expertise in finance data engineering and cloud automation to lead large-scale digital transformations and improve enterprise forecast precision and operational agility.
+
+ Achievements:
+ - Professional: Promoted to Consultant at Deloitte USI; Ranked in top 1% of 1000+ practitioners; Outstanding Performance Award
+ - Academic: 98.11 percentile in CAT 2024; MRD and CNR Rao Merit Scholarships
+ - Extra-Curricular: Best Delegate at VIT Model UN; 3rd place in 50m and 100m breaststroke at WB District Swimming
+ Certifications:
+ - CAT 2024 (98.11%ile)
+ - Deloitte USI Consultant Promotion
+ - MRD Merit Scholarship (ECE Department)
requirements.txt ADDED
@@ -0,0 +1,19 @@
+ # Web Interface (Streamlit is a common choice for RAG apps)
+ streamlit
+
+ # RAG Framework
+ langchain
+ langchain-community
+ langchain-huggingface
+
+ # Embeddings and Vector Database
+ sentence-transformers
+ faiss-cpu
+
+ # Document Processing
+ pypdf
+ pandas
+
+ # Environment and API Management
+ python-dotenv
+ huggingface_hub
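Given the files in this commit, a typical local setup would look like the following (a sketch, assuming a Unix-like shell; `hf_xxx` is a placeholder token, not a real credential):

```shell
# Create an isolated environment and install the dependencies listed above.
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

# Provide the Hugging Face token that app.py reads via python-dotenv.
# Replace hf_xxx with your own token from huggingface.co/settings/tokens.
echo "HUGGINGFACEHUB_API_TOKEN=hf_xxx" > .env

# Launch the Streamlit app.
streamlit run app.py
```

The `.gitignore` entry for `.env` keeps the token out of version control.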
sidebar.py ADDED
@@ -0,0 +1,93 @@
+ import streamlit as st
+ from langchain_core.prompts import ChatPromptTemplate
+ from langchain_core.output_parsers import StrOutputParser
+
+
+ def generate_ai_summary(llm):
+     """Reads profile.txt and generates a short (<=50-word) professional summary."""
+     try:
+         with open("./profile.txt", "r") as f:
+             profile_text = f.read()
+
+         # Simple summarization prompt (trailing spaces keep the fragments from running together)
+         prompt = ChatPromptTemplate.from_template(
+             "Summarize the following professional profile into a short paragraph of no more than 50 words. "
+             "Do not reveal the email ID or phone number, and end with a few sentences highlighting key skills, experience, and projects. "
+             "Write in the third person.\n\nProfile: {text}"
+         )
+
+         # Fast LCEL chain
+         summarizer = prompt | llm | StrOutputParser()
+         return summarizer.invoke({"text": profile_text})
+     except Exception:
+         return "AI Summary currently unavailable. Update profile.txt to enable."
+
+
+
+ def show_profile():
+     # --- CUSTOM CSS STYLING ---
+     st.markdown("""
+         <style>
+         /* Target the sidebar container */
+         [data-testid="stSidebar"] {
+             background-color: #f1f3f5 !important; /* Slightly darker grey for depth */
+             border-right: 2px solid #dee2e6;
+         }
+
+         /* Force all text in the sidebar to be dark grey/black */
+         [data-testid="stSidebar"] .stText,
+         [data-testid="stSidebar"] p,
+         [data-testid="stSidebar"] li,
+         [data-testid="stSidebar"] span {
+             color: #212529 !important; /* Professional dark grey */
+             font-weight: 400;
+         }
+
+         /* Style the subheaders specifically */
+         [data-testid="stSidebar"] h2,
+         [data-testid="stSidebar"] h3 {
+             color: #0d6efd !important; /* Blue for headers */
+             font-weight: 700 !important;
+         }
+
+         /* Style the profile name */
+         .profile-name {
+             font-size: 26px;
+             font-weight: 800;
+             color: #1a73e8 !important;
+             text-align: center;
+             margin-bottom: 10px;
+         }
+
+         /* Style the AI summary box */
+         .stInfo {
+             background-color: #ffffff !important;
+             color: #212529 !important;
+             border: 1px solid #ced4da !important;
+         }
+         </style>
+     """, unsafe_allow_html=True)
+
+     # --- SIDEBAR CONTENT ---
+     with st.sidebar:
+         # Headshot image
+         st.image("https://media.licdn.com/dms/image/v2/D5603AQHnuwh4mMnwYg/profile-displayphoto-crop_800_800/B56ZjcS71_HUAI-/0/1756042608528?e=1772064000&v=beta&t=yer-pM8z72mJMF7Yg_nGDSeNCAT3YD2ybpj__AmxKaI", width=150)
+         st.markdown('<p class="profile-name">Ahan Bose</p>', unsafe_allow_html=True)
+
+         st.write("📍 **Mumbai, India**")
+         st.write("💼 **SPJIMR MBA**")
+
+         st.divider()
+
+         st.subheader("About Me")
+         st.caption("""
+         I build intelligent systems using LangChain and Hugging Face.
+         This Digital Twin is powered by a RAG pipeline to answer
+         questions about my career and projects.
+         """)
+
+         st.divider()
+
+         st.subheader("Connect")
+         st.markdown('<a href="https://www.linkedin.com/in/ahan-bose-spjimr" class="social-badge">LinkedIn</a>', unsafe_allow_html=True)
+
synthetic_data/synthetic_projects.txt ADDED
@@ -0,0 +1,35 @@
+ Professional Case Study 1: Real-Time Financial Digital Twin for Enterprise P&L
+
+ Context
+ A global professional services firm faced delays in financial visibility due to fragmented finance systems and batch-based reporting. Leadership required a near-real-time view of profitability across service lines to improve forecasting accuracy and decision-making.
+
+ Approach
+ As part of the finance data engineering team, I designed a full-stack cloud data pipeline that ingested transactional, budgeting, and forecast data from multiple source systems. The solution standardized financial master data and enabled cross-cloud data synchronization. Automated validation checks and reconciliation logic ensured data accuracy before downstream consumption.
+
+ Outcome
+ The digital twin enabled real-time P&L reporting with significantly reduced manual intervention. Finance leaders gained faster insights into margin movements and cost drivers, improving forecast precision and enabling proactive cost optimization. The solution became a reference architecture for similar implementations across other business units.
+
+ Professional Case Study 2: Enterprise Profitability Planning & Forecast Standardization
+
+ Context
+ A large enterprise struggled with inconsistent profitability planning due to non-standard master data definitions across regions and business segments. This led to forecast mismatches and prolonged planning cycles.
+
+ Approach
+ I supported the design and implementation of a centralized profitability planning framework. Over 50 master data elements (covering cost centers, revenue categories, and allocation drivers) were standardized and integrated into the finance planning system. Automated data quality rules were embedded to flag inconsistencies at source.
+
+ Outcome
+ The standardized planning model reduced forecast variance and shortened planning cycles. Stakeholders gained confidence in scenario analysis outputs, enabling faster strategic decisions during quarterly reviews. The initiative materially improved enterprise-wide financial governance.
+
+ Synthetic Project Descriptions
+
+ Project 1: Cloud-Based Financial Digital Twin Architecture
+
+ Designed a scalable cloud data architecture to mirror enterprise financial operations in near real time. The solution integrated actuals, forecasts, and budgets to simulate financial outcomes under different business scenarios, supporting leadership decision-making and financial stress testing.
+
+ Project 2: Automated CAPEX Visibility & Forecast Analytics
+
+ Built an automated CAPEX tracking and analytics layer using SQL and Python to consolidate spend data across projects. Implemented predictive analytics to flag potential overruns, improving capital allocation discipline and reducing forecast surprises.
+
+ Project 3: Cross-Cloud Financial Data Orchestration
+
+ Developed logic-based automation to orchestrate financial data flows across multiple cloud environments. The system ensured consistency between finance planning tools and reporting dashboards, reducing manual reconciliations and improving reporting reliability.
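Project 2 above describes CAPEX overrun flagging only in prose. A minimal illustrative sketch of the core idea in Python follows; the project names, figures, run-rate projection, and 5% tolerance are all hypothetical assumptions, not the actual implementation:

```python
# Flag projects whose projected total spend exceeds budget by a tolerance.
# All data and the 5% default threshold below are illustrative only.

def projected_spend(actual_to_date: float, pct_complete: float) -> float:
    """Naive run-rate projection: scale spend-to-date to 100% completion."""
    return actual_to_date / pct_complete

def flag_overruns(projects, tolerance=0.05):
    """Return (name, projection) pairs for projects trending over budget."""
    flagged = []
    for name, budget, actual, pct in projects:
        projection = projected_spend(actual, pct)
        if projection > budget * (1 + tolerance):
            flagged.append((name, round(projection, 2)))
    return flagged

portfolio = [
    # (name, budget, actual spend to date, fraction complete)
    ("Plant retrofit", 1_000_000, 600_000, 0.50),  # projects to 1.2M -> over
    ("ERP upgrade", 500_000, 240_000, 0.50),       # projects to 480K -> within
]
print(flag_overruns(portfolio))  # → [('Plant retrofit', 1200000.0)]
```

A production version would of course use richer forecasting than a straight run-rate, but the flag-against-tolerance shape is the same.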
synthetic_data/sythentic_experience.txt ADDED
@@ -0,0 +1,8 @@
+ SYNTHETIC DATA - LEADERSHIP EXPERIENCE
+ Role: Global Cross-Functional Task Force Lead
+ Organization: Deloitte Digital Transformation Group (Synthetic Extension)
+ Experience: Led a team of 10 junior engineers and 4 functional consultants to resolve a critical P&L integration blocker during a high-stakes migration.
+ Actions:
+ - Mentored 5 new hires on cloud data pipeline best practices to accelerate role transitions.
+ - Acted as the primary technical representative in "War Room" sessions, communicating risks directly to C-suite stakeholders.
+ - Standardized cross-functional communication protocols, leading to a 30% reduction in issue resolution time.