--- title: README emoji: ⚡ colorFrom: purple colorTo: yellow sdk: static pinned: false --- # CSEN 346 – Natural Language Processing (Santa Clara University) Welcome to the official Hugging Face organization for **CSEN 346: Natural Language Processing** at **Santa Clara University**. This space hosts the final course projects developed by students, including: - NLP/AI models - Datasets - Research papers - Evaluation benchmarks - Demo applications The goal is to encourage **open, reproducible, and responsible NLP research** while giving students experience contributing to the research community. --- ## Course Information **Course:** CSEN 346 – Natural Language Processing **Institution:** Santa Clara University **Instructor:** Prof. Oana Ignat **Focus areas:** - Large Language Models - Generative AI - Multimodal - Mutliagents - Multilingual - Responsible and Inclusive AI - Human-centered Evaluation - Applied NLP systems --- ## Project Goals Student projects aim to: - Apply NLP methods to real-world problems - Build reproducible research - Practice research communication - Contribute open resources (models/datasets) - Develop responsible AI systems Projects typically include: - Problem motivation - Dataset description - Model architecture - Training setup - Evaluation methodology - Error analysis - Future work --- ## Repository Structure Each project should include: ### Required - Project README - Research paper (ACL-style) - Model or dataset - Evaluation results ### Recommended - Training scripts - Inference scripts - Model card - Dataset card - Demo notebook - Error analysis --- ## Documentation Standards Projects should clearly describe: **Task** - What problem does the project solve? **Data** - Source - Size - Languages - Limitations **Model** - Architecture - Parameters - Training setup **Evaluation** - Metrics - Baselines - Results **Ethical Considerations** - Bias risks - Limitations - Appropriate use --- ## Responsible AI Guidelines All projects must follow responsible AI practices: - No private or sensitive data - No harmful applications - Clear documentation of limitations - Proper dataset citations - Transparency in evaluation --- ## Academic Integrity All work must follow SCU academic integrity policies. Students must: - Cite all external resources - Clearly indicate use of AI models - Document prompting or fine-tuning approaches - Clearly state team member contributions --- ## Acknowledgment This course emphasizes that: **Research is most valuable when it is shared, reproducible, and benefits others.** By publishing projects here, students contribute to the broader NLP community while building their research portfolio. --- ## License Unless otherwise specified, student projects are released for academic and research purposes only. --- ## Contact Course organization maintained by: **Dr. Oana Ignat** Assistant Professor Computer Science and Engineering Santa Clara University