Spaces:

ChatCausalGPT
/

test

Configuration error

App Files Files Community

ChatCausalGPT commited on Jan 26, 2025

Commit

6841f24

1 Parent(s): 1f26669

refactor: simplify project structure

Browse files

Files changed (3) hide show

README.md +7 -73
generate_quality_inspection_reports.py +0 -133
main.py +113 -0

README.md CHANGED Viewed

@@ -1,30 +1,15 @@
-# Quality Inspection Report Generator
 一个用于从 Excel 数据生成质量检验报告的 Python 工具。
-## 功能特点
-- 从源数据 Excel 文件自动生成质量检验报告
-- 支持多个客户的批量处理
-- 保持模板格式和样式
-- 自动计算日期和有效期
-- 完整的化学元素和尺寸数据处理
 ## 系统要求
 - Python 3.6+
 - pandas
 - openpyxl
-## 安装
-1. 克隆仓库：
-```bash
-git clone https://github.com/1587causalai/quality-inspection-report-generator.git
-cd quality-inspection-report-generator
-```
-2. 安装依赖：
 ```bash
 pip install -r requirements.txt
 ```
@@ -32,67 +17,16 @@ pip install -r requirements.txt
 ## 使用方法
 1. 准备输入文件：
-   - 源数据文件（参考 `examples/example_source_data.xls`）：
-     * HEADER 工作表：包含基本信息
-     * Dimension 工作表：包含尺寸数据
-     * Sand 工作表：包含化学元素测试数据
-   - 模板文件（参考 `examples/example_template.xlsx`）：
-     * 包含预设的报告格式
 2. 运行脚本：
 ```bash
-python generate_quality_inspection_reports.py
 ```
-3. 检查输出：
-   - 生成的报告将保存为 `2_updated.xlsx`
-   - 每个客户的数据将保存在单独的工作表中
-## 示例文件
-在 `examples` 目录中提供了示例文件：
-1. `example_source_data.xls`：源数据文件示例
-   - 展示了正确的数据格式和结构
-   - 包含了所有必需的工作表和字段
-2. `example_template.xlsx`：模板文件示例
-   - 展示了标准的报告格式
-   - 包含了所有必需的字段和公式
-您可以参考这些示例文件来准备您自己的输入文件。
-## 数据格式要求
-### 输入文件结构
-1. 源数据文件需要包含：
-   - HEADER：基本信息（数量、批准人等）
-   - Dimension：尺寸数据和检验日期
-   - Sand：化学元素测试数据
-2. 模板文件需要包含：
-   - WACKER：报告模板格式
-### 输出报告格式
-- 基本信息（B3-D5）：数量、批号、日期等
-- 化学元素数据（D9-D19）：11种元素的测试结果
-- 尺寸数据（D20-D28）：外径、高度、壁厚等
-- 批准信息（D29）：批准人姓名
-## 文档
-详细的代码文档请参考 `quality_inspection_documentation.py`。
 ## 许可证
-MIT License
-## 作者
-[Your Name]
-## 贡献
-欢迎提交 Issue 和 Pull Request！

+# 质量检验报告生成工具
 一个用于从 Excel 数据生成质量检验报告的 Python 工具。
 ## 系统要求
 - Python 3.6+
 - pandas
 - openpyxl
+## 安装依赖
 ```bash
 pip install -r requirements.txt
 ```
 ## 使用方法
 1. 准备输入文件：
+   - `1.xls`：源数据文件，包含 HEADER、Dimension 和 Sand 工作表
+   - `2.xlsx`：模板文件，包含 WACKER 工作表
 2. 运行脚本：
 ```bash
+python main.py
 ```
+3. 检查输出文件 `2_updated.xlsx`
 ## 许可证
+MIT License

generate_quality_inspection_reports.py DELETED Viewed

@@ -1,133 +0,0 @@
-import pandas as pd
-from openpyxl import load_workbook
-from datetime import datetime, timedelta
-import os
-def generate_reports(source_file='examples/example_source_data.xls',
-                    template_file='examples/example_template.xlsx',
-                    output_file='quality_inspection_report.xlsx'):
-    """
-    生成质量检验报告。
-    参数:
-        source_file (str): 源数据文件路径
-        template_file (str): 模板文件路径
-        output_file (str): 输出文件路径
-    """
-    # 检查文件是否存在
-    if not os.path.exists(source_file):
-        raise FileNotFoundError(f"源数据文件不存在: {source_file}")
-    if not os.path.exists(template_file):
-        raise FileNotFoundError(f"模板文件不存在: {template_file}")
-    # 读取第一个文件
-    header_df = pd.read_excel(source_file, sheet_name='HEADER')
-    # 读取Dimension表，跳过前12行，然后使用第13行作为列名
-    dimension_df = pd.read_excel(source_file, sheet_name='Dimension', skiprows=12)
-    # 使用第一行作为列名
-    dimension_df.columns = dimension_df.iloc[0]
-    # 删除第一行（现在已经作为列名）并重置索引
-    dimension_df = dimension_df.iloc[1:].reset_index(drop=True)
-    # 读取Sand表的数据
-    sand_df = pd.read_excel(source_file, sheet_name='Sand', header=None)
-    # 读取第二个文件
-    wb = load_workbook(template_file)
-    wacker_sheet = wb['WACKER']
-    # 获取Sales Order Quantity和Quality Assured By
-    sales_order_quantity = header_df.iloc[5, 2]  # Sales Order Quantity位置
-    quality_assured_by = header_df.iloc[3, 7]    # Quality Assured By在第4行最后一列
-    # 定义元素和行号的对应关系
-    element_row_mapping = {
-        'Al': 9,   # 硅柏_石英坩埚_QC530HS_201410_V3B-CN_Al
-        'Ca': 10,  # 硅柏_石英坩埚_QC530HS_201410_V3B-CN_Ca
-        'Cu': 11,  # 硅柏_石英坩埚_QC530HS_201410_V3B-CN_Cu
-        'Fe': 12,  # 硅柏_石英坩埚_QC530HS_201410_V3B-CN_Fe
-        'K': 13,   # 硅柏_石英坩埚_QC530HS_201410_V3B-CN_K
-        'Li': 14,  # 硅柏_石英坩埚_QC530HS_201410_V3B-CN_Li
-        'Mg': 15,  # 硅柏_石英坩埚_QC530HS_201410_V3B-CN_Mg
-        'Mn': 16,  # 硅柏_石英坩埚_QC530HS_201410_V3B-CN_Mn
-        'Na': 17,  # 硅柏_石英坩埚_QC530HS_201410_V3B-CN_Na
-        'Ti': 18,  # 硅柏_石英坩埚_QC530HS_201410_V3B-CN_Ti
-        'Zr': 19   # 硅柏_石英坩埚_QC530HS_201410_V3B-CN_Zr
-    }
-    # 定义元素在Sand表中的列索引
-    element_col_mapping = {
-        'Al': 4,   # 第5列
-        'Ca': 5,   # 第6列
-        'Cu': 6,   # 第7列
-        'Fe': 7,   # 第8列
-        'K': 8,    # 第9列
-        'Li': 9,   # 第10列
-        'Mg': 10,  # 第11列
-        'Mn': 11,  # 第12列
-        'Na': 12,  # 第13列
-        'Ti': 13,  # 第14列
-        'Zr': 14   # 第15列
-    }
-    # 遍历Dimension表格中的每个Customer ID
-    for index, row in dimension_df.iterrows():
-        customer_id = row['Customer ID']  # 现在这个列名应该是正确的了
-        inspection_date = pd.to_datetime(row['Inspection Date']).strftime('%Y-%m-%d')  # 格式化日期
-        # 创建新的工作表
-        new_sheet = wb.create_sheet(title=str(customer_id))
-        # 复制WACKER表格的内容到新工作表（这样会保持原有的客户名称）
-        for row_wacker in wacker_sheet.iter_rows(values_only=True):
-            new_sheet.append(row_wacker)
-        # 填充数据（不再覆盖客户名称）
-        new_sheet['B3'] = str(sales_order_quantity) + ' PCS'  # Number+Unit/数量+单位
-        new_sheet['B4'] = customer_id  # Batch reference/批号
-        new_sheet['D4'] = inspection_date  # Date of issue/报告日期
-        new_sheet['B5'] = inspection_date  # Production date/生产日期
-        new_sheet['D5'] = (datetime.strptime(inspection_date, '%Y-%m-%d') + timedelta(days=730)).strftime('%Y-%m-%d')  # Expiring date/失效日期
-        # 从sand表中获取当前customer_id的数据
-        sand_rows = sand_df[sand_df[2] == customer_id]  # 使用第3列（索引2）作为Crucible ID
-        if not sand_rows.empty:
-            sand_row = sand_rows.iloc[0]
-            # 填充元素数据
-            for element, target_row in element_row_mapping.items():
-                source_col = element_col_mapping[element]
-                new_sheet[f'D{target_row}'] = sand_row[source_col]
-        # 填充Analysis result/分析结果
-        # 保持原有的测试项目名称，只更新分析结果列
-        for i in range(20, 29):
-            if i == 20:
-                new_sheet[f'D{i}'] = row['OD1']  # 外径1
-            elif i == 21:
-                new_sheet[f'D{i}'] = row['OD2']  # 外径2
-            elif i == 22:
-                new_sheet[f'D{i}'] = row['OD3']  # 外径3
-            elif i == 23:
-                new_sheet[f'D{i}'] = row['Height']  # 高度
-            elif i == 24:
-                new_sheet[f'D{i}'] = row['Wall11']  # 壁厚11
-            elif i == 25:
-                new_sheet[f'D{i}'] = row['Wall12']  # 壁厚12
-            elif i == 26:
-                new_sheet[f'D{i}'] = row['Wall13']  # 壁厚13
-            elif i == 27:
-                new_sheet[f'D{i}'] = row['Wall2']  # 壁厚2
-            elif i == 28:
-                new_sheet[f'D{i}'] = row['Wall3']  # 壁厚3
-        # 保持"批准人："文本，并在其后添加名字
-        new_sheet['D29'] = f"批准人：{quality_assured_by}"
-    # 保存修改后的文件
-    wb.save(output_file)
-    print(f"报告已生成: {output_file}")
-if __name__ == "__main__":
-    generate_reports()

main.py ADDED Viewed

	@@ -0,0 +1,113 @@

+import pandas as pd
+from openpyxl import load_workbook
+from datetime import datetime, timedelta
+# 读取第一个文件
+file1 = '1.xls'
+header_df = pd.read_excel(file1, sheet_name='HEADER')
+# 读取Dimension表，跳过前12行，然后使用第13行作为列名
+dimension_df = pd.read_excel(file1, sheet_name='Dimension', skiprows=12)
+# 使用第一行作为列名
+dimension_df.columns = dimension_df.iloc[0]
+# 删除第一行（现在已经作为列名）并重置索引
+dimension_df = dimension_df.iloc[1:].reset_index(drop=True)
+# 读取Sand表的数据
+sand_df = pd.read_excel(file1, sheet_name='Sand', header=None)
+# 读取第二个文件
+file2 = '2.xlsx'
+wb = load_workbook(file2)
+wacker_sheet = wb['WACKER']
+# 获取Sales Order Quantity和Quality Assured By
+sales_order_quantity = header_df.iloc[5, 2]  # Sales Order Quantity位置
+quality_assured_by = header_df.iloc[3, 7]    # Quality Assured By在第4行最后一列
+# 定义元素和行号的对应关系
+element_row_mapping = {
+    'Al': 9,   # 硅柏_石英坩埚_QC530HS_201410_V3B-CN_Al
+    'Ca': 10,  # 硅柏_石英坩埚_QC530HS_201410_V3B-CN_Ca
+    'Cu': 11,  # 硅柏_石英坩埚_QC530HS_201410_V3B-CN_Cu
+    'Fe': 12,  # 硅柏_石英坩埚_QC530HS_201410_V3B-CN_Fe
+    'K': 13,   # 硅柏_石英坩埚_QC530HS_201410_V3B-CN_K
+    'Li': 14,  # 硅柏_石英坩埚_QC530HS_201410_V3B-CN_Li
+    'Mg': 15,  # 硅柏_石英坩埚_QC530HS_201410_V3B-CN_Mg
+    'Mn': 16,  # 硅柏_石英坩埚_QC530HS_201410_V3B-CN_Mn
+    'Na': 17,  # 硅柏_石英坩埚_QC530HS_201410_V3B-CN_Na
+    'Ti': 18,  # 硅柏_石英坩埚_QC530HS_201410_V3B-CN_Ti
+    'Zr': 19   # 硅柏_石英坩埚_QC530HS_201410_V3B-CN_Zr
+}
+# 定义元素在Sand表中的列索引
+element_col_mapping = {
+    'Al': 4,   # 第5列
+    'Ca': 5,   # 第6列
+    'Cu': 6,   # 第7列
+    'Fe': 7,   # 第8列
+    'K': 8,    # 第9列
+    'Li': 9,   # 第10列
+    'Mg': 10,  # 第11列
+    'Mn': 11,  # 第12列
+    'Na': 12,  # 第13列
+    'Ti': 13,  # 第14列
+    'Zr': 14   # 第15列
+}
+# 遍历Dimension表格中的每个Customer ID
+for index, row in dimension_df.iterrows():
+    customer_id = row['Customer ID']  # 现在这个列名应该是正确的了
+    inspection_date = pd.to_datetime(row['Inspection Date']).strftime('%Y-%m-%d')  # 格式化日期
+    # 创建新的工作表
+    new_sheet = wb.create_sheet(title=str(customer_id))
+    # 复制WACKER表格的内容到新工作表（这样会保持原有的客户名称）
+    for row_wacker in wacker_sheet.iter_rows(values_only=True):
+        new_sheet.append(row_wacker)
+    # 填充数据（不再覆盖客户名称）
+    new_sheet['B3'] = str(sales_order_quantity) + ' PCS'  # Number+Unit/数量+单位
+    new_sheet['B4'] = customer_id  # Batch reference/批号
+    new_sheet['D4'] = inspection_date  # Date of issue/报告日期
+    new_sheet['B5'] = inspection_date  # Production date/生产日期
+    new_sheet['D5'] = (datetime.strptime(inspection_date, '%Y-%m-%d') + timedelta(days=730)).strftime('%Y-%m-%d')  # Expiring date/失效日期
+    # 从sand表中获取当前customer_id的数据
+    sand_rows = sand_df[sand_df[2] == customer_id]  # 使用第3列（索引2）作为Crucible ID
+    if not sand_rows.empty:
+        sand_row = sand_rows.iloc[0]
+        # 填充元素数据
+        for element, target_row in element_row_mapping.items():
+            source_col = element_col_mapping[element]
+            new_sheet[f'D{target_row}'] = sand_row[source_col]
+    # 填充Analysis result/分析结果
+    # 保持原有的测试项目名称，只更新分析结果列
+    for i in range(20, 29):
+        if i == 20:
+            new_sheet[f'D{i}'] = row['OD1']  # 外径1
+        elif i == 21:
+            new_sheet[f'D{i}'] = row['OD2']  # 外径2
+        elif i == 22:
+            new_sheet[f'D{i}'] = row['OD3']  # 外径3
+        elif i == 23:
+            new_sheet[f'D{i}'] = row['Height']  # 高度
+        elif i == 24:
+            new_sheet[f'D{i}'] = row['Wall11']  # 壁厚11
+        elif i == 25:
+            new_sheet[f'D{i}'] = row['Wall12']  # 壁厚12
+        elif i == 26:
+            new_sheet[f'D{i}'] = row['Wall13']  # 壁厚13
+        elif i == 27:
+            new_sheet[f'D{i}'] = row['Wall2']  # 壁厚2
+        elif i == 28:
+            new_sheet[f'D{i}'] = row['Wall3']  # 壁厚3
+    # 保持"批准人："文本，并在其后添加名字
+    new_sheet['D29'] = f"批准人：{quality_assured_by}"
+# 保存修改后的文件
+wb.save('2_updated.xlsx')