Scientific and invention-focused datasets designed to train AI systems in discovery-driven reasoning, experimentation logic, and innovation modeling.