Citrine Informatics

Data Science and Engineering Intern
Redwood City, CA
Innovation Area: Generation and Conversion; Storage, Transmission and Management
Position Type: TomKat Center-supported
Internship Term: 8 weeks

Citrine is the data platform for the physical world. Our platform ingests and analyzes vast quantities of technical data on materials, chemicals, and devices to streamline R&D, manufacturing, and supply chain operations for any organization that produces a physical product. Our users are scientists and engineers at large manufacturing and materials companies, as well as researchers at universities and government labs, and our platform is an essential workflow tool that enables these users to analyze tremendous quantities of technical data.

Position title

Data Science and Engineering Intern


In this internship, you will combine knowledge of physical science with data science and machine learning to build predictive models for various material properties. You will also visualize and communicate results for materials scientists, who are often unfamiliar with machine learning concepts. In addition, you will grow Citrine's vast materials database and use machine learning and other tools to identify possibly spurious or physically unreasonable data points that would degrade models. Interns will present results to the Data Science, Data Operations, and executive leadership teams at Citrine.

Who should apply

Stanford undergraduates who have demonstrated excellence and conscientiousness (i.e., the ability to deliver) in their academic career to date. Candidates who believe in our mission and want to play a role in shaping the future of materials and manufacturing.

Required expertise

Background or strong interest in materials science, physical sciences, and coding.

Preferred skills/majors

  • Pursuing a BS or MS degree in Materials Science, Computer Science, Physics, Chemistry, or a related technical field.
  • Software development experience in Scala/Java, C++, or Python (in order of preference).
  • Ability to strengthen scientific teams and collaborations with proactive, concise, and insightful communication.
  • Familiarity with Apache Spark or other widely used machine learning frameworks is helpful.
  • Experience working in a shared codebase with git is a plus.
  • Curious, self-driven, analytical and excited to explore data.
  • Ability to thrive in a fast-paced work environment.
  • Interest in our vision and a desire to make an impact in a short time are far more important than any specific technical background.

Internship term

8 weeks