Career Area:
Technology, Digital and Data
Job Description:
Your Work Shapes the World at Caterpillar Inc.
When you join Caterpillar, you're joining a global team who cares not just about the work we do – but also about each other. We are the makers, problem solvers, and future world builders who are creating stronger, more sustainable communities. We don't just talk about progress and innovation here – we make it happen, with our customers, where we work and live. Together, we are building a better world, so we can all enjoy living in it.
Job Duties:
Caterpillar uses quantitative techniques to solve problems. Typical problems include maximizing operating profits by reducing costs; optimizing
supply chains and logistics; assurance of supply; forecasting procurement trends; analyzing supplier health; predicting quality related metrics.
The Data Scientist II demonstrates thorough knowledge of statistical approaches, data management techniques, and/or related digital technologies, and the ability to handle complex issues. The incumbent demonstrates very good communication and presentation skills, being able to explain conclusions to customers who have limited knowledge and experience with quantitative analytical methods. As an individual contributor on teams, they should also exhibit
strong initiative and teamwork skills, and a comprehensive knowledge of Caterpillar Inc., its products and services; its internal systems, processes, and
procedures; and the external environment in which it competes.
BACKGROUND/EXPERIENCE:
Typically requires a Bachelor’s degree, preferably in computer science, engineering, statistics, economics, mathematics, or a similar quantitative field and 4-5 years of professional experience utilizing quantitative analysis, a Master’s degree and 2-3 years of experience, or a PhD in one of the associated fields. Incumbent will have exhibited the ability to work on increasingly more complex assignments .demonstrated teamwork, initiative, interpersonal skills, and the ability to communicate effectively. Must have expertise in Python, SQL and industry standard statistics and data visualization packages. Experience in distributed computing like Spark, Cloud technologies like AWS, agile methodology,
version control like Git are desired.
MUST HAVE:
• Hands-on experience in writing extensive SQL and Python Code
• Proficiency with modern data platforms such as Snowflake
• Experience with data transformation tools such as dbt (data build tool)
• Experience with CI/CD pipelines, GitHub and DevOps practices
DESIRABLE QUALIFICATIONS:
• Working knowledge of containerization (Docker) and orchestration for model deployment
• Familiarity with graph databases (Neo4j) and knowledge graph construction for analytics
• Practical experience with Deep Learning frameworks (PyTorch, TensorFlow) and model lifecycle management (MLflow, SageMaker)
• Exposure to retrieval-augmented generation (RAG) patterns and vector search techniques
• Building and deploying Large Language Models (LLMs) and Agentic AI solutions
• Understanding of procurement, supply chain, or manufacturing domains within a large enterprise environment
Skill Descriptors
Business Statistics: Knowledge of the statistical tools, processes, and practices to describe business results in measurable scales; ability to use statistical tools and processes to assist in making business decisions.
Level Working Knowledge:
• Explains the basic decision process associated with specific statistics.
• Works with basic statistical functions on a spreadsheet or a calculator.
• Explains reasons for common statistical errors, misinterpretations, and misrepresentations.
• Describes characteristics of sample size, normal distributions, and standard deviation.
• Generates and interprets basic statistical data.
Accuracy and Attention to Detail: Understanding the necessity and value of accuracy; ability to complete tasks with high levels of precision.
Level Extensive Experience:
• Evaluates and makes contributions to best practices.
• Processes large quantities of detailed information with high levels of accuracy.
• Productively balances speed and accuracy.
• Employs techniques for motivating personnel to meet or exceed accuracy goals.
• Implements a variety of cross-checking approaches and mechanisms.
• Demonstrates expertise in quality assurance tools, techniques, and standards.
Analytical Thinking: Knowledge of techniques and tools that promote effective analysis; ability to determine the root cause of organizational problems and create alternative solutions that resolve these problems.
Level Working Knowledge:
• Approaches a situation or problem by defining the problem or issue and determining its significance.
• Makes a systematic comparison of two or more alternative solutions.
• Uses flow charts, Pareto charts, fish diagrams, etc. to disclose meaningful data patterns.
• Identifies the major forces, events and people impacting and impacted by the situation at hand.
• Uses logic and intuition to make inferences about the meaning of the data and arrive at conclusions.
Machine Learning: Knowledge of principles, technologies and algorithms of machine learning; ability to develop, implement and deliver related systems, products and services.
Level Working Knowledge:
• Completes specific tasks and initiatives utilizing machine learning technologies, such as search engine optimization.
• Utilizes specific tools and techniques to process descriptive and inferential statistics.
• Applies specific computing languages and tools in machine learning, such as R and Python.
• Explores to use machine learning in one own areas to make business improvements.
• Conducts data mining and cleaning initiatives.
Programming Languages: Knowledge of basic concepts and capabilities of programming; ability to use tools, techniques and platforms in order to write and modify programming languages.
Level Working Knowledge:
• Participates in the implementation and support of specialized programming languages.
• Conducts basic reviews on writing a specific programming language within a specific platform.
• Assists with the design and development of specialized programming languages.
• Follows an organization's standards, policies and guidelines for structured programming specifications.
• Diagnoses and reports minor or routine programming language problems.
This role is expected to be hybrid for the near future with limited days in the office every month and flexible hours.
Relocation is available for this position.Visa Sponsorship is not available for this position.
Posting Dates:
March 18, 2026 - March 26, 2026
Caterpillar is an Equal Opportunity Employer. Qualified applicants of any age are encouraged to apply
Not ready to apply? Join our Talent Community.