Responsibilities
Work with clients to gather and process data for execution on Cerebras infrastructure.
Execute data pipeline processes and integrate data into AI model training workflows.
Collaborate with System Engineers and AI specialists to ensure smooth data input and preparation for model training.
Assist in POCs by developing data structures optimized for Cerebras hardware
Qualifications:
Bachelor’s degree in Data Science, Computer Science, or comparable practical experience
Proficiency in data pipeline tools and processing frameworks.
Experience with AI model training data requirements and optimization strategies.
Technical Skills:
Cerebras Software Platform (CSoft): Familiarity with Cerebras CSoft for integrating data pipelines into AI model training workflows on the Cerebras hardware.
Data Processing: Experience with large-scale data processing and preparation for AI/ML models using Apache Spark, Hadoop, or similar frameworks, specifically in the context of Cerebras architecture.
Programming Languages: Strong expertise in Python, (C++ optional), and TensorFlow for creating data inputs and pipelines optimized for the Cerebras platform.
Machine Learning Frameworks: Proficiency in integrating and preparing data for TensorFlow, PyTorch, or ONNX, ensuring smooth execution on Cerebras’ Wafer Scale Engine (WSE).
Cluster Management: Experience with distributed computing and cluster provisioning, particularly for high-performance AI workloads on Cerebras hardware.
Apply here:https://remote.com/jobs/wiser-technology-c155vr96/principal-engineer-data-j1szx5cf