Job Summary
As a Data Trainer for our leading pharmaceutical technology company, you will play a crucial role in developing, curating, and optimizing datasets that support the training and improvement of AI and machine learning models across drug discovery, clinical research, and regulatory compliance. The ideal candidate is detail-oriented, possesses strong technical and analytical skills, and is passionate about data quality and compliance in the pharmaceutical sector. You will work closely with cross-functional teams to ensure our AI systems deliver accurate, compliant, and high-impact results for pharmaceutical research and operations.
Must-Haves:
We are seeking a candidate with hands-on experience in data annotation, cleaning, and preparation, particularly for AI or machine learning projects in healthcare or the life sciences. You should have proficiency in Python and data science tools, familiarity with clinical trial data, health records, or genomic datasets, and a strong understanding of regulatory requirements such as FDA, EMA, or GMP standards. Excellent communication skills and a commitment to data quality, integrity, and security are essential. Prior experience in the pharmaceutical industry or regulated environments is highly valued.
Responsibilities
- Curate, clean, and annotate datasets for training AI and machine learning models used in pharmaceutical research, clinical trials, and regulatory reporting.
- Develop and maintain data pipelines to support continuous model improvement and retraining cycles.
- Review and validate AI model outputs, providing feedback and corrections to enhance performance and compliance.
- Collaborate with data scientists, engineers, and subject matter experts to align data training with business goals and regulatory standards.
- Document data processes, annotation guidelines, and quality standards for internal and audit purposes.
- Monitor data quality and model performance, proactively identifying and resolving issues.
- Stay current with industry best practices, regulatory changes, and new tools relevant to data training in pharmaceuticals.
Required Skills
- Proficiency in Python and experience with data science libraries (e.g., pandas, NumPy).
- Experience with data annotation, cleaning, and preparation for AI/ML, preferably with pharmaceutical or clinical data.
- Familiarity with regulatory requirements (FDA, EMA, GMP) and data security best practices in the pharmaceutical industry.
- Strong analytical, problem-solving, and organizational skills.
- Excellent communication and documentation abilities.
- Ability to work independently in a remote, distributed team environment.
Required Years of Experience
- Minimum 2–3 years of experience in data training, data science, or related fields, with at least 1 year focused on AI/ML data preparation or annotation in healthcare, life sciences, or pharmaceuticals.
Required Education
- Bachelor’s degree in data science, Computer Science, Life Sciences, Bioinformatics, or a related field.
- Advanced degrees or relevant certifications are advantageous.
About Us:
Contemporary Staffing Solutions (CSS) is a trusted leader in providing contract, temporary, temp-to-hire, and direct hire staffing solutions. With decades of experience, we’ve grown from a staffing agency to a nationwide provider of workforce management solutions. Our niche recruitment expertise spans Accounting & Finance, Call Center & Office Support, Human Resources, Sales & Marketing, and Information Technology.
Explore more about CSS and how we connect great talent with exceptional opportunities by visiting www.ContemporaryStaffing.com.
#LI-TEC
#LI-AM1