Bioinformatics Data Engineer / Project Lead
Role summary
Seeking a remote, 1099 contractor Bioinformatics Data Engineer / Project Lead for a data standardization and curation project. This hands-on role requires strong Python coding skills, experience with data processing, curation, and standardization across various RNA sequencing data types (scRNA-seq, RNA-seq, perturbation, FPKM). You will operate within the Lamindb platform and extensively use AI tools like Cursor (Claude + MCP) for development and documentation, ensuring accuracy and reproducibility. A key aspect involves applying bioinformatics ontologies for data standardization. The role also includes project leadership responsibilities such as planning, stakeholder communication, and managing blockers, all while adhering strictly to SOPs and maintaining high documentation standards.
We are looking for a hands-on Bioinformatics Data Engineer / Project Lead to support a high-impact data standardization and curation project in a fully remote contractor role.
Contract Opportunity – Bioinformatics Data Engineer / Project Lead (Remote, 1099)
Key Responsibilities
Project Leadership
- Plan and govern delivery with clear expectations, timelines, and stakeholder alignment
- Maintain strict adherence to SOPs and ensure continuous client communication
- Proactively manage blockers and support the team in resolving issues
- Track adjacent workstreams (e.g., data curation) to understand dependencies and interfaces
Hands-on Delivery
- Execute data processing, curation, standardization, and exploration following established SOPs
- Work across scRNA-seq, RNA-seq, perturbation, and FPKM data (additional data types may follow)
- Operate within Lamindb as the core data platform
- Produce highly documented work (code, processes, and data changes)
- Raise and review PRs and ensure strict coding standards
- Write clean, maintainable Python code
AI-Enabled Work
- Work extensively with Cursor (Claude + MCP) for coding, data exploration, and documentation
- Ensure AI-assisted outputs are accurate, reproducible, and SOP-compliant
- Maintain full traceability and documentation of AI-supported work
Ontology Usage
- Apply bioinformatics ontologies to support data standardization and curation
- Ensure correct ontology usage aligned with project standards
- No heavy ontology pipeline setup required
Required Qualifications
- Strong Python coding skills with strict coding standards
- Hands-on experience in data processing, curation, and standardization
- Proven experience using Cursor with Claude/MCP for development and exploration
- Experience working with Lamindb
- Experience with scRNA-seq, RNA-seq, perturbation, and FPKM data
- Strong documentation discipline and PR workflow experience (Git-based)
- Clear and proactive communication with stakeholders
- Strict SOP adherence and structured delivery approach
- Understanding and application of bioinformatics ontologies in data standardization
- Ability to stay aware of adjacent data threads and dependencies
Preferred Skills
Must Have
- Efficient use of Cursor/AI tools (Claude + MCP)
- Experience with Lamindb data atlas workflows
- Experience handling scRNA-seq, RNA-seq, perturbation, and FPKM data
- Applied knowledge of bioinformatics ontologies
Good to Have
- Ability to monitor and coordinate adjacent data curation threads
Contract Details
- 1099 contractor role
- Minimum 3-month assignment with possible extension
- Full-time commitment (40+ hours/week)
- Fully remote