Data Engineer with EPIC - Contract Role - Remote at Remote, Remote, USA |
Email: [email protected] |
http://bit.ly/4ey8w48 https://jobs.nvoids.com/job_details.jsp?id=2154633&uid= From: Raghu Prasad, Blue Ocean Ventures [email protected] Reply to: [email protected] Hi, Role: Data Engineer Remote NOTE : Location: - This position is remote and the candidate's location in the Greater Boston Area or the Bay Area is preferred. Industry Knowledge (Preferred): Experience in life sciences or related sectors is preferred, but not required. What is in it for you As an Senior Data Engineer, you will be a part of an Agile team to provide your deep expertise in data engineering and data product development. Responsibilities: - Data Pipeline Development: Design, build, and maintain scalable, reliable, and idempotent data pipelines to support the ingestion, processing, and analysis of data. Preferred, but not required, experience would involve drug discovery or clinical development data. Data Integration: Integrate diverse and multi modal data sources, including structured and unstructured data, to create unified data environments that facilitate our software product innovation. Enable full provenance of data from end-user, through pipeline, back to source data. Data Product Development: Collaborate with AI and product development teams to create and enhance data products. This includes designing data architectures and services architectures that support advanced AI models and ensuring data quality, consistency, and availability. Data Management: Implement best practices for data governance, security, and compliance, particularly in handling sensitive data. Ideally, the candidate would have pharmaceutical-specific experience ensuring all systems comply with industry regulations (e.g., FDA, EMA). Performance Optimization: Optimize and orchestrate cloud-based workflows, database performance, and storage strategies to handle large datasets efficiently Collaboration & Leadership: Work closely with cross-functional teams, including data scientists, AI engineers, and domain experts, to understand data requirements and deliver solutions that meet business objectives. Growth Oriented: Stay updated with the latest advancements in data engineering, cloud technologies, and LLMs to continually improve our data infrastructure and capabilities. Skills: - We are looking for an experienced Data Engineer with deep expertise in data engineering and data product development. The ideal candidate will have strong expertise in handling large, complex datasets used across the biopharma and life sciences industry. You will work closely with data scientists, software engineers, and key stakeholders to build and optimize data pipelines, ensuring that our software platform is robust, scalable, and secure. Mandatory skills Education: Bachelors or Masters degree in Computer Science, Data Engineering, Information Systems, or a related field 7+ years of experience in data engineering roles, working with large complex data preferably in the pharmaceutical or life sciences domain. Proven experience working with drug development data, including clinical trials, preclinical studies, and regulatory submissions. Experience in developing data products and infrastructure to support AI applications. Managing data pipelines in a variety of environments, and dealing with evolving schemas of source data Designing and optimizing scalable data pipelines to efficiently process and manage large datasets (100+ million records) Proficiency in programming languages such as Python, Pyspark, and SQL. Expertise in data engineering platforms such as Databricks, Snowflake, DBT and their underlying functions Strong SQL skills and experience with relational databases (e.g., PostgreSQL) Experience with cloud platforms (e.g., AWS preferred) and infrastructure-as-code tools (e.g., Terraform, CloudFormation). Familiarity with containerization and orchestration tools like Docker and Kubernetes. Knowledge of data governance frameworks and compliance with pharmaceutical industry regulations. Excellent problem-solving skills with a focus on practical solutions. Enthusiasm for continuous learning and professional growth. A passion for exploring new technologies, frameworks, and software development methodologies. Embraces rapid prototyping with an emphasis on user feedback Autonomous and excited about taking ownership over major initiatives. Strong communication skills, capable of conveying complex technical concepts to both technical and non-technical stakeholders. Strong collaboration skills, with a demonstrated ability to work effectively in cross-functional teams Preferred Qualifications: Experience with data engineering in drug discovery or development Knowledge of LLMs, specifically embedding Experience integrating at-scale with a ML platform (such as AWS Sagemaker) as part of a data workflow Experience working with unstructured document data (PDFs, images Keywords: artificial intelligence machine learning information technology Data Engineer with EPIC - Contract Role - Remote [email protected] http://bit.ly/4ey8w48 https://jobs.nvoids.com/job_details.jsp?id=2154633&uid= |
[email protected] View All |
07:54 PM 07-Feb-25 |