Data Science AI Ops Lead
Job Title: Data Science AI Ops Lead
Career Level: E1
AstraZeneca is a global, innovation-driven biopharmaceutical business that focuses on the discovery, development and commercialization of prescription medicines for some of the world's most serious diseases. But we're more than one of the world's leading pharmaceutical companies. At AstraZeneca, we're proud to have a unique workplace culture that inspires innovation and collaboration. Here, employees are empowered to express diverse perspectives and are made to feel valued, energized and rewarded for their ideas and creativity.
We are looking for an AI Ops Lead to join our Data Science & AI team in Chennai. The ideal candidate will have industry experience working in a range of different cloud environments where they devised and deployed large-scale production infrastructure and platforms for data science. The position will involve taking these skills and applying them to some of the most exciting data & prediction problems in drug discovery.
The successful candidate will be part of building a new, close-knit team of deeply technical experts and together have the chance to create tools that will advance the standard of healthcare, improving the lives of millions of patients across the globe. This platform will support major AI initiatives such as clinical trial data analysis, knowledge graphs, imaging & omics for our therapy areas. You will also have responsibility to help provide the frameworks for data scientists to develop scalable machine learning and predictive models with our growing data science community, in a safe and robust manner.
As a strong software leader and an expert in building complex systems, you will be responsible for inventing how we use technology, machine learning, and data to enable the productivity of AstraZeneca. You will help envision, build, deploy and develop our next generation of data engines and tools at scale. You will be bridging the gap between science and engineering and functioning with deep expertise in both worlds.
• Own the development roadmap to build and operationalise our data science environment, platforms and tooling.
• Support any external opportunities, through close partnership and engagement such as Benevolent.AI collaboration.
• Deployment of systems, applications and tooling for data science on cloud environments.
• Understanding of the necessary guardrails required for different use cases and data sensitivities.
• Adapt standard machine learning methods to best exploit modern parallel environments (e.g. distributed clusters, multicore SMP, and GPU).
• Provide the necessary infrastructure and platform to support the deployment and monitoring of ML solutions in production Optimizing solutions for performance and scalability.
• Liaise with the Data Engineering team to ensure that the platform and the solutions deployment therein benefit from an optimised and scalable data flow between source systems and analytical models
• Implementing custom machine learning code and developing benchmarking capabilities to monitor drift of any analyses over time.
• Understanding of the latest AI webservices and data science tools, from DataBricks to citizen data science tools like Dataiku, C3.AI and Domino. Experience working on regulatory data would be helpful but not essential.
• Liaise with other teams to enhance our technological stack, to enable the adoption of the latest advances in Data Processing and AI
• Being an active member of the Data Science team, you will benefit from, and contribute to, our expanding bank of Data Science algorithms and work efficiently with our data science infrastructure.
• Appreciation of how to optimise predictive models, run in production and monitor. Experience running a service team will be beneficial.
• Testing and assessing the quality of new tools.
• Line management responsibilities as well as team recruitment, training provision and coaching
Candidate Knowledge, Skills and Experience
• BSc in Computer Science or related quantitative field or MSc/Ph.D degree in Computer Science or related quantitative field.
• More than 2 years of experience and demonstrable deep technical skills in one or more of the following areas: machine learning, recommendation systems, pattern recognition, natural language processing or computer vision.
• Experience managing an enterprise platform and service, handling new customer demand and feature requests.
• Strong software coding skills, with proficiency in Python and Scala preferred.
• Significant experience with AWS cloud environments, working knowledge of Google and Azure platforms. Knowledge of Kubernetes, S3, EC2, Sagemaker, Athena, RDS and Glue is essential. Certification in appropriate areas will be viewed favourably.
• Experience with best practice of data transport and storage within cloud system.
• Experience building large scale data processing pipelines. e. g. Hadoop/Spark and SQL.
• Experience provisioning computational resources in a variety of environments.
• Experience with containers and microservice architectures e.g. Kubernetes, Docker and serverless approaches.
• Experience with automation strategies e.g. CI/CD, gitops.
• Use of Data Science modelling tools e.g. R, Python, SAS and Data Science notebooks (e.g. Jupyter).
• Creative, collaborative, & product focused.
• Ability to just get things done.
The role will have line reports and task management responsibilities within project or services may occur.
Department – Data & Analytics, S&EUIT
Science and Enabling Units IT is a global IT capability supporting Drug Research, Drug Development, Product & Portfolio Strategy, Medical Affairs, Finance, HR, Compliance, Legal and Global Business Services. We are organized around 7 key capability areas: Business Partnering, Solution Delivery, Architecture, Application Support, Data & Analytics, Change & Operations, operating out of sites across the US, UK, Sweden, India and Mexico.
Data & Analytics provides analytics and data insight services and solutions critical to the Data & AI/ML emerging strategy and mission of S&EUIT and AZ. D&A is organized into teams specializing in Information Architecture, Data Engineering, Data Visualisation, Knowledge Management, Data Science, Data Analysis and Information Governance.
We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
AstraZeneca embraces diversity and equality of opportunity. We are committed to building an inclusive and diverse team representing all backgrounds, with as wide a range of perspectives as possible, and harnessing industry-leading skills. We believe that the more inclusive we are, the better our work will be. We welcome and consider applications to join our team from all qualified candidates, regardless of their characteristics. We comply with all applicable laws and regulations on non-discrimination in employment (and recruitment), as well as work authorisation and employment eligibility verification requirements.