Design and implement data pipelines
The goal of this objective is to design and implement new data pipelines to keep our business scaling
Run a POC on AWS Glue for cost reduction
The goal of this objective is to cut data processing costs
Create a roadmap for migration from Redshift to Snowflake
The goal of this objective is to remove the overhead of managing the data infrastructure
Improve the accuracy of our NLP models
Develop a new NLP pipeline and benchmark it against OpenAI
Increase the range of tasks our NLP models can handle
Expand the capabilities of our NLP models
Reduce the time and resources required to run our NLP models
Optimize the efficiency of our NLP pipeline
Improve infrastructure
Responsible for collaborating with data scientists and analysts to build and improve data and ML products that drive revenue for the company
Improve model deployment and monitoring
Responsible for deploying, maintaining, and monitoring machine learning models used by the company
Improve data pipeline performance and reliability
Responsible for building performant and reliable data pipelines using tools like Airflow and BigQuery
Improve cloud infrastructure
Responsible for maintaining and improving the cloud infrastructure used by the team