Location: Primarily Remote. Candidates must be based in San Diego, CA or DC Metro Area for proximity to project and client teams.
Security Clearance Requirement: Applicants must be US citizens and eligible to obtain and maintain an active Secret security clearance or above.
Key Responsibilities
- Data Collection and Processing
- Acquire, clean, and preprocess diverse datasets from various sources.
- Build required infrastructure for optimal extraction, transformation and loading of data from various data sources using CSP managed services and SQL technologies
- Develop and maintain data pipelines to ensure a continuous flow of high-quality data
- Data Migrations & Optimization
- Develop data migration strategies and schemas to lead customer migrations from on-prem to cloud technologies
- Perform data migration activities
- Optimize databases and data warehouses for efficient querying and data storage
- Data Analysis and Visualization
- Perform exploratory data analysis to uncover patterns, trends, and insights.
- Create visualizations and reports to communicate findings effectively to stakeholders both internally and externally
- Collaboration and Documentation
- Collaborate with cross-functional teams, including software engineers, domain experts, and business analysts, to understand requirements and deliver integrated solutions.
- Create and maintain comprehensive documentation for code, algorithms, and models. Ensure that the knowledge is shared and accessible within the team.
- Customer Engagement
- Act on client feedback constructively to improve services and outcomes.
- Continuously seek ways to enhance the overall customer experience.
- Continuous Learning and Innovation
- Stay updated on the latest developments in machine learning, data science, and analytics.
- Drive innovation by proposing and implementing new techniques and technologies.
Qualifications
- Solid understanding and experience with SQL and relational database concepts
- Solid understanding of database technologies, data warehouses, and ETL tools (e.g., MySQL, PostgreSQL, Beam, Airflow, and Kafka).
- Experience with data analysis tools (eg., Jupyter, Colab, Pandas)
- Experience with data visualization tools (eg., Tableau, Looker, PowerBI, Qlik, and SuperSet)
- Previous experience developing data strategies and facilitating data migrations into production systems.
- Experience with cloud platforms (e.g., AWS, Azure, GCP).
- Proficiency in programming languages such as Python, Java, or C++.
- Strong software engineering skills with an emphasis on writing clean, modular, and maintainable code.
- Familiarity with version control systems (e.g., Git) and collaborative development workflows.
- Excellent problem-solving and critical-thinking skills.
- Effective communication skills and ability to work in a collaborative team environment.
Preferred qualifications:
- Bachelor's or advanced degree in Computer Science, Data Science, Machine Learning, or a related field.
- Experience with other database technologies (eg., NoSQL, Graph)
- Google Cloud Professional Cloud Architect
- Google Cloud Professional Database Engineer certification
- Google Cloud Professional Data Engineer
- Experience with additional data processing tools and technologies (e.g., Spark, Hadoop).
- Knowledge of containerization and orchestration tools (e.g., Docker, Kubernetes).