CurieTech Inc. Software Engineer, Data Acquisition Sunnyvale, CA · Full time

The Data Acquisition team is responsible for all aspects of collecting data to train our models. The team writes deep crawlers and develops data pipelines that generate high-quality data. It works closely with our AI/ML and Infrastructure teams to ensure that these pipelines solve the problem at hand and are easy to deploy and maintain.

Description

Why Join CurieTech


CurieTech is a Silicon Valley-based startup building cutting-edge AI software to help software development teams be more productive. At Curie you will have the opportunity to work at the forefront of generative AI and agent technology and to build innovative UX approaches for embedding these capabilities in developer workflows. You will work with a close-knit technical team of LLM and AI/ML experts who are innovating at the cutting edge of AI software. The company was founded in 2023 and is backed by reputable Silicon Valley-based venture capitalists.


Job Function


  • Lead and manage engineering projects focused on data acquisition, including web crawling, data ingestion, and API development.
  • Collaborate with cross-functional teams (AI/ML, Infrastructure).
  • Design and implement highly scalable distributed systems to efficiently handle unstructured data.
  • Develop algorithms for data indexing and advanced search functionalities.
  • Build and maintain backend services for data storage, utilizing key-value databases and synchronization methods.
  • Deploy solutions within a Kubernetes Infrastructure-as-Code environment, ensuring system reliability through routine checks.
  • Conduct experiments on data to analyze and enhance system performance.


Qualifications


  • BS/MS/PhD in Computer Science or a related field.
  • 4+ years of industry experience in software development, with a focus on data systems and APIs.
  • Familiarity with large web crawlers is a plus.
  • Strong expertise in large stateful distributed systems and data processing.
  • Proficiency in Kubernetes and Infrastructure-as-Code principles.


We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity, or veteran status.