
Published: Wed, 23 Jul 2025 08:23:08 GMT
Position: Software Engineer, Data Infrastructure & Acquisition
Company: Speechify
Location: [Insert Location]
Type: Full-time
Overview:
Speechify is seeking a highly skilled and experienced Software Engineer to join our AI team. As a Software Engineer for our Data Infrastructure & Acquisition side, you will be responsible for managing all aspects of data collection to support our model training operations. Our team is dedicated to building high-quality datasets at petabyte-scale and low cost through a seamless integration of infrastructure, engineering, and research work.
Key Responsibilities:
– Proactively seek out new sources of audio data and integrate them into our ingestion pipeline
– Manage and expand our cloud infrastructure for the ingestion pipeline, currently operating on GCP and managed with Terraform
– Collaborate closely with our Scientists to optimize cost, throughput, and quality to deliver richer data at a larger scale and lower cost for our next-generation models
– Work closely with the AI Team and Speechify Leadership to develop and execute the AI Team’s dataset roadmap, powering our next-generation consumer and enterprise products
Qualifications:
– BS/MS/PhD in Computer Science or a related field
– Minimum of 5 years of industry experience in software development
– Proficiency in bash/Python scripting in Linux environments
– Strong understanding of Docker and Infrastructure-as-Code concepts, with professional experience working with at least one major Cloud Provider (GCP preferred)
– Experience with web crawlers and large-scale data processing workflows is a plus
– Ability to manage multiple tasks and adapt to changing priorities
– Excellent communication skills, both written and verbal
If you have a passion for data and a strong background in software engineering, we want to hear from you! Join our team at Speechify and help us revolutionize the way people interact with technology. Apply now and take the next step in your career. Apply link