Junior Data Engineer
Junior Data Engineer
The Data team is seeking a Junior Data Engineer to work on various tasks and projects. The members of Data the team uses rigorous analytics to generate insights that inform content, product, marketing, and business decisions across New York Public Radio. In the meantime, we build systems, infrastructure, data products for collecting, storing, and visualizing data to foster data democracy at New York Public Radio. We work in Python, SQL and we work with technologies like AWS Glue, Lambda, S3 and PySpark.
The main functions of this role include:
- Design and build ETL pipelines by using AWS Glue to collect data from different sources to data warehouses
- Build pipeline integration with our various data products for long-running processes. This will include CRM data ( Salesforce ), payments processors ( Stripe & Springboard ), email ( Salesforce Marketing Cloud), our own digital platforms ( apps and websites Google Analytics data ) and audio CMS data, which comes in log form.
- Identify the room for optimizing relational data storage through design, query optimization, indices, replicas, partitioning, etc.
Build scripts or recurring processes to fully/partially automate labor-intensive workflows of other departments (e.g. Content, Marketing, Fundraising, the Newsroom) as well as for running models and analysis on behalf of the Data team.
Requirements & Qualifications
- 1-2 years hands-on experience in ETL design, implementation and maintenance
- Experience in schema design and data modeling and in writing complex SQL queries to extract data from relational databases (e.g. MySQL, Redshift)
- Experience in the following tools/technology is desired: Python, PySpark, AWS Glue, Lambda, S3, Redshift and Google Cloud Platform
- Comfortable with extensive Python coding. pySpark is a plus.
- Willing to learn any language/tools/frameworks necessary
- Excellent skills in communicating technical knowledge to a broad non-technical audience
- Excellent organizational skills with the ability to manage several projects concurrently and communicate effectively across multiple divisions and with diverse groups of stakeholders
This is a full-time role with a salary range of $80-$85K per year (plus a full benefits package). Salary offer within this range is determined by skills, experience and organizational pay equity. New York Public Radio offers excellent medical, dental, and vision insurance, vacation, personal and sick time as well as parental leave.
This role is currently operating in a primarily remote capacity but the selected candidate will have the option to work from the office if fully vaccinated against COVID-19, unless NYPR has approved a specific religious or medical exemption in advance of the first day of work. At this time, NYPR defines being fully vaccinated as having received two Moderna or Pfizer COVID-19 vaccinations or one Johnson and Johnson vaccination.
This role, if desired by the selected candidate, could remain remote.
Commitment to Diversity, Equity & Inclusion
New York Public Radio is committed to diversity, equity, and inclusion. We continuously strive to place our employees at the center of our thinking and elevate inclusive practices to develop and support a more engaged and productive workforce. Our journalism and operations in the service of that journalism benefit from a broad range of perspectives, from all backgrounds, at all levels of the organization. Diversity is essential to honest, authentic, accurate storytelling and reportage; creating an institution in which all voices are encouraged, valued, and heard.
New York Public Radio is an equal opportunity employer committed to achieving the goal of equal employment opportunity for all. Applicants and employees are considered and evaluated for positions without regard to mental or physical disability, race, creed, color, religion, gender, national origin, citizenship status, age, genetic information, military or veteran status, sexual orientation, marital status, employment status or any protected federal, state or local status unrelated to the performance of the work involved.