Senior Data Engineer

Disney New York, NY 10036 2018-06-09
BAMTECH Media is a place for the creative and the bold. We’re seeking talent across disciplines to join our team. Whether New York City, San Francisco, Manchester or Amsterdam we provide opportunities to elevate your career and transform an industry. BAMTECH software engineers develop premium digital media products for Major League Baseball and our partners. The products we build, such as MLB.TV, NHL.TV, HBO NOW and PlayStation Vue are paving the way for the next-generation media and sport technologies. BAMTECH engineering is headquartered in the Chelsea area of New York, NY with an office in the SoMo area of San Francisco, CA and team members based around the world. If you are interested in joining BAMTECH in the pursuit of not only crafting new media products but enjoying the products you build, we are interested in hearing from you. At BAMTECH data is central to measuring all aspects of the business, and critical to its operations and growth. The data engineering team is responsible for collecting, analyzing and distributing data using public cloud and open source technologies and offers transparency into customer behavior and business performance.
Familiarity with binary data serialization formats such as Parquet, Avro, and Thrift Experience deploying data notebook and analytic environments such as Jupyter and Databricks Knowledge of the Python data ecosystem using pandas and numpy Experience building and deploying ML pipelines: training models, feature development, regression testing Experience with graph-based data workflows using Apache Airflow 3-5 years of experience developing in object orient Python Engineering big-data solutions using technologies like EMR, S3, Spark Loading and querying cloud-hosted databases such as Redshift and Snowflake Building data pipelines using Kafka, Spark, Flink, or Samza Collaborate with product teams, data analysts and data scientists to design and build data-forward solutions Build and deploy streaming and batch data pipelines capable of processing and storing petabytes of data quickly and reliably Integrate with a variety of data metric providers ranging from advertising, web analytics, and consumer devices Build and maintain dimensional data warehouses in support of business intelligence tools Develop data catalogs and data validations to ensure clarity and correctness of key business metrics Drive and maintain a culture of quality, innovation and experimentation 564060