Data Engineer (Machine Learning/Predictive Analytics)
- San José
- Permanente
- Tiempo completo
- Refine and improve core analytic products and services including Connex segmentation platform and predictive modeling capabilities.
- Manage and implement database updates on periodic refreshes to key data sets including the Simmons/MRI Survey of the American Consumer.
- Implement new analytic product updates including deployment of Connex Targets builds and its integration into production systems such as Connex Cluster Explorer.
- Load, process, and provide analysis on a wide range of data for analytic use within Enterprise databases such as Snowflake, SQL Server and AWS cloud environments.
- Assist and support execution of analytic projects including client Connex Interactive Profile projects and custom predictive models.
- Assist and support for the development and maintenance of the Connex product line including the Connex V8.0 development road map.
- Develop and refine interactive visual dashboards and applications using Tableau and other dashboarding and reporting tools. • Execution of complex SQL queries supporting list counts, extracts and fulfillment.
- Assist with processing and automation of analytic data solutions including data ingestion, cleansing, enhancement, transformation, data preparation, model building, model deployment and model operations.
- Help build, deploy and manage predictive models leveraging best practices for the model management lifecycle.
- Contribute to analytic data product development including product. enhancements, exploring alternatives, and implementation of prototypes and PoCs.
- Work and collaborate with other Data Engineers, Data Scientists, Product Managers to effectively explore, define, build and deploy analytic solutions and products.
- Create necessary documentation for technical design, process and data flows, project tracking, training material, etc.
- Manage and administer MS SQL Server databases and OLAP applications for SaaS product
- Maintain and support database applications and provide technical support
- Service ad hoc data requests and extracts • Manage operations for the database application including periodic batch jobs and email deployment campaigns.
- Previous workplace experience with at least 5 years of experience required with 3 or more years in similar field.
- College graduate or higher with a degree in data engineering, data analysis, data sciences and/or computational sciences or similarly related technical/analytical foundation.
- Detailed oriented with strong analytical skills.
- Solid background and experience with database management platforms, data processing, ETL and data wrangling.
- Proficiency with using SQL required, and other languages such as python desired.
- Experience or familiarity in the application of machine learning algorithms and predictive analytics to building data and analytic solutions and products is highly desirable.
- Experience with building or maintaining analytic and predictive modeling solutions using cloud-based tools in AWS, Snowflake or 3rd party tools such as H2O Driverless AI is a strong plus.
- Experience using data science, data wrangling and python programming libraries such as scikit-learn, XGBoost, LightGBM, Snowpark or others is a strong plus.
- Good experience and familiarity with Snowflake is desirable including as a database developer or in operations.
- Good experience and familiarity with MS SQL Server is desirable including as DBA, database developer or operations and utilizing such components as T-SQL, SSAS (OLAP) and SSIS (data orchestration).
- Good familiarity with data visualization tools such as Tableau, Excel, Microsoft BI.
- Good familiarity with data cleansing and data quality tools that perform address cleaning, parsing, deduping and matching. • Proficiencies with office productivity suite Microsoft Office (particularly Excel) or other similar suite of tools.
- Ability to create and produce data driven reports that clearly summarize results and insights.
- Work independently and as a part of a team.
- Effective organization skills with the ability to simultaneously work on multiple projects, prioritize, and manage overall workload.
- Experience as a member of an agile (Scrum/Kanban) product development or engineering team is a plus