- Client is a Texas State Agency located in Austin, TX.
- Position is REMOTE from anywhere in the State of Texas.
- 12-month Contract, open to C2C.
- Qualified candidates must meet or exceed the years of experience listed for the required skills.
Seeking a Data Scientist to perform complex data analysis and develop robust data solutions within an Enterprise Data Warehouse (EDW) environment. The ideal candidate will design, develop, and test ETL processes, ensuring data quality, accuracy, and reliability across various reporting and performance tracking initiatives. This role includes extracting, cleansing, and profiling data from multiple sources to support consistent and meaningful business insights.
Key Responsibilities:
- Analyze and assess customer data from a case management system for quality issues impacting performance reporting.
- Evaluate and improve current analytics models used to generate, merge, and process data to produce reliable performance outcomes.
- Design and implement ETL processes using Informatica, ensuring alignment with data governance and quality standards.
- Document and define technical architecture, data flows, and data relationships.
- Develop detailed written reports summarizing assessment methodology, gap analysis, findings, conclusions, and specific recommendations for improvement.
- Engage with stakeholders and leadership to communicate methodology, findings, recommendations, and implementation timelines.
- Support performance data reporting processes, including federal data submission formats such as PIRL for systems like WIPS.
- Perform additional duties as needed to maintain and enhance EDW operations.
Required Skills & Experience:
- 8+ years of experience in designing and developing ETL processes using Informatica, including expertise in data management, data profiling, and quality assurance.
- 8+ years of experience with enterprise data warehouse development and testing (relational and dimensional), preferably using Oracle.
- 8+ years of hands-on experience in data science including algorithm development, model utilization, data mining, error analysis, and data validation.
- 8+ years of experience in data modeling, particularly in warehouse environments.
- 8+ years of experience using Informatica Cloud.
- 3+ years of experience identifying and resolving data mismatches in case management systems.
Preferred Skills & Experience:
- 5+ years of experience working with multiple RDBMS platforms (e.g., DB2, SQL Server, Oracle).
- 5+ years of experience using data modeling tools such as Erwin.
- 3+ years of experience with Amazon Web Services (AWS).
- 2+ years of experience resolving data integrity issues in case management systems.