Creospan is a growing tech collective of makers, shakers, and problem solvers, offering solutions today that will propel businesses into a better tomorrow. “Tomorrow’s ideas, built today!” In addition to being able to work alongside equally brilliant and motivated developers, our consultants appreciate the opportunity to learn and apply new skills and methodologies to different clients and industries.
***NO C2C/3RD PARTY. LOOKING FOR W2 CANDIDATES ONLY. Must be able to work in the US without sponsorship now or in the future.
Support efforts to perform data mitigations on large-scale datasets (image, video, text) used by FAIR research teams. The goal is to proactively mitigate potential risks associated with these datasets.
Job Responsibilities:
Preprocessing: converting original datasets into a format that can be consumed by mitigation pipelines.
Filtering: running the converted datasets through Integrity's pipeline.
Post-processing: using the filtering results to filter the original datasets, then repackaging and re-ingesting them.
Optimization: identifying optimization opportunities and improving the process.
Skills:
Software Engineering: writing scripts to automate file processing and data transfer, and creating tools to improve productivity and streamline workflows.
Data Management: building data pipelines; data processing and cleaning, transformation and formatting, quality control and validation.
Communication: effective communication skills for collaborating with stakeholders and team members.