CAPE Analytics provides instant property intelligence for buildings across the United States. CAPE Analytics enables insurers and other property stakeholders to access valuable property attributes at time of underwriting, with the accuracy and detail that traditionally required an on-site inspection, but with the speed and coverage of property record pre-fill. Founded in 2014, CAPE Analytics is backed by leading venture firms and innovative insurers and is comprised of computer vision, data science, and risk analysis experts.
A BIT ABOUT USSince our founding in 2014, CAPE Analytics has used machine learning and computer vision to pioneer a new form of property information, built specifically for the organizations that finance, protect, and invest in our homes and businesses. Our 50+ (and rapidly growing!) clients across insurance and real estate are leading a digital transformation to secure properties and livelihoods in the face of complex trends in housing and climate.
THE OPPORTUNITYAs a Data Analyst (Ground Truth) on the Machine Learning team, your key roles and responsibilities will center around collection of high quality ground truth labels for the purpose of training Cape’s industry leading Deep Learning models.
WITHIN 1-3 MONTHS, YOU’LL
- Understand Cape’s business model and Products.
- Understand the tools and technology used for GT collection at Cape.
- Participate in creating and updating taxonomies for machine learning models. Create documentation for taxonomies. Train ground truth labelers and provide guidance throughout the data collection process.
- Create and run Ground Truth campaigns.
- Evaluate ground truth contractors and provide them feedback to keep high quality standards.
- Deliver high quality GT (ground truth) labels to the machine learning engineers and iterate on label collection based on feedback.
- Ad hoc analysis of campaign data to track progress of campaign, contribution by individual workers and communicate data-driven strategy to internal and external stakeholders.
- Effective utilization of resources in the off-shore team for meeting data collection requirements of multiple projects in a timely manner.
WITHIN 3-6 MONTHS, YOU’LL
- Design and implement methods to quantify and improve ground truth data accuracy in collaboration with the data scientists.
- Leverage the feedback from labelers to improve the taxonomy definitions, and collaborate with the engineering team to improve the tools for data collection and management.
- Build and support visualization and exploration capabilities around our data sets.
- Triage and report bugs throughout our data pipeline.
- Take ownership of communicating changes to the appropriate end-users.
WITHIN 6-12 MONTHS, YOU’LL
- Take mental ownership of our ground truth pipeline and help us extend its functionality to support the development of innovative new products.
- Maintain comprehensive documentation of data, definitions, tables, and schemas across multiple systems.
- Design and experiment new ways for more accurate and efficient ground truth generation.
THE SKILL SET
- BS required, MS preferred in Statistics, Analytics, Computer Science or related STEM fields.
- 2-3 years of experience as a Ground Truth Data Analyst.
- Excellent critical thinking, troubleshooting and analytical problem-solving abilities.
- Excellent verbal and written communication skills.
- Must be able to create clear documentations, communicate with off-shore contractors and with multiple teams at Cape.
- Solid foundation in Statistics and Data Analysis.
- Coding Skills: Python, SQL required.
- Ability to work with and direct a team of data labelers.