Reality Defender is a rapidly growing technology startup building the future of deepfake detection technology. Reality Defender seeks a data lead. You’d work on product-oriented dataset for synthetic (deep-fake) media (image, video, audio) detection and tackle cutting-edge deep learning, audio and computer vision data problems with an emphasis on classification and adversarial methods.
Responsibilities
Lead and build scalable datasets and their delivery pipelines
Closely interface with R&D team for machine (deep) learning model training and evaluation
Build data collection and processing pipelines
Develop at-scale data extraction, cleaning, and labeling, including human annotation methodology
Define methodology to ingest new datasets -- research-based, and commercial
Define metrics for dataset imbalance, visualization, and data sampling
Automate data quality control and content moderation
Help hire data scientists and engineers
Qualifications
Proficient in software development, esp. Python.
Interest in data exploration, visualization, cleaning, and analytics for real-world data modeling.
Familiarity with audio and video file formats, and codecs.
Solid understanding of linear algebra, statistics and machine learning concepts.
Experience working with very large databases and data analysis tools/libraries, for example, SQL, Pandas, etc.
Proven hands-on leader and team player with a positive attitude, sincerity, and good communication skills.
Smart, driven, and passionate about helping Reality Defender change the world.
Competencies
Attention to Detail — Requires being careful about detail and thorough in completing tasks.
Analytical Thinking — Requires analyzing information and using logic to address work-related issues and problems.
Independence — Requires developing one's own ways of doing things, guiding oneself with little or no supervision, and depending on oneself to get things done.
Initiative — Requires a willingness to take on responsibilities and challenges.
Achievement/Effort — Requires establishing and maintaining personally challenging achievement goals and exerting effort toward mastering tasks.