Data Scientist

Open Raven

Worldwide
Full-time
Software Development
python
java
aws

Description

Open Raven is poised to disrupt the red-hot data security market. Built by a team hailing from security businesses such as CrowdStrike, Tenable, and Splunk, and backed by top investors, Open Raven is tackling one of the most foundational and toughest problems for security teams today. Our customers are some of the largest in the world who rely upon Open Raven to protect their sensitive data in the cloud.


Open Raven is a West Coast-native, remote-first company. Positions can reside anywhere in the U.S. Functional teams get together as needed in person and we have two all-team onsites per year.


About the role

Open Raven's primary focus is scanning our customer's data for sensitive information - be that credit card number in the wrong place, developer secrets checked into source code, personally identifiable information alongside healthcare data. Due to the velocity and ubiquity of data in the cloud, understanding that data footprint and how that it is protected is key to stopping, and if not reducing, data leaks and breaches targeting sensitive data.

After all the discovery, scheduling, scanning, and scan management is taken into consideration, the foundational component of Open Raven are our data-classes which we use to describe, identify, and validate the data that is scanned. We current have over 200 of these data-classes, from AWS secret keys to Latvian drivers license numbers, and intend to keep adding to them and improving the ones we have. This role is to join the team building, managing, and analyzing those data-classes, ensuring that they are accurately detecting instances of those data footprints, and investigating scenarios where they are not, minimizing false-positives and false-negatives.


About you

  • Detailed-oriented, and interested in figuring out the ins-and-outs of a data format, for example how it's possible to cross-check where someone is born from their social security number.
  • You’re a data-wrangler, using big data to discover patterns and determine improvements
  • Awareness and expert of at least one form of data matching techniques, from regular expressions, through machine-learnt models, to natural language processing
  • You understand the difference between precision, recall, and F1, why they are important, and how to optimize for them.
  • You stay abreast of the latest technologies and share what you learn with others
  • You are not afraid of failing while experimenting with different technologies, development methodologies, and tools
  • You are fascinated by other cultures and interested building strong relationships with team members across the globe


What you’ll do

  • Work closely with the product team to identify data-class priorities
  • Build, modify, maintain, and improve data class definitions
  • Feed back technical needs and ideas to the engineering team that make data scanning more accurate and performant
  • Benchmark data scanning and matching, measuring accuracy against test data and telemetry from Open Raven scanning at scale
  • Document and report out current status, while investigating, advocating, and owning improvements
  • Break down larger data quality initiatives into pieces that deliver incremental business value and lead the team to execute on them
  • Drive quality mindset thoughts engineering practices


What we’re looking for

  • 5+ years professional experience as a Software Engineer, Data Scientist, or Machine Learning Engineer
  • Ability to script and code in Python and/or Java
  • Experience working with large scale data
  • Prior success working on data processing and matching systems such as intrusion detection, data loss prevention
  • Excellent teamwork and communication
  • Ability to challenge the norm and maturity to advocate for changes for the greater benefit of the business
  • BS/MS/PhD CS or equivalent


What we offer

  • Startup culture with a product company DNA
  • Competitive compensation + early-stage stock options
  • Excellent health insurance and benefits
  • Flexible work schedules and vacation policy
  • The ability to travel to conferences
  • Remote friendly / distributed team
  • High-performing, fast-paced team
  • A culture that values kindness and being a good human
  • A team with a “don’t talk about it, be about it” mindset


-------

Open Raven is an Equal Opportunity Employer. We are committed to building a diverse team where all races, genders, sexual orientations, ages, religions, and lived experiences are welcome. Equal opportunity for all is not only vital to our success but the right thing to do.

If you need assistance or accommodation due to a disability, you may contact us at careers@openraven.com and state your request for assistance in the subject line.

Job Summary

Job ID:776
Company:Open Raven
Location:Worldwide
Job Type:Full-time
Primary Tag:Software Development

To claim this job, send an email to admin@remoteng.com from your work email with the job ID.

More Details


Website:

https://www.openraven.com/

Job Posted:

3 years ago