RE/RS, Data Understanding (MM)
OpenAI
About The Team The Data Understanding team is responsible for creating the high quality datasets and their quantized representation for OpenAI. This includes synthesizing multimodal data, building VQ representations, and processing, filtering, deduplication, quality control, and tokenization so it can be used effectively in big model training runs. About The Role We're looking to advance how OpenAI prepares, curates, synthesizes and understands multimodal data at scale. You'll work on research and production problems like synthesizing multimodal content (images, audio, and video) and their supervisions, improving noisy data pipelines, building better quality filters, using models to automate data prep, and measuring whether changes in the dataset improve model performance. We Expect You To
We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.
For additional information, please see OpenAI's Affirmative Action and Equal Employment Opportunity Policy Statement. Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US-based candidates. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations. To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance. We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link. OpenAI Global Applicant Privacy Policy At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.
- Have a strong track record of new or improved ML ideas, through publications, projects, or applied research.
- Own and drive a research agenda, from choosing the right multimodal data problems to carrying long-running work through to impact.
- Be excited by OpenAI's empirical, collaborative approach to research.
- Experience with multimodal learning, audio, vision, video, synthetic data, or data-centric ML.
- Thoughtfulness about AI's impact, including privacy, provenance, and data quality.
- Experience building high-performance deep learning or large-scale data processing systems.
We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.
For additional information, please see OpenAI's Affirmative Action and Equal Employment Opportunity Policy Statement. Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US-based candidates. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations. To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance. We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link. OpenAI Global Applicant Privacy Policy At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.
Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the RE/RS, Data Understanding (MM) in San Francisco, CA vacancy
$290k
...This role focuses on building the strategic unit economics understanding of OpenAI, guiding sustainable growth to make it the most impactful... ...lead the development of foundational causal inference and data science models and frameworks to predict and quantify the drivers...DataWork at officeRelocation package3 days per week$146.5k
...Scribd, Inc. is on a mission to advance human understanding. Our four products — Scribd®, Slideshare®, Everand™, and Fable — help billions... ...audiobooks, and more. We work at the intersection of machine learning, data engineering, and distributed systems, collaborating closely...DataLocal areaWorldwideHome officeFlexible hours$126k
...Scribd, Inc. is on a mission to advance human understanding. Our four products — Scribd®, Slideshare®, Everand™, and Fable — help billions... ...audiobooks, and more. We work at the intersection of machine learning, data engineering, and distributed systems, collaborating closely...DataLocal areaWorldwideHome officeFlexible hours- Desired Skills: Supply Chain Management SAP MM Excel SQL Tableau Power BI Data Analysis techniques such as Statistical analysis Predictive modeling... ...Tableau and Power BI. Hand on experience on SAP MM. Understanding of the basics of supply chain management, including...Data
$146.5k
Scribd, Inc. is on a mission to advance human understanding. Our four products — Scribd®, Slideshare®, Everand™, and Fable — help billions... ...audiobooks, and more. We work at the intersection of machine learning, data engineering, and distributed systems, collaborating closely...DataLocal areaWorldwideHome officeFlexible hours- ...Data Modeler Location: San Francisco Job Description: Open for subcontractors Must Have Skills: ~7 years experience... ...Experience working with Data Engineering and Data Analytics teams Understanding of principles and standard methodologies around Data...DataFor subcontractor
$75 - $80 per hour
...Data Architect Pay Range: $75hr - $80hr Responsible for designing and implementing conceptual, logical, and physical data... ...Ability to write SQL queries and perform data analysis. ~ Strong understanding of database technologies and data architecture principles. ~...Data- ...Business Data Analyst Location: San Francisco, CA Duration: Long Term Pharma Domain Mandatory Business Data Analyst in... ...other related pharma development data standards Data modelling. Understands the concepts and principles of data modelling and is able to produce...Data
- ...This role involves creating reports, extracting data, and collaborating with teams to identify and... ...solve bottlenecks. Candidates should have a strong understanding of supply chain management, along with experience in SAP MM, Excel, SQL, and data visualization tools like...Data
- ...Data Modeler Visa status: U.S. Citizens and those authorized to work in the U.S. are encouraged to apply. Tax Terms: W2, 1099 Corp... ...what patients need next. What you'll be working on: Understand and translate business needs into enterprise data models for Pharma...Data
- ...Data Architect Data Architect Bay Area, CA / Charlotte, NC (1 position for each) Manage multiple projects Design and execute... ...Lead product evaluations and make product recommendations Understand existing architectures and apply to solution design Create conceptual...Data
- ...Data Modeler San Francisco, CA Job Description: Must Have Skills ~7+ years of experience in the following tools/areas... ...Experience working with Data Engineering and Data Analytics teams Understanding of data architecture principles and best practices Ability...Data
- ...with R&D scientists, clinical operations, imaging SMEs, and CROs to understand workflows for image ingestion, review, labeling, and annotation. Define end-to-end requirements for image ingestion, data quality checks, metadata harmonization, viewing, annotation, and...Data
$170k - $225k
...Data Architect Atlanta; Boston; Charlotte; Chicago; Dallas; Los Angeles; New York; San Francisco This position is not eligible... ...or entity that submits an unsolicited resume does so with the understanding that Accordion will have the right to hire that applicant at...DataWork at officeLocal areaRemote work2 days per week- ...Scientist. It is a discovery engine that transforms messy biological data into insights in minutes. Scientists ask questions in natural... ...for structured or semi structured biological data Strong understanding of metadata standards, biological ontologies, and domain logic...DataWork at office
$82k - $118k
...large-scale neuroscientific connectomic data collection. Our mission is to enable the... ...proper tissue architecture. Fundamental understanding of fluorescence and fluorophore labeling... ...result in the timely staining of thick (1 mm) tissue sample with antibodies and small...DataFull time- ...The Data Architect is responsible for designing scalable data architectures that support the integration of multiple data sources... ...concepts Experience with cloud-based data solutions (AWS or similar) Understanding of data pipelines and ETL/ELT processes Experience with SQL...DataRemote work
- ...of varying technical levels Ability to define relevant metrics that can guide and influence stakeholders to the appropriate and accurate insights Building clear and easy to understand dashboards (Tableau) and presentations Location: SFO Duration: 1 Year...Data
$251k - $310k
...Technical Lead Manager, Perception, Vehicle Understanding Waymo is an autonomous driving... ...foundational models, large-scale 3rd party data, and partner teams in Research, Oracles,... ...priorities. Drive process and clarify R&Rs for cross-organizational alignments....DataFull timeTemporary workImmediate startRemote work- ...and characterization to lead our signal characterization and data labeling efforts. This role focuses on turning real world RF sensor... ...cluttered, interference heavy operational environments. Understanding of how label quality, taxonomy design, multi‑sensor context (for...DataFull time
- ...AWS Data Engineer Location: San Francisco and jersey city ( 3 days onsite 2 days remote) look for nearby Candidates... ...with data scientists, analysts, and other stakeholders to understand data requirements and deliver high-quality data solutions. Required...DataRemote work
- ...Data Engineer (Snowflake, DBT, Airflow) with SQL San Francisco, CA onsite Contract Role Description: SQL and Snowflake experience along with understanding the data pipelines. Documentation of the data pipelines, metadata, data catalog experience...DataContract workImmediate start
- ...Job Title: Sr. Data Scientist Job Location: Remote Job Duration: 8 Months on W2 Job Description: Utilize quantitative analysis to understand business drivers and conduct analytics experiments for hypothesis testing. Leverage statistical techniques...DataRemote work
- ...We are seeking a Strategic Data Solutions Architect to help enterprise organizations modernize their data ecosystems and build AI... ...data integration and modern cloud data platforms. ~ Strong understanding of data governance, data quality, and AI readiness strategies....DataWork at office2 days per week1 day per week
$100k - $170k
...teams and customers. What You'll Do Respond directly to customer data and analytics requests Build ad-hoc reports and answer business... ...Experienced with GA4, Segment, Amplitude, or Heap and understand how data flows from browser event to dashboard Familiarity with...DataFor contractorsWork at officeFlexible hours- ...About Us: At Emendata, we help bridge the gap between data and the real world. We partner with organizations that have a social... ...numbers requires more than just the right methods; it calls for understanding the people, policies, data, and systems involved....DataFull timeLive inVisa sponsorship
$60 - $70 per hour
...Overview Title: Data Analyst — Tableau / Snowflake Reporting Suite Job Type: Contract Contract Length: 3 Months Pay Range: $60 -... ...working knowledge of Snowflake, including writing SQL queries, understanding schemas, and working with large datasets. Domain Knowledge: Experience...DataContract workImmediate startRemote work- ...We are seeking a highly skilled Data Analyst with hands‑on experience in advanced data engineering and analytics tools. The ideal... ...of Role‑Based and Attribute‑Based Security. Strong understanding of data modeling techniques and root cause analysis. Experience...Data
- ...gathering, analyzing, and interpreting a wide variety of research data. Designs and conducts research including selecting data... ...knows how to apply theory and put it into practice with in-depth understanding of the professional field; independently performs the full range...DataTraineeshipWork at office
- ...Data Governance Engineer Chime Financial, Inc The Data Governance function is pivotal in ensuring the integrity, trustworthiness,... ...trust signals and scorecards that help data consumers quickly understand and act on the reliability of Chime’s data. This is a hands-on...Data
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to RE/RS, Data Understanding (MM). Be the first to apply!
Related searches
- data officer San Francisco, CA
- data network cabling San Francisco, CA
- data auditor San Francisco, CA
- test data management San Francisco, CA
- data mining San Francisco, CA
- minimum data set coordinator San Francisco, CA
- data capturer San Francisco, CA
- data tech San Francisco, CA
- sap data migration San Francisco, CA
- provider data management San Francisco, CA


