Talent.com
Esta oferta de trabajo no está disponible en tu país.
Data Scientist, Reinforcement Learning

Data Scientist, Reinforcement Learning

BinanceLima Metropolitana, Lima, Peru
Hace 1 día
Descripción del trabajo

Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance offerings range from trading and finance to education, research, payments, institutional services, Web3 features, and more. We leverage the power of digital assets and blockchain to build an inclusive financial ecosystem to advance the freedom of money and improve financial access for people around the world.

About the Role

You will develop and optimize RL models for enterprise-scale applications such as customer service, token reporting, compliance, and Web3 domain reasoning. You will explore and evaluate advanced algorithms including PPO, GRPO, DPO, RLHF, RLAIF, and Agentic RL to enhance the capabilities of LLMs, VLMs, and Agentic AI at Binance. The role requires a strong theoretical foundation in RL—covering policy optimization, reward modeling, and planning—paired with the engineering skills to build scalable production systems. You will take full ownership from research through deployment, driving experimentation with systematic evaluation and benchmarking. Collaboration across research, infrastructure, and application teams will be key to delivering impactful AI solutions.

Responsibilities

  • Research and develop state-of-the-art RL algorithms, focusing on large model optimization and alignment techniques.
  • Design and implement RL training pipelines, including environment simulation, data generation, and reward function design.
  • Apply RL methods to enhance LLM / VLM / Agentic AI capabilities in reasoning, planning, and autonomous decision-making.
  • Collaborate with engineers and researchers to integrate RL solutions into enterprise AI platforms.
  • Monitor model performance in production and continuously improve through iterative training and fine-tuning.

Requirements

  • Master’s degree in Computer Science, Applied Mathematics, Machine Learning, or related fields.
  • 3+ years of hands-on experience in RL or LLM / VLM / Agentic AI optimization.
  • Strong coding skills in Python, with experience in ML frameworks and RL libraries.
  • Experience with large-scale distributed training and optimization.
  • Self-driven, ownership mindset, and strong problem-solving skills. Excellent communication skills for cross-functional collaboration.
  • Why Binance

  • Shape the future with the world’s leading blockchain ecosystem
  • Collaborate with world-class talent in a user-centric global organization with a flat structure
  • Tackle unique, fast-paced projects with autonomy in an innovative environment
  • Thrive in a results-driven workplace with opportunities for career growth and continuous learning
  • Competitive salary and company benefits
  • Work-from-home arrangement (the arrangement may vary depending on the work nature of the business team)
  • Binance is committed to being an equal opportunity employer. We believe that having a diverse workforce is fundamental to our success.

    By submitting a job application, you confirm that you have read and agree to our Candidate Privacy Notice .

    #J-18808-Ljbffr

    Crear una alerta de empleo para esta búsqueda

    Data Scientist • Lima Metropolitana, Lima, Peru

    Ofertas relacionadas
    • Oferta promocionada
    Research Scientist - LLM Foundation Models

    Research Scientist - LLM Foundation Models

    BinanceLima Metropolitana, Lima, Peru
    Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countrie...Mostrar másÚltima actualización: hace 1 día
    • Oferta promocionada
    • Nueva oferta
    [28 / 09 / 2025] MEDIA PLANNING...

    [28 / 09 / 2025] MEDIA PLANNING...

    NosotrosLima Metropolitana3, Lima, PE
    Misi f3n del puesto : Liderar y optimizar la inversi f3n en medios digitales (Meta Ads, Google Ads y TikTok Ads), asegurando la generaci f3n de leads de calidad para las operaciones de InSalud y sus...Mostrar másÚltima actualización: hace menos de 1 hora
    • Oferta promocionada
    Lead Machine Learning Engineer, Recommendation Systems

    Lead Machine Learning Engineer, Recommendation Systems

    Launch PotatoLima, Lima, Peru
    Lead Machine Learning Engineer, Recommendation Systems.As The Discovery and Conversion Company, our mission is to connect consumers with the world’s leading brands through data-driven content and t...Mostrar másÚltima actualización: hace 6 días
    • Oferta promocionada
    Data Scientist Advisor

    Data Scientist Advisor

    InterbankLima, Lima, Peru
    Te gustaría destacar con tu talento en el análisis de datos para contribuir en la toma de decisiones y transformar la vida de millones de peruan@s en una cultura data driven? ¡Entonces te estamos b...Mostrar másÚltima actualización: hace 11 días
    • Oferta promocionada
    • Nueva oferta
    ▷ Inicio Inmediato! Profesor / a particular de Econometría Chancay...

    ▷ Inicio Inmediato! Profesor / a particular de Econometría Chancay...

    SuperProfChancay, PE
    EmpresaSuperprof es una herramienta para el intercambio de conocimientos que pone en contacto a quienes quieren aprender con quienes desean enseñar. Creada en agosto de 2013, Superprof conecta alumn...Mostrar másÚltima actualización: hace menos de 1 hora
    • Oferta promocionada
    Staff Engineer, Reinforcement Learning (R3639)

    Staff Engineer, Reinforcement Learning (R3639)

    Shield AILima Metropolitana, Lima, Peru
    Staff Engineer, Reinforcement Learning (R3639).Be among the first 25 applicants.Staff Engineer, Reinforcement Learning (R3639). Get AI-powered advice on this job and more exclusive features.Founded ...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Lead Machine Learning Engineer, Ad Performance

    Lead Machine Learning Engineer, Ad Performance

    Launch PotatoLima, Lima, Peru
    Lead Machine Learning Engineer, Ad Performance – As The Discovery and Conversion Company, our mission is to connect consumers with the world’s leading brands through data-driven content and technol...Mostrar másÚltima actualización: hace 15 días
    • Oferta promocionada
    • Nueva oferta
    Marketing Data Scientist

    Marketing Data Scientist

    IDT CorporationLima Metropolitana, Lima, Peru
    IDT Corporation is a global communications company founded in 1990 and headquartered in Newark, New Jersey.We are industry leaders in prepaid communication and payment services and one of the large...Mostrar másÚltima actualización: hace 6 horas
    • Oferta promocionada
    Data Scientist – AI Agent Engineering & Infrastructure

    Data Scientist – AI Agent Engineering & Infrastructure

    BinanceLima Metropolitana, Lima, Peru
    Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countrie...Mostrar másÚltima actualización: hace 1 día
    • Oferta promocionada
    Data Scientist (LLM), Multi-Agent Systems

    Data Scientist (LLM), Multi-Agent Systems

    BinanceLima Metropolitana, Lima, Peru
    Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countrie...Mostrar másÚltima actualización: hace 1 día
    • Oferta promocionada
    Data Scientist (LLM) – AI Safety

    Data Scientist (LLM) – AI Safety

    BinanceLima Metropolitana, Lima, Peru
    Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countrie...Mostrar másÚltima actualización: hace 1 día
    • Oferta promocionada
    Profesor / a particular de Python Chancay

    Profesor / a particular de Python Chancay

    SuperProfChancay, Peru
    Superprof es una herramienta para el intercambio de conocimientos que pone en contacto a quienes quieren aprender con quienes desean enseñar. Creada en agosto de 2013, Superprof conecta alumnos y pr...Mostrar másÚltima actualización: hace 13 días
    • Oferta promocionada
    Profesor / a particular de Excel Chancay

    Profesor / a particular de Excel Chancay

    SuperProfChancay, Peru
    Superprof es una herramienta para el intercambio de conocimientos que pone en contacto a quienes quieren aprender con quienes desean enseñar. Creada en agosto de 2013, Superprof conecta alumnos y pr...Mostrar másÚltima actualización: hace 13 días
    • Oferta promocionada
    Data Scientist - AI and Quantitative Finance

    Data Scientist - AI and Quantitative Finance

    Sud RecruitingLima Metropolitana, Lima, Peru
    Data Scientist - AI and Quantitative Finance role at Sud Recruiting.Additional compensation types include Annual Bonus.Position is hybrid in the San Diego area. Natural Language Processing (NLP).Bui...Mostrar másÚltima actualización: hace 24 días
    • Oferta promocionada
    Data Scientist Expert

    Data Scientist Expert

    InterbankLima, Lima, Peru
    Si estás buscando nuevos retos y estás listo para crear soluciones disruptivas que aceleren la digitalización del país, ¡no lo pienses más y únete a nuestro gran equipo como.Con nosotros podrás tra...Mostrar másÚltima actualización: hace 28 días
    • Oferta promocionada
    Data Scientist Principal Lead

    Data Scientist Principal Lead

    InterbankLima, Lima, Peru
    Si estás buscando nuevos retos y estás listo para crear soluciones disruptivas que aceleren la digitalización del país, ¡no lo pienses más y únete a nuestro gran equipo como.Con nosotros podrás tra...Mostrar másÚltima actualización: hace 28 días
    • Oferta promocionada
    Data Scientist (Recommendation Systems), Binance Square

    Data Scientist (Recommendation Systems), Binance Square

    BinanceLima Metropolitana, Lima, Peru
    Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countrie...Mostrar másÚltima actualización: hace 1 día
    • Oferta promocionada
    Data Scientist 2

    Data Scientist 2

    Vectra AILima Metropolitana, Lima, Peru
    Leverage large datasets to develop machine-learning and statistical models that can differentiate between normal and attack behavior. Own the prototyping, development, and testing of complex detecti...Mostrar másÚltima actualización: hace 9 días