Deep reinforcement learning engineer

2 weeks ago


Valencia do Sil, Spain Friday Systems Full-time

Friday Systems builds AI that allows industrial robots to adapt to dynamic warehouse environments. We focus on high-throughput palletizing and related tasks where classical approaches break down. Our stack is built around Deep Reinforcement Learning with modern sequence models.

Tiny team, zero bureaucracy, direct impact, salary + equity.

THE ROLE

Own the DRL stack end-to-end: formulation → algorithm design → large-scale training → evaluation → deployment. You'll work directly with the CTO to turn cutting-edge DRL into production throughput at customer sites.

YOU WILL

  • Design & ship DRL algorithms (PPO/SAC/DDQN and variants, based on encoders/cross-attention/pointer networks) for complex control & combinatorial optimization.
  • Tackle stability & sample-efficiency: GAE, normalization, entropy/KL control, distributional/value-loss tuning, curriculum learning and reward shaping, ...
  • Launch multi-GPU training, parallel rollouts, efficient replay/storage, and reproducible experiment tooling.
  • Productionize: clean PyTorch code, profiling, Dockerized services (FastAPI), AWS deployments, experiment tracking, dashboards.
  • Collaborate with the C-Level Team to ensure product excellence and alignment with business strategy. Forge strong relationships with clients, effectively translating their needs into unique technology solutions.
  • Build and nurture a high-performing team by attracting top talent. Provide mentorship and leadership to foster a culture of quality and innovation.

YOU HAVE

  • Track record shipping RL beyond academic demos: you've led at least one end-to-end RL system from idea to production or a state-of-the-art benchmark in the last 3–5 years.
  • Extensive Deep Learning, Reinforcement Learning & PyTorch expertise: you can implement several DRL algorithms from scratch, reason about the root causes of performance drops and make informed decisions about next steps.
  • Systems know-how: Python, Linux, Docker, multi-GPU, Cloud (AWS).
  • Math maturity: MDPs/Bellman operators, policy gradients, trust-region/KL, GAE/λ-returns, stability/regularization in on-policy vs off-policy regimes.
  • Ownership: you're comfortable being the primary owner for experiments, code quality, and results in a small team.
  • Location/time zone: EU-based (CET±1) and able to travel occasionally to customer warehouses.

We are not considering entry-level or coursework-only profiles for this role.

HIRING PROCESS

  • 30-min intro & mutual fit
  • Deep technical session with CTO on your past RL work (no LeetCode, no homework)
  • Two one-hour "Traits & Skills" conversations with our other Co-founders
  • Meet the team & offer




  • Machine learning engineer

    2 weeks ago


    Valencia do Sil, Spain European Tech Recruit Full-time

    Machine Learning Engineer | AI Start-up | 6-Month fixed-term Contract. Join a European deep-tech leader in quantum and AI. A well-funded, fast-growing company backed by major global investors with its groundbreaking technology is...


  • Valencia, Spain Friday Systems Full-time

    A leading robotics technology company in Valencia is looking for a skilled Reinforcement Learning Engineer to own the DRL stack from formulation to deployment. You will design and ship advanced DRL algorithms, ensure product excellence in collaboration with C-Level executives, and nurture a high-performing team. The ideal candidate has extensive experience...


  • Valencia do Sil, Spain Axpe Consulting Full-time

    Axpe Consulting – Spain (100% remote). We are looking for an AI / Machine Learning Engineer | Stable project | 100% remote. Are you passionate about taking Machine Learning models to production and working on projects where AI has real impact...

  • Machine learning engineer

    2 weeks ago


    Valencia do Sil, Spain TENDAM Full-time

    MACHINE LEARNING ENGINEER / DATA ENGINEER. At Tendam, we are expanding our Data & Analytics team to tackle exciting challenges in the fashion retail industry. We're looking for a Machine Learning Engineer who will bridge the...


  • Valencia, Spain Visium Full-time

    Visium, Valencia, Valencian Community, Spain. Title: Machine Learning Engineer / Data Scientist. About Us: With expertise in strategy, architecture, cloud engineering, analytics, artificial intelligence and machine learning, we empower our clients to unleash and scale the power of their data. We're on a mission to pioneer a bright future and build future-proof and...


  • Valencia, Spain Visium SA Full-time

    Senior Machine Learning Engineer. Visium SA • Valencia, Valencian Community, ES. Job description: Title: Senior Machine Learning Engineer / Data Scientist. Type: Full-time. Location: Valencia or Barcelona. About us: At Visium, we help enterprise executives define their AI & Data strategy, execute large-scale transformations and implement AI across...