Factorio Learning Environment (jackhopkins.github.io) · howrar@lemmy.ca [M] · English · 30 days ago · 0 comments · 4 upvotes, 0 downvotes
Andrew Barto and Richard Sutton are the recipients of the 2024 ACM A.M. Turing Award for developing the conceptual and algorithmic foundations of reinforcement learning. (www.acm.org) · howrar@lemmy.ca [M] · English · 1 month ago · 0 comments · 6 upvotes, 0 downvotes
Open Sourcing π₀ (www.physicalintelligence.company) · howrar@lemmy.ca [M] · English · 2 months ago · 0 comments · 5 upvotes, 0 downvotes
A Little Bit of Reinforcement Learning from Human Feedback -- Nathan Lambert (rlhfbook.com) · howrar@lemmy.ca [M] · English · 2 months ago · 0 comments · 8 upvotes, 1 downvote
Reinforcement Learning: An Overview (arxiv.org) · howrar@lemmy.ca [M] · English · 4 months ago · 0 comments · 1 upvote, 0 downvotes
Keynotes from the 2024 Reinforcement Learning Conference (www.youtube.com) · howrar@lemmy.ca [M] · English · 6 months ago · 0 comments · 1 upvote, 0 downvotes
OpenAI: Learning to Reason with LLMs (openai.com) · howrar@lemmy.ca [M] · English · 7 months ago (edited) · 0 comments · 1 upvote, 0 downvotes
Introducing SIMA, a Scalable Instructable Multiworld Agent (deepmind.google) · howrar@lemmy.ca [M] · English · 1 year ago · 0 comments · 1 upvote, 0 downvotes