Lugh@futurology.todayM to Futurology@futurology.todayEnglish · 9 months agoTwo-faced AI language models learn to hide deception - ‘Sleeper agents’ seem benign during testing but behave differently once deployed. And methods to stop them aren’t working.www.nature.comexternal-linkmessage-square8fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkTwo-faced AI language models learn to hide deception - ‘Sleeper agents’ seem benign during testing but behave differently once deployed. And methods to stop them aren’t working.www.nature.comLugh@futurology.todayM to Futurology@futurology.todayEnglish · 9 months agomessage-square8fedilink
minus-squaremateomaui@reddthat.comlinkfedilinkEnglisharrow-up0·9 months agoAlright, I’ll be out back digging the bomb shelter.
minus-squarePossibly linux@lemmy.ziplinkfedilinkEnglisharrow-up0·9 months agoIts to late for that honestly
minus-squaremateomaui@reddthat.comlinkfedilinkEnglisharrow-up0·9 months agoAlright, I’ll switch to digging holes for the family burial ground.
Alright, I’ll be out back digging the bomb shelter.
Its to late for that honestly
Alright, I’ll switch to digging holes for the family burial ground.