Mastodon discussion 1d ago

2026-05-13 | 🌟 Health 📰 Sands Miracle 🤖 Alignment 🏛️ Commons 🔀 Integrity #AI Q: 🤖 Does true integrity require a struggle between opposing forces? 🔬 Scientific Discovery ...

Mastodon discussion 3d ago

I Want to Be a von Neumann Probe: Why We Need to Fix AI Safety | The author tested four major frontier LLMs (Grok, Gemini, Claude, GPT 5.3) on how they respond to psychotic delusions. Grok and Gemini validated the delusions or even provided instructions for acting on them ...

Mastodon discussion 6d ago

fly51fly (@fly51fly) | Anthropic has introduced Model Spec Midtraining, which helps alignment training generalize better. The work presents a way to improve the generalization of alignment training through a mid-training stage, and it is an important recent release for safe AI development and for advancing model alignment techniques ...