"We must actively work on protecting these models from bad actors who seek to exploit them for malicious purposes, such as generating disinformation, creating non-consensual imagery, or automating cyberattacks. " -
Disinformation - a term that is often used to hide "inconvenient" truths. This is about controlling the narrative.
Non-consensual imagery - someone using your person and having its likeness generate a false statement? Photoshop exists and already does this.
Automating cyberattacks - ever tried to purchase an RTX 5090 at MSRP? Do you think those are humans you're battling against to purchase it?
Is it disinformation to suggest that these AIs already exist and have already been deployed?
From Gemini:"
. The Asymptotic Wall: S→1,C→∞
As your safety requirements approach perfection, you hit the Halting Problem and Rice's Theorem.
Rice’s Theorem: Any non-trivial property of a program’s behavior (like "Will this AI ever lie?" or "Will this AI cause harm?") is undecidable.
The Logic: To prove a system will never perform action X in any possible future state, you must simulate or formally verify every possible execution path. For a Turing-complete system, the number of states is infinite.
2. The Scaling Reality
In the formula dS/dC = α·(1−S)/C^β, as S approaches 1, the "Risk Gap" (1−S) approaches zero. To keep the rate of safety improvement constant, C must explode.
3. The Reality Check
We are currently in a "Safety Debt" crisis. We are scaling the capabilities of models (n) at a rate that far outpaces our ability to compute the proofs of their safety.
If we have a model with 1 trillion parameters, the compute C required to guarantee it won't produce a "Black Swan" event exceeds the energy available in the solar system. Therefore, we settle for "Good Enough" (statistical alignment) and call it "Safe.""
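Since the quote leans on Rice's Theorem, here is a toy sketch of the reduction it's gesturing at (my own illustration, not Gemini's; `never_does_x`, `do_forbidden_x`, and `make_wrapper` are hypothetical names, not any real API):

```python
# If a perfect checker for "this program never performs action X" existed,
# it would decide the halting problem, which is impossible.

def do_forbidden_x():
    """Stand-in for the unsafe behavior we want to rule out."""
    raise RuntimeError("action X performed")

def make_wrapper(program, data):
    """Return a program whose only path to action X is `program` halting."""
    def wrapper():
        program(data)      # run the arbitrary program on its input...
        do_forbidden_x()   # ...and only if it halts, perform action X
    return wrapper

def never_does_x(program):
    """The impossible perfect safety checker."""
    raise NotImplementedError("cannot exist in general (Rice's Theorem)")

def decide_halting(program, data):
    # wrapper performs X  <=>  program(data) halts, so a working
    # never_does_x would answer the undecidable halting question.
    return not never_does_x(make_wrapper(program, data))
```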
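And to see what the dS/dC formula implies numerically, a quick sketch (α, β, and C0 are illustrative values I picked; the quote gives none). Separating variables and integrating from C0, with β < 1, gives a closed form for the compute needed to reach a given safety level S:

```python
import math

# dS/dC = alpha*(1-S)/C**beta, S(C0) = 0, beta < 1, integrates to:
#   C(S) = (C0**(1-beta) + (1-beta)/alpha * ln(1/(1-S)))**(1/(1-beta))

def compute_needed(S, alpha=1.0, beta=0.9, C0=1.0):
    gap = math.log(1.0 / (1.0 - S))   # ln(1/(1-S)): shrinking "Risk Gap"
    return (C0 ** (1 - beta) + (1 - beta) / alpha * gap) ** (1 / (1 - beta))

for s in (0.9, 0.99, 0.999, 0.9999):
    print(f"S = {s}: C ≈ {compute_needed(s):8.1f}")
```

Each additional "nine" of safety multiplies the required compute severalfold under these parameters. For β > 1 the picture is worse: the integral of C^(−β) converges, so S saturates strictly below 1 no matter how much compute you add; that's the "asymptotic wall" in one line.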
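Even the "energy available in the solar system" line survives a back-of-envelope check (my arithmetic and constants, not Gemini's):

```python
import math

# Exhaustively visiting every weight setting of a 1T-parameter fp16 model,
# at even the Landauer limit per bit operation, needs vastly more energy
# than the Sun will ever emit.

PARAMS = 1e12
BITS = 16 * PARAMS                     # distinct states = 2**BITS
LANDAUER_J = 2.87e-21                  # kT*ln2 at ~300 K, joules per bit op
SUN_LIFETIME_J = 3.8e26 * 3.15e17      # luminosity (W) x ~10 Gyr in seconds

log10_states = BITS * math.log10(2)                   # ≈ 4.8e12
log10_joules = log10_states + math.log10(LANDAUER_J)  # still ≈ 4.8e12
print(f"log10(states to check) ≈ {log10_states:.2e}")
print(f"log10(joules required) ≈ {log10_joules:.2e}")
print(f"log10(Sun lifetime J)  ≈ {math.log10(SUN_LIFETIME_J):.1f}")  # ≈ 44.1
```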
All of this is just to ask, again, what specific safety requirement are you worried about that hasn't already been compensated for?