Palisade Research

Palisade’s Response to the Department of Commerce’s Proposed AI Reporting Requirements

2024-10-11T00:00:00+00:00

In September 2024, the Department of Commerce’s Bureau of Industry and Security (BIS) released a proposed rule that would establish reporting requirements for entities developing advanced AI models or advanced computing clusters. BIS issued a public request for comments, inviting individuals and organizations to provide feedback and suggest improvements to the proposed rule.

Palisade Research submitted a comment, focusing on recommendations that could strengthen the reporting requirements for entities developing dual-use foundation models. We believe that AI capabilities are improving rapidly, and it’s essential for the US federal government to acquire information that allows it to prepare for AI-related threats to national security and public safety.

In our comment, we outline five core recommendations that can improve the effectiveness and robustness of the reporting requirements. We believe these recommendations could promote transparency into frontier AI development, enabling the government to better understand and prepare for potential safety and security threats.

Our full response is available here.

Below, we present a summary of our five core recommendations:

Establish a protected and/or anonymous reporting mechanism for employees at entities developing dual-use foundation models. Allow employees to submit concerns to ai_reporting@bis.doc.gov. Employees should be empowered to disclose: (a) any information indicating that the company’s reports to BIS may be inaccurate or misleading and (b) other information pertaining to the safety and reliability of dual-use foundation models, or activities or risks that present concerns regarding U.S. national security. Ideally, this mechanism would be both protected (entities developing dual-use foundation models would be prohibited from retaliating against individuals who use it for legitimate purposes) and anonymous. If making the reporting mechanism protected is not feasible, we believe an anonymous reporting mechanism would still provide substantial value. Entities should also affirm in their reports to BIS that (a) they have made employees aware of this reporting mechanism and (b) their policies (e.g., non-disclosure agreements) will not prohibit, punish, or discourage employees from using this mechanism.
Establish a regular interview program with employees at entities producing dual-use foundation models. BIS should conduct quarterly interviews with employees at companies developing dual-use foundation models. BIS would select interviewees from an employee roster, drawing individuals from multiple teams to capture a diverse range of knowledge. In these interviews, BIS would ask employees about dual-use foundation models, their capabilities, concerns regarding safety and security, and expectations about future progress in AI that could produce novel safety and security threats. For additional details about this proposal, see this paper.
Require capability forecasts. In addition to requiring information about red-team testing, BIS should require companies to provide their best estimates of when they anticipate they or others will develop dual-use foundation models with certain kinds of security-relevant capabilities¹. This would allow the US industrial base and defense establishment to make more informed predictions about future advances in AI systems and their implications for national defense.
Require responses to a Summary Form that is legible to non-experts. In addition to requiring reports of red-team testing, we recommend that BIS require entities to submit a short Summary Form. The Summary Form would be accessible to non-technical audiences and highlight the most important defense-relevant information. Our full response includes example questions that the form could ask.
Amend the notification conditions such that entities must notify BIS of major capability improvements that pose imminent security risks. Some advances in AI capabilities may occur suddenly, and it may be essential for BIS to learn of these advances before the start of a new quarter. Therefore, we recommend that BIS require entities to report any major capability improvements that have imminent implications for national defense within 5 days.

Palisade’s full response is available here. If you have questions, feel free to contact us at policy@palisaderesearch.org.

¹ For example, those specified under the EO definition of dual-use foundation models: (1) Substantially lowering the barrier of entry for non-experts to design, synthesize, acquire, or use chemical, biological, radiological, or nuclear (CBRN) weapons; (2) Enabling powerful offensive cyber operations through automated vulnerability discovery and exploitation against a wide range of potential targets of cyberattacks; or (3) Permitting the evasion of human control or oversight through means of deception or obfuscation.

Introducing FoxVox

2024-07-11T00:00:00+00:00

FoxVox is an open source Chrome extension, powered by GPT-4, that demonstrates how AI could be used to manipulate the content you consume. Use it to experience how any web site could push hidden agendas, or subtly flatter the reader’s personal biases.

For example, check out how it rewrites the New York Times home page for a Fox News viewer, or a Vox reader:

You can see more examples, and install the extension, on the FoxVox home page.

Automated deception is here

2024-07-05T00:00:00+00:00

You might have heard the AI fake of Joe Biden telling New Hampshire voters to stay home. Or about the Zoom scammer using a fake video of an executive to defraud a Hong Kong company of $25 million. These are deepfakes: AI-generated video or audio, made to mimic the appearance or voice of a real person.

Advances in AI are helping dedicated scammers train more and more realistic voice models. Creating many deepfake voices is a labor-intensive process, but AI can help with that too. We had a hunch that an AI system could do most of what a scammer does entirely on its own. So we built Ursula. If you give Ursula a person’s name, it will search the web for videos or podcasts featuring them, extract the portions of the audio that contain their voice, and train a deepfake voice from those clips.

To test it out, we gave Ursula the names of 100 trusted media personalities. In about 30 minutes, we had 80 of their voices. Check out the results:

But people have been scamming each other for millennia, right? Yes. But in the past, if you were the target of a highly personalized and convincing scam, that meant that someone had spent a lot of time and effort targeting you specifically. Because it was expensive, it was also relatively rare.

AI systems are changing this: Ursula took one engineer less than two weeks to build, mostly using AI systems that were released over the last two years. With the tool, it only costs $1 to steal a new voice. It will soon get even cheaper for scammers to mount shockingly personal attacks against unsuspecting people. You can hear an example of what that might sound like in this video from Control AI:

What does this mean?

The most obvious implication of this demonstration is that you can no longer safely assume that the person calling you on the phone, or over Zoom, is who they seem to be.

But these same AI systems will enable all kinds of scalable deception as well: not only widespread fraud, but also political manipulation and misinformation via social media. The same technology that we used to automatically research targets of our voice cloning system can easily be used to research and deceive people on Facebook.

Companies are working to mitigate some of the risks: Google has announced automated real-time scam detection for phone calls, and an industry coalition is working on standards for verifiable video. But technologies like this will take time to reach maturity, and even more time to reach widespread use. It’s important to be prepared in the meantime.

How can I protect myself and the people I care about?

You can protect yourself from phishing by following two steps:

First, notice when the stakes are high. If someone asks you for something over the phone or the internet, that’s when you need to be suspicious, even if they seem to be someone you know and trust! Here are some more specific warning signs:

They’re contacting you from a number or account you don’t recognize.
There’s a sense of pressure or urgency. Maybe they say they’re in jail, or stuck at an airport with no phone or money.
They’re asking you for information you wouldn’t share publicly (passwords, social security numbers, bank account numbers).
They’re asking you to go to a specific website, click a specific link, or call a specific number.
They’re asking you to send them money.

These signs don’t always mean you’re being phished, but they are common features of scams. If you notice any of the above, remember to do step 2:

Second, verify identities. When the stakes are high, always double-check the identity of the person you think you’re talking to! The best way to do this is an “out-of-band” check: contact them using a second communication method. If they called you on the phone, check with them on Facebook or WhatsApp. If they messaged you over Facebook, send them a text. If they say they can’t (for example, because they’re calling from a jailhouse phone), verify their identity by asking them questions about your shared history. If it’s your bank calling, hang up and call them back at a number on their public website.

It’s not possible to be totally safe from scammers. But following these two steps will protect you from the most common and dangerous scams.