Salesforce Study Finds LLM Agents Flunk CRM and Confidentiality Tests

SlashDot - Mon, 06/16/2025 - 18:10
A new Salesforce-led study found that LLM-based AI agents struggle with real-world CRM tasks, achieving only 58% success on simple tasks and dropping to 35% on multi-step ones. They also demonstrated poor confidentiality awareness. "Agents demonstrate low confidentiality awareness, which, while improvable through targeted prompting, often negatively impacts task performance," a paper published at the end of last month said. The Register reports: The Salesforce AI Research team argued that existing benchmarks failed to rigorously measure the capabilities or limitations of AI agents, and largely ignored an assessment of their ability to recognize sensitive information and adhere to appropriate data handling protocols. The research unit's CRMArena-Pro tool is fed a data pipeline of realistic synthetic data to populate a Salesforce organization, which serves as the sandbox environment. The agent takes user queries and decides between an API call or a response to the users to get more clarification or provide answers. "These findings suggest a significant gap between current LLM capabilities and the multifaceted demands of real-world enterprise scenarios," the paper said. [...] AI agents might well be useful, however, organizations should be wary of banking on any benefits before they are proven.

Read more of this story at Slashdot.

Trump Mobile Phone Company Announced by President’s Family, but Details Are Murky

NY Times - Mon, 06/16/2025 - 17:38
The new company says it will manufacture its Android phone in the United States but it has not said how it could do that.

Negotiation or Capitulation? How Columbia Got Off Trump’s Hot Seat.

NY Times - Mon, 06/16/2025 - 17:30
The university has largely complied with the administration’s demands, but has adjusted them in meaningful ways. One department offers a window into that effort.

The US Navy Is More Aggressively Telling Startups, 'We Want You'

SlashDot - Mon, 06/16/2025 - 17:30
An anonymous reader quotes a report from TechCrunch: While Silicon Valley executives like those from Palantir, Meta, and OpenAI are grabbing headlines for trading their Brunello Cucinelli vests for Army Reserve uniforms, a quieter transformation has been underway in the U.S. Navy. How so? Well, the Navy's chief technology officer, Justin Fanelli, says he has spent the last two and a half years cutting through the red tape and shrinking the protracted procurement cycles that once made working with the military a nightmare for startups. The efforts represent a less visible but potentially more meaningful remaking that aims to see the government move faster and be smarter about where it's committing dollars. "We're more open for business and partnerships than we've ever been before," Fanelli told TechCrunch in a recent episode of StrictlyVC Download. "We're humble and listening more than before, and we recognize that if an organization shows us how we can do business differently, we want that to be a partnership." Right now, many of these partnerships are being facilitated through what Fanelli calls the Navy's innovation adoption kit, a series of frameworks and tools that aim to bridge the so-called Valley of Death, where promising tech dies on its path from prototype to production. "Your granddaddy's government had a spaghetti chart for how to get in," Fanelli said. "Now it's a funnel, and we are saying, if you can show that you have outsized outcomes, then we want to designate you as an enterprise service." In one recent case, the Navy went from a Request for Proposal (RFP) to pilot deployment in under six months with Via, an eight-year-old, Somerville, Massachusetts-based cybersecurity startup that helps big organizations protect sensitive data and digital identities through, in part, decentralization, meaning the data isn't stored in one central spot that can be hacked. (Another of Via's clients is the U.S. Air Force.) 
The Navy's new approach operates on what Fanelli calls a "horizon" model, borrowed and adapted from McKinsey's innovation framework. Companies move through three phases: evaluation, structured piloting, and scaling to enterprise services. The key difference from traditional government contracting, Fanelli says, is that the Navy now leads with problems rather than predetermined solutions. "Instead of specifying, 'Hey, we'd like this problem solved in a way that we've always had it,' we just say, 'We have a problem, who wants to solve this, and how will you solve it?'" Fanelli said.

Obscure Chinese Stock Scams Dupe American Investors by the Thousands

SlashDot - Mon, 06/16/2025 - 16:50
Thousands of American investors have lost millions of dollars to sophisticated pump-and-dump schemes involving small Chinese companies listed on Nasdaq, prompting the Justice Department to declare the fraud a priority under the Trump administration's white-collar enforcement program. The scams recruit victims through social media ads and WhatsApp messages, directing them to purchase shares in obscure Chinese firms whose stock prices are artificially inflated before collapsing. Since 2020, nearly 60 China-based companies have conducted initial public offerings on Nasdaq raising $15 million or less each, with more than one-third experiencing sudden single-day price drops exceeding 50%. In one recent case, seven traders earned over $480 million by defrauding 600 victims who purchased shares in China Liberal Education Holdings.

OpenAI, Growing Frustrated With Microsoft, Has Discussed Making Antitrust Complaints To Regulators

SlashDot - Mon, 06/16/2025 - 16:11
Tensions between OpenAI and Microsoft over the future of their famed AI partnership are flaring up. The Wall Street Journal reports: OpenAI wants to loosen Microsoft's grip on its AI products and computing resources, and secure the tech giant's blessing for its conversion into a for-profit company. Microsoft's approval of the conversion is key to OpenAI's ability to raise more money and go public. But the negotiations have been so difficult that in recent weeks, OpenAI's executives have discussed what they view as a nuclear option: accusing Microsoft of anticompetitive behavior during their partnership, people familiar with the matter said. That effort could involve seeking federal regulatory review of the terms of the contract for potential violations of antitrust law, as well as a public campaign, the people said.

That 'Unsubscribe' Button Could Be a Trap, Researchers Warn

SlashDot - Mon, 06/16/2025 - 15:35
Researchers are cautioning users against clicking unsubscribe links embedded in email bodies, citing new data showing such actions can expose recipients to malicious websites and confirm active email addresses to attackers. DNSFilter found that one in every 644 clicks on unsubscribe links leads users to potentially malicious websites. "You've left the safe, structured environment of your email client and entered the open web," TK Keanini, DNSFilter's chief technology officer, told WSJ. The risks range from confirming to bad actors that an email address belongs to an active user to redirecting victims to fake websites designed to steal login credentials or install malware. Clicking such links "can make you a bigger target in the future," said Michael Bargury, CTO of security company Zenity.

The Tick Situation Is Getting Worse

NY Times - Mon, 06/16/2025 - 15:02
As temperatures rise, ticks of several kinds are flourishing in ways that threaten people’s health.

Dutch Court Confirms Apple Abused Dominant Position in Dating Apps

SlashDot - Mon, 06/16/2025 - 14:58
A Dutch court on Monday confirmed a consumer watchdog's 2021 ruling that Apple had abused its dominant position by imposing unfair conditions on providers of dating apps in the App Store. From a report: The Rotterdam District Court ruled that the Dutch Authority for Consumers and Markets (ACM) was therefore right to impose an order subject to a penalty for non-compliance. The court ruled that ACM was right in finding that dating app providers had to use Apple's own payment system, were not allowed to refer to payment options outside the App Store, and had to pay a 30% commission (15% for small providers) to Apple.

California’s Wildfires Could Be Brutal This Summer

NY Times - Mon, 06/16/2025 - 14:21
Experts say there could be more large wildfires than usual this year. Here’s why.

Windows Hello Face Unlock No Longer Works in the Dark and Microsoft Says It's Not a Bug

SlashDot - Mon, 06/16/2025 - 14:10
Microsoft has disabled Windows Hello's ability to authenticate users in low-light environments through a recent security update that now requires both infrared sensors and color cameras to verify faces. The change forces the system to see a visible face through the webcam before completing authentication with IR sensors. Windows Hello previously relied solely on infrared sensors to create 3D facial scans, allowing the feature to work in complete darkness, similar to the iPhone's Face ID. Microsoft pushed the dual-camera requirement to address a spoofing vulnerability in the biometric system.

Terry Moran Says He Doesn’t Regret Posts Criticizing Trump Administration

NY Times - Mon, 06/16/2025 - 14:05
In his first interview since losing his job at ABC News, the longtime TV correspondent, newly popular on Substack, says he does not regret his social media post criticizing the Trump administration.

Japan Builds Near $700 Million Fund To Lure Foreign Academic Talent

SlashDot - Mon, 06/16/2025 - 13:32
An anonymous reader shares a report: Japan is the latest nation hoping to tempt disgruntled US researchers alarmed by the Trump administration's hostile attitude to academia to relocate to the Land of the Rising Sun. The Japanese government aims to create an elite research environment, and has detailed a $693 million package to attract researchers from abroad, including those from America who may have seen their budgets slashed or who fear a clampdown on their academic freedom.

Researchers Create World's First Completely Verifiable Random Number Generator

SlashDot - Mon, 06/16/2025 - 12:56
Researchers have built a breakthrough random number generator that solves a critical problem: for the first time, every step of creating random numbers can be independently verified and audited, with quantum physics guaranteeing the numbers were truly unpredictable. Random numbers are essential for everything from online banking encryption to fair lottery drawings, but current systems have serious limitations. Computer-based generators follow predictable algorithms -- if someone discovers the starting conditions, they can predict all future outputs. Hardware generators that measure physical processes like electronic noise can't prove their randomness wasn't somehow predetermined or tampered with. The new system, developed by teams at the University of Colorado Boulder and the National Institute of Standards and Technology, uses quantum entanglement -- Einstein's "spooky action at a distance" -- to guarantee unpredictability. The setup creates pairs of photons that share quantum properties, then sends them to measurement stations 110 meters apart. When researchers measure each photon's properties, quantum mechanics ensures the results are fundamentally random and cannot be influenced by any classical communication between the stations. The team created a system called "Twine" that distributes the random number generation process across multiple independent parties, with each step recorded in tamper-proof digital ledgers called hash chains. This means no single organization controls the entire process, and anyone can verify that proper procedures were followed. During a 40-day demonstration, the system successfully generated random numbers in 7,434 of 7,454 attempts -- a 99.7% success rate. Each successful run produced 512 random bits with mathematical certainty of randomness bounded by an error rate of 2^-64, an extraordinarily high level of confidence.
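The "hash chain" idea mentioned above can be illustrated with a minimal sketch. This is not the Twine protocol itself, whose details are not given here; it is a generic hash chain using SHA-256, and the record fields and function names (`record_hash`, `append`, `verify`) are hypothetical. Each record stores the hash of the record before it, so changing any earlier entry breaks every later link.

```python
import hashlib
import json

GENESIS = "0" * 64  # placeholder "previous hash" for the first record (assumption)


def record_hash(prev_hash: str, payload: dict) -> str:
    """Hash a record's payload together with the previous record's hash."""
    data = json.dumps({"prev": prev_hash, "payload": payload}, sort_keys=True)
    return hashlib.sha256(data.encode("utf-8")).hexdigest()


def append(chain: list, payload: dict) -> None:
    """Add a record that commits to everything recorded before it."""
    prev_hash = chain[-1]["hash"] if chain else GENESIS
    chain.append({
        "prev": prev_hash,
        "payload": payload,
        "hash": record_hash(prev_hash, payload),
    })


def verify(chain: list) -> bool:
    """Recompute every link; any edited record makes verification fail."""
    prev_hash = GENESIS
    for rec in chain:
        if rec["prev"] != prev_hash:
            return False
        if rec["hash"] != record_hash(prev_hash, rec["payload"]):
            return False
        prev_hash = rec["hash"]
    return True


# Hypothetical steps of one generation round, logged in order.
chain = []
append(chain, {"step": "request", "round": 1})
append(chain, {"step": "measurement", "round": 1})
append(chain, {"step": "extraction", "round": 1})
```

Anyone holding a copy of the chain can call `verify(chain)` without trusting whoever wrote it, which is the property the summary describes as letting outsiders check that proper procedures were followed.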

A Timeline of the Minnesota Shooting

NY Times - Mon, 06/16/2025 - 00:38
A manhunt is underway for a man suspected in the killing on Saturday of a state lawmaker and her husband and in the shooting of another lawmaker and his wife. Here is how the events unfolded.

Minnesota Shootings Suspect Had a Notebook With 70 Potential Targets

NY Times - Mon, 06/16/2025 - 00:17
The tally, which included politicians, community and business leaders, and locations for Planned Parenthood, was recovered in a car linked to the attacks.

Can Labubu, This Ugly Elf, Make China Cool?

NY Times - Mon, 06/16/2025 - 00:01
China has long struggled to improve its image, especially in the West. It may be scoring some victories now.

Randi Weingarten Quits D.N.C. Post in Dispute With Chairman

NY Times - Sun, 06/15/2025 - 22:36
Randi Weingarten, head of one of the nation’s most influential teachers unions, and Lee Saunders, the president of a large union of public workers, each pointed to Ken Martin’s leadership.

‘I’m an American, Bro!’: Latinos Report Raids in Which U.S. Citizenship Is Questioned

NY Times - Sun, 06/15/2025 - 22:25
A raid in Montebello, Calif., has stirred fears that federal agents are detaining and racially profiling U.S. citizens of Hispanic descent.

Oil Prices Climb Further After Israel Strikes Iran’s Energy Assets

NY Times - Sun, 06/15/2025 - 21:36
U.S. oil prices already jumped last week, which could cause prices at the pump to rise about 20 cents a gallon in the coming weeks, according to one estimate.
