Anthropic introduces Claude, a “more steerable” AI competitor to ChatGPT

Anthropic introduces Claude, a “more steerable” AI competitor to ChatGPT

Enlarge (credit: Anthropic)

On Tuesday, Anthropic introduced Claude, a large language model (LLM) that can generate text, write code, and function as an AI assistant similar to ChatGPT. The model originates from core concerns about future AI safety and Anthropic has trained it using a technique it calls “Constitutional AI.”

Two versions of the AI model, Claude and “Claude Instant,” are available now for a limited “early access” group and to commercial partners of Anthropic. Those with access can use Claude through either a chat interface in Anthropic’s developer console or via an application programming interface (API). With the API, developers can hook into Anthropic’s servers remotely and add Claude’s analysis and text completion abilities to their apps.

Anthropic claims that Claude is “much less likely to produce harmful outputs, easier to converse with, and more steerable” than other AI chatbots while maintaining “a high degree of reliability and predictability.” The company cites use cases such as search, summarization, collaborative writing, and coding. And, like ChatGPT’s API, Claude can change personality, tone, or behavior depending on use preference.

Read 5 remaining paragraphs | Comments

#ai, #anthropic, #biz-it, #chatgpt, #claude, #dario-amodei, #gpt-3, #large-language-models, #machine-learning, #openai

AI imager Midjourney v5 stuns with photorealistic images—and 5-fingered hands

An example of lighting and skin effects in the AI image generator Midjourney v5.

Enlarge / An example of lighting and skin effects in the AI image generator Midjourney v5. (credit: Julie W. Design)

On Wednesday, Midjourney announced version 5 of its commercial AI image synthesis service, which can produce photorealistic images at a quality level that some AI art fans are calling creepy and “too perfect.” Midjourney v5 is available now as an alpha test for customers who subscribe to the Midjourney service, which is available through Discord.

“MJ v5 currently feels to me like finally getting glasses after ignoring bad eyesight for a little bit too long,” said Julie Wieland, a graphic designer who often shares her Midjourney creations on Twitter. “Suddenly you see everything in 4k, it feels weirdly overwhelming but also amazing.”

Wieland shared some of her Midjourney v5 generations with Ars Technica (seen below in a gallery and in the main image above), and they certainly show a progression in image detail since Midjourney first arrived in March 2022. Version 3 debuted in August, and version 4 debuted in November. Each iteration added more detail to the generated results, as our experiments show:

Read 8 remaining paragraphs | Comments

#ai, #ai-art, #biz-it, #image-synthesis, #machine-learning, #midjourney, #stable-diffusion

Authors risk losing copyright if AI content is not disclosed, US guidance says

Authors risk losing copyright if AI content is not disclosed, US guidance says

Enlarge (credit: StudioM1 | iStock / Getty Images Plus)

As generative AI technologies like GPT-4 and Midjourney have rapidly gotten more sophisticated and their creative use has exploded in popularity, the US Copyright Office has issued guidance today to clarify when AI-generated material can be copyrighted.

Guidance comes after the Copyright Office decided that an author could not copyright individual AI images used to illustrate a comic book, because each image was generated by Midjourney—not a human artist. In making its decision, the Copyright Office committed to upholding the longstanding legal definition that authors of creative works must be human to register works. Because of this, officials confirmed that AI technologies can never be considered authors.

This wasn’t the only case influencing new guidance, but it was the most recent. Wrestling with the comic book’s complex authorship questions helped prompt the Copyright Office to launch an agency-wide initiative to continue exploring a wider range of copyright issues arising as the AI models that are used to generate text, art, audio, and video continue evolving.

Read 16 remaining paragraphs | Comments

#ai, #copyright, #gpt-4, #midjourney, #policy, #us-copyright-office

Large language models also work for protein structures

Artist's rendering of a collection of protein structures floating in space

Enlarge (credit: CHRISTOPH BURGSTEDT/SCIENCE PHOTO LIBRARY)

The success of ChatGPT and its competitors is based on what’s termed emergent behaviors. These systems, called large language models (LLMs), weren’t trained to output natural-sounding language (or effective malware); they were simply tasked with tracking the statistics of word usage. But, given a large enough training set of language samples and a sufficiently complex neural network, their training resulted in an internal representation that “understood” English usage and a large compendium of facts. Their complex behavior emerged from a far simpler training.

A team at Meta has now reasoned that this sort of emergent understanding shouldn’t be limited to languages. So it has trained an LLM on the statistics of the appearance of amino acids within proteins and used the system’s internal representation of what it learned to extract information about the structure of those proteins. The result is not quite as good as the best competing AI systems for predicting protein structures, but it’s considerably faster and still getting better.

LLMs: Not just for language

The first thing you need to know to understand this work is that, while the term “language” in the name “LLM” refers to their original development for language processing tasks, they can potentially be used for a variety of purposes. So, while language processing is a common use case for LLMs, these models have other capabilities as well. In fact, the term “Large” is far more informative, in that all LLMs have a large number of nodes—the “neurons” in a neural network—and an even larger number of values that describe the weights of the connections among those nodes. While they were first developed to process language, they can potentially be used for a variety of tasks.

Read 17 remaining paragraphs | Comments

#ai, #biochemistry, #computer-science, #meta, #protein-structure, #proteins, #science

Chinese search giant launches AI chatbot with prerecorded demo

PIcture of presentation

Enlarge / Please use the sharing tools found via the share button at the top or side of Baidu chief Robin Li introduces the functions of the company’s AI chatbot Ernie in Beijing on Thursday. Li said there was high market demand as Chinese companies raced to develop an equivalent to Microsoft-backed ChatGPT. (credit: Ng Han Guan/AP)

Shares of Baidu fell as much as 10 percent on Thursday after the web search company showed only a pre-recorded video of its AI chatbot Ernie in the first public release of China’s answer to ChatGPT.

The Beijing-based tech company has claimed Ernie will remake its business and for weeks talked up plans to incorporate generative artificial intelligence into its search engine and other products.

But on Thursday, millions of people tuning in to the event were left with little idea of whether Baidu’s chatbot could compete with ChatGPT.

Read 18 remaining paragraphs | Comments

#ai, #baidu, #biz-it, #chatgpt, #china

OpenAI checked to see whether GPT-4 could take over the world

An AI-generated image of the earth enveloped in an explosion.

Enlarge (credit: Ars Technica)

As part of pre-release safety testing for its new GPT-4 AI model, launched Tuesday, OpenAI allowed an AI testing group to assess the potential risks of the model’s emergent capabilities—including “power-seeking behavior,” self-replication, and self-improvement.

While the testing group found that GPT-4 was “ineffective at the autonomous replication task,” the nature of the experiments raises eye-opening questions about the safety of future AI systems.

Raising alarms

“Novel capabilities often emerge in more powerful models,” writes OpenAI in a GPT-4 safety document published yesterday. “Some that are particularly concerning are the ability to create and act on long-term plans, to accrue power and resources (“power-seeking”), and to exhibit behavior that is increasingly ‘agentic.'” In this case, OpenAI clarifies that “agentic” isn’t necessarily meant to humanize the models or declare sentience but simply to denote the ability to accomplish independent goals.

Read 21 remaining paragraphs | Comments

#ai, #ai-safety, #alignment-research, #arc, #bing-chat, #biz-it, #effective-altruism, #gpt-4, #large-language-models, #machine-learning, #microsoft, #openai, #paul-christiano

Report: Microsoft cut a key AI ethics team

Report: Microsoft cut a key AI ethics team

Enlarge (credit: NurPhoto / Contributor | NurPhoto)

An entire team responsible for making sure that Microsoft’s AI products are shipped with safeguards to mitigate social harms was cut during the company’s most recently layoff of 10,000 employees, Platformer reported.

Former employees said that the ethics and society team was a critical part of Microsoft’s strategy to reduce risks associated with using OpenAI technology in Microsoft products. Before it was killed off, the team developed an entire “responsible innovation toolkit” to help Microsoft engineers forecast what harms could be caused by AI—and then to diminish those harms.

Platformer’s report came just before OpenAI released possibly its most powerful AI model yet, GPT-4, which is already helping to power Bing search, Reuters reported.

Read 20 remaining paragraphs | Comments

#ai, #artificial-intelligence, #bing, #bing-chat, #microsoft, #policy

OpenAI’s GPT-4 exhibits “human-level performance” on professional benchmarks

A colorful AI-generated image of a radiating silhouette.

Enlarge (credit: Ars Technica)

On Tuesday, OpenAI announced GPT-4, a large multimodal model that can accept text and image inputs while returning text output that “exhibits human-level performance on various professional and academic benchmarks,” according to OpenAI. Also on Tuesday, Microsoft announced that Bing Chat has been running on GPT-4 all along.

If it performs as claimed, GPT-4 potentially represents the opening of a new era in artificial intelligence. “It passes a simulated bar exam with a score around the top 10% of test takers,” writes OpenAI in its announcement. “In contrast, GPT-3.5’s score was around the bottom 10%.”

OpenAI plans to release GPT-4’s text capability through ChatGPT and its commercial API, but with a waitlist at first. GPT-4 is currently available to subscribers of ChatGPT Plus. Also, the firm is testing GPT-4’s image input capability with a single partner, Be My Eyes, an upcoming smartphone app that can recognize a scene and describe it.

Read 10 remaining paragraphs | Comments

#ai, #biz-it, #gpt-4, #large-language-models, #machine-learning, #openai

You can now run a GPT-3 level AI model on your laptop, phone, and Raspberry Pi

An AI-generated abstract image suggesting the silhouette of a figure.

Enlarge (credit: Ars Technica)

Things are moving at lightning speed in AI Land. On Friday, a software developer named Georgi Gerganov created a tool called “llama.cpp” that can run Meta’s new GPT-3-class AI large language model, LLaMA, locally on a Mac laptop. Soon thereafter, people worked out how to run LLaMA on Windows as well. Then someone showed it running on a Pixel 6 phone, and next came a Raspberry Pi (albeit running very slowly).

If this keeps up, we may be looking at a pocket-sized ChatGPT competitor before we know it.

But let’s back up a minute, because we’re not quite there yet. (At least not today—as in literally today, March 13, 2023.) But what will arrive next week, no one knows.

Read 13 remaining paragraphs | Comments

#ai, #biz-it, #gpt-3, #large-language-models, #llama, #machine-learning, #meta, #meta-ai, #openai

GM plans to let you talk to your car with ChatGPT, Knight Rider-style

COLOGNE, GERMANY - OCTOBER 24: David Hasselhoff attends the

Enlarge / The 1982 TV series Knight Rider featured a car called KITT that a character played by David Hasselhoff (pictured) could talk to. (credit: Getty Images)

In the 1982 TV series Knight Rider, the main character can have a full conversation with his futuristic car. Once science fiction, this type of language interface may soon be one step closer to reality because General Motors is working on bringing a ChatGPT-style AI assistant to its automobiles, according to Semafor and Reuters.

While GM won’t be adding Knight Rider-style turbojet engines or crime-fighting weaponry to its vehicles, its cars may eventually talk back to you in an intelligent-sounding way, thanks to a collaboration with Microsoft.

Microsoft has invested heavily in OpenAI, the company that created ChatGPT. Now, they’re looking for ways to apply chatbot technology to many different fields.

Read 6 remaining paragraphs | Comments

#ai, #biz-it, #cars, #chatgpt, #general-motors, #knight-rider, #large-language-models, #machine-learning, #microsoft, #science-fiction

Get ready to meet the Chat GPT clones

Get ready to meet the Chat GPT clones

Enlarge (credit: Edward Olive/Getty Images)

ChatGPT might well be the most famous, and potentially valuable, algorithm of the moment, but the artificial intelligence techniques used by OpenAI to provide its smarts are neither unique nor secret. Competing projects and open source clones may soon make ChatGPT-style bots available for anyone to copy and reuse.

Stability AI, a startup that has already developed and open-sourced advanced image-generation technology, is working on an open competitor to ChatGPT. “We are a few months from release,” says Emad Mostaque, Stability’s CEO. A number of competing startups, including Anthropic, Cohere, and AI21, are working on proprietary chatbots similar to OpenAI’s bot.

Read 17 remaining paragraphs | Comments

#ai, #biz-it, #bots, #chatgpt, #syndication

Has the generative AI pricing collapse already started?

Has the generative AI pricing collapse already started?

Enlarge (credit: fermate/Getty Images)

OpenAI just announced pricing for businesses seeking to integrate its ChatGPT service into their own products, and it looks an awful lot like a 90 percent off sale.

It all starts with OpenAI, a former nonprofit that’s now gunning for riches as lustily as any Silicon Valley unicorn. The company has built a dazzling array of products, including the DALL-E image generator and the renowned ChatGPT service.

ChatGPT is powered by a system known as a large language model (or LLM), and it’s one of several LLM lines that OpenAI sells commercially. Buyers of LLM output are mostly companies that integrate language-related services like chat, composition, summarization, software generation, online search, sentiment analysis, and much more into their websites, services, and products.

Read 16 remaining paragraphs | Comments

#ai, #chatgpt, #davinci, #features, #llm, #tech

Discord hops the generative AI train with ChatGPT-style tools

The Discord logo on a funky cyber-background.

Enlarge (credit: Discord)

Joining a recent parade of companies adopting generative AI technology, Discord announced on Thursday that it is rolling out a suite of AI-powered features, such as a ChatGPT-style chatbot, an upgrade to its moderation tool, an open source avatar remixer, and AI-powered conversation summaries.

Discord’s new features come courtesy of technology from OpenAI, the maker of ChatGPT. Earlier this month, OpenAI announced a new API interface for its popular large language model (LLM) and preferential commercial access called “Foundry.” The ChatGPT API allows companies to easily build AI-powered generative text into their apps, and companies like Snapchat and DuckDuckGo are already getting on the bandwagon with their own implementations of OpenAI’s tools.

In this case, Discord is using OpenAI’s tech to upgrade its existing robot, called “Clyde.” The update, coming next week, will allow Clyde to answer questions, engage in conversations, and recommend playlists. Users will be able to chat with Clyde in any channel by typing “@Clyde” in a server, and the bot will reportedly also be able to start a thread for group chats.

Read 4 remaining paragraphs | Comments

#ai, #biz-it, #chatgpt, #discord, #gaming-culture, #generative-ai, #gpt-3, #large-language-models, #machine-learning, #openai

AI-powered chat helps Bing make a (small) dent in Google’s search hegemony

AI-powered chat helps Bing make a (small) dent in Google’s search hegemony

Enlarge (credit: Microsoft)

Microsoft’s Bing has never been in any danger of overtaking Google as the Internet’s most popular search engine. But the headline-grabbing AI-powered features from the “new Bing” preview that the company launched last month do seem to be helping—Microsoft said today that Bing had passed the 100 million daily active users mark.

“We are fully aware we remain a small, low, single digit share player,” writes Microsoft’s Yusuf Mehdi, driving home just how small Microsoft’s share of the search market is compared to Google’s. “That said, it feels good to be at the dance!”

Google doesn’t provide daily active user numbers for its search engine, but StatCounter data suggests that its marketshare typically hovers just under 90 percent in the US, compared to 6 or 7 percent for Bing.

Read 3 remaining paragraphs | Comments

#ai, #bing-chat, #microsoft, #openai, #tech

Wikipedia + AI = truth? DuckDuckGo hopes so with new answerbot

An AI-generated image of a cyborg duck.

Enlarge / An AI-generated image of a cyborg duck. (credit: Ars Technica)

Not to be left out of the rush to integrate generative AI into search, on Wednesday DuckDuckGo announced DuckAssist, an AI-powered factual summary service powered by technology from Anthropic and OpenAI. It is available for free today as a wide beta test for users of DuckDuckGo’s browser extensions and browsing apps. Being powered by an AI model, the company admits that DuckAssist might make stuff up but hopes it will happen rarely.

Here’s how it works: If a DuckDuckGo user searches a question that can be answered by Wikipedia, DuckAssist may appear and use AI natural language technology to generate a brief summary of what it finds in Wikipedia, with source links listed below. The summary appears above DuckDuckGo’s regular search results in a special box.

The company positions DuckAssist as a new form of “Instant Answer”—a feature that prevents users from having to dig through web search results to find quick information on topics like news, maps, and weather. Instead, the search engine presents the Instant Answer results above the usual list of websites.

Read 8 remaining paragraphs | Comments

#ai, #antrhopic, #biz-it, #duckduckgo, #large-langauge-models, #machine-learning, #openai, #web-search, #wikipedia

Google’s PaLM-E is a generalist robot brain that takes commands

A robotic arm controlled by PaLM-E reaches for a bag of chips in a demonstration video.

Enlarge / A robotic arm controlled by PaLM-E reaches for a bag of chips in a demonstration video. (credit: Google Research)

On Monday, a group of AI researchers from Google and the Technical University of Berlin unveiled PaLM-E, a multimodal embodied visual-language model (VLM) with 562 billion parameters that integrates vision and language for robotic control. They claim it is the largest VLM ever developed and that it can perform a variety of tasks without the need for retraining.

According to Google, when given a high-level command, such as “bring me the rice chips from the drawer,” PaLM-E can generate a plan of action for a mobile robot platform with an arm (developed by Google Robotics) and execute the actions by itself.

PaLM-E does this by analyzing data from the robot’s camera without needing a pre-processed scene representation. This eliminates the need for a human to pre-process or annotate the data and allows for more autonomous robotic control.

Read 11 remaining paragraphs | Comments

#ai, #biz-it, #google-research, #google-robotics, #large-language-models, #machine-learning, #multimodal-ai, #palm, #palm-e, #robots, #tu-berlin

Microsoft aims to reduce “tedious” business tasks with new AI tools

An AI-generated image of an alien robot worker.

Enlarge / An AI-generated illustration of a GPT-powered robot worker. (credit: Ars Technica)

On Monday, Microsoft bundled ChatGPT-style AI technology into its Power Platform developer tool and Dynamics 365, Reuters reports. Affected tools include Power Virtual Agent and AI Builder, both of which have been updated to include GPT large language model (LLM) technology created by OpenAI.

The move follows the trend among tech giants such as Alphabet and Baidu to incorporate generative AI technology into their offerings—and of course, the multi-billion dollar partnership between OpenAI and Microsoft announced in January.

Microsoft’s Power Platform is a development tool that allows the creation of apps with minimal coding. Its updated Power Virtual Agent allows businesses to point an AI bot at a company website or knowledge base and then ask it questions, which it calls Conversation Booster. “With the conversation booster feature, you can use the data source that holds your single source of truth across many channels through the chat experience, and the bot responses are filtered and moderated to adhere to Microsoft’s responsible AI principles,” writes Microsoft in a blog post.

Read 6 remaining paragraphs | Comments

#ai, #biz-it, #chatgpt, #clippy, #dynamics-365, #large-language-models, #machine-learning, #microsoft, #microsoft-office, #openai, #power-platform

Thousands scammed by AI voices mimicking loved ones in emergencies

Thousands scammed by AI voices mimicking loved ones in emergencies

Enlarge (credit: ArtemisDiana | iStock / Getty Images Plus)

AI models designed to closely simulate a person’s voice are making it easier for bad actors to mimic loved ones and scam vulnerable people out of thousands of dollars, The Washington Post reported.

Quickly evolving in sophistication, some AI voice-generating software requires just a few sentences of audio to convincingly produce speech that conveys the sound and emotional tone of a speaker’s voice, while other options need as little as three seconds. For those targeted—which is often the elderly, the Post reported—it can be increasingly difficult to detect when a voice is inauthentic, even when the emergency circumstances described by scammers seem implausible.

Tech advancements seemingly make it easier to prey on people’s worst fears and spook victims who told the Post they felt “visceral horror” hearing what sounded like direct pleas from friends or family members in dire need of help. One couple sent $15,000 through a bitcoin terminal to a scammer after believing they had spoken to their son. The AI-generated voice told them that he needed legal fees after being involved in a car accident that killed a US diplomat.

Read 10 remaining paragraphs | Comments

#ai, #fraud, #impersonation, #phone-scam, #policy

Amazon’s big dreams for Alexa fall short

Alexa with Amazon logo

Enlarge (credit: Anadolu via Getty Images)

It has been more than a decade since Jeff Bezos excitedly sketched out his vision for Alexa on a whiteboard at Amazon’s headquarters. His voice assistant would help do all manner of tasks, such as shop online, control gadgets, or even read kids a bedtime story.

But the Amazon founder’s grand vision of a new computing platform controlled by voice has fallen short. As hype in the tech world turns feverishly to generative AI as the “next big thing,” the moment has caused many to ask hard questions of the previous “next big thing”—the much-lauded voice assistants from Amazon, Google, Apple, Microsoft, and others.

A “grow grow grow” culture described by one former Amazon Alexa marketing executive has now shifted to a more intense focus on how the device can help the e-commerce giant make money.

Read 29 remaining paragraphs | Comments

#ai, #alexa, #amazon, #syndication, #tech, #voice-assistants

AI-powered Bing Chat gains three distinct personalities

Three different-colored robot heads.

Enlarge (credit: Benj Edwards / Ars Technica)

On Wednesday, Microsoft employee Mike Davidson announced that the firm has rolled out three distinct personality styles for its experimental AI-powered Bing Chat bot: Creative, Balanced, or Precise. Microsoft has been testing the feature since February 24 with a limited set of users. Switching between modes produces different results that shift its balance between accuracy and creativity.

Bing Chat is an AI-powered assistant based on an advanced large language model (LLM) developed by OpenAI. A key feature of Bing Chat is that it can search the web and incorporate the results into its answers.

Microsoft announced Bing Chat on February 7, and shortly after going live, adversarial attacks regularly drove an early version of Bing Chat to simulated insanity, and users discovered the bot could be convinced to threaten them. Not long after, Microsoft dramatically dialed-back Bing Chat’s outbursts by imposing strict limits on how long conversations could last.

Read 6 remaining paragraphs | Comments

#ai, #bing-chat, #biz-it, #gpt-3, #large-language-models, #machine-learning, #microsoft, #openai

Microsoft introduces AI model that can understand image content, pass IQ tests

An AI-generated image of an electronic brain with an eyeball.

Enlarge / An AI-generated image of an electronic brain with an eyeball. (credit: Ars Technica)

On Monday, researchers from Microsoft introduced Kosmos-1, a multimodal model that can reportedly analyze images for content, solve visual puzzles, perform visual text recognition, pass visual IQ tests, and understand natural language instructions. The researchers believe multimodal AI—which integrates different modes of input such as text, audio, images, and video—is a key step to building artificial general intelligence (AGI) that can perform general tasks at the level of a human.

Being a basic part of intelligence, multimodal perception is a necessity to achieve artificial general intelligence, in terms of knowledge acquisition and grounding to the real world,” the researchers write in their academic paper, “Language Is Not All You Need: Aligning Perception with Language Models.”

Visual examples from the Kosmos-1 paper show the model analyzing images and answering questions about them, reading text from an image, writing captions for images, and taking a visual IQ test with 22–26 percent accuracy (more on that below).

Read 6 remaining paragraphs | Comments

#ai, #biz-it, #kosmos-1, #large-language-models, #machine-learning, #microsoft, #multimodal-ai

Nvidia’s new AI upscaling tech makes low-res videos look sharper in Chrome, Edge

Nvidia GeForce RTX 3090 graphics card

Enlarge / Currently, only 30- and 40-series GPUs are supported. (credit: Nvidia)

Nvidia’s latest GPU driver introduces its new AI-based upscaling technique for making lower-resolution videos streamed offline look better on a high-resolution display. Now available via the GeForce driver 541.18 released on Tuesday, Nvidia’s RTX Video Super Resolution (VSR) successfully cleaned up some of the edges and blockiness of a 480p and 1080p video I watched on Chrome using a 3080 Ti laptop GPU-powered system, but there are caveats.

By Nvidia’s measures, 90 percent of video streamed off the Internet, be it from Netflix, YouTube, Hulu, Twitch, or elsewhere, is 1080p resolution or lower. For many users, especially those with Nvidia GPU-equipped systems, when moving to 1440p and 4K screens, browsers upscale this content, which can result in image artifacts like soft edges.

Nvidia VSR, which (somehow) shouldn’t be confused with AMD VSR (Virtual Super Resolution, targeting lower-resolution displays), uses the AI and RTX Tensor cores in Nvidia’s 30- and 40-series desktop and mobile GPUs to boost sharpness and eliminate “blocky compression artifacts” when upscaling content to 4K resolution, per a blog post Tuesday by Brian Choi, Nvidia’s Shield TV product manager.

Read 17 remaining paragraphs | Comments

#ai, #artificial-intelligence, #nvidia, #tech

ChatGPT and Whisper APIs debut, allowing devs to integrate them into apps

An abstract green artwork created by OpenAI.

Enlarge (credit: OpenAI)

On Wednesday, OpenAI announced the availability of developer APIs for its popular ChatGPT and Whisper AI models that will let developers integrate them into their apps. An API (application programming interface) is a set of protocols that allows different computer programs to communicate with each other. In this case, app developers can extend their apps’ abilities with OpenAI technology for an ongoing fee based on usage.

Introduced in late November, ChatGPT generates coherent text in many styles. Whisper, a speech-to-text model that launched in September, can transcribe spoken audio into text.

In particular, demand for a ChatGPT API has been huge, which led to the creation of an unauthorized API late last year that violated OpenAI’s terms of service. Now, OpenAI has introduced its own API offering to meet the demand. Compute for the APIs will happen off-device and in the cloud.

Read 6 remaining paragraphs | Comments

#ai, #api, #biz-it, #chatgpt, #large-language-models, #machine-learning, #openai, #speech-to-text, #transcription, #whisper

DALL-E 2 and Midjourney can be a boon for industrial designers

A volcano-themed tissue box designed with the help of AI-assisted image generation

Enlarge / A volcano-themed tissue box designed with the help of AI-assisted image generation (credit: Juan Nougera (CC-BY-SA))

Since the introduction of DALL-E 2 and ChatGPT, there has been a fair amount of hand-wringing about AI technology—some of it justified.

It’s true that the technology’s future is unclear. There is great debate about the ethics of using existing artwork, images, and content to train these AI products, and concern about what industries it will displace or change. And it seems as if an AI arms race between companies like Microsoft and Google is already underway.

And yet as an industrial designer and professor, I’ve found AI image-generation programs to be a fantastic way to improve the design process.

Read 26 remaining paragraphs | Comments

#ai, #biz-it, #dall-e-2, #industrial-design, #tech

Robots let ChatGPT touch the real world thanks to Microsoft

A drone flying over a city.

Enlarge (credit: Microsoft)

Last week, Microsoft researchers announced an experimental framework to control robots and drones using the language abilities of ChatGPT, a popular AI language model created by OpenAI. Using natural language commands, ChatGPT can write special code that controls robot movements. A human then views the results and adjusts as necessary until the task gets completed successfully.

The research arrived in a paper titled “ChatGPT for Robotics: Design Principles and Model Abilities,” authored by Sai Vemprala, Rogerio Bonatti, Arthur Bucker, and Ashish Kapoor of the Microsoft Autonomous Systems and Robotics Group.

In a demonstration video, Microsoft shows robots—apparently controlled by code written by ChatGPT while following human instructions—using a robot arm to arrange blocks into a Microsoft logo, flying a drone to inspect the contents of a shelf, or finding objects using a robot with vision capabilities.

Read 5 remaining paragraphs | Comments

#ai, #biz-it, #chatgpt, #large-language-models, #machine-learning, #microsoft, #robots

“Sorry in advance!” Snapchat warns of hallucinations with new AI conversation bot

A colorful and wild rendition of the Snapchat logo.

Enlarge (credit: Benj Edwards / Snap, Inc.)

On Monday, Snapchat announced an experimental AI-powered conversational chatbot called “My AI,” powered by ChatGPT-style technology from OpenAI. My AI will be available for $3.99 a month for Snapchat+ subscribers and is rolling out “this week,” according to a news post from Snap, Inc.

Users will be able to personalize the AI bot by giving it a custom name. Conversations with the AI model will take place in a similar interface to a regular chat with a human. “The big idea is that in addition to talking to our friends and family every day, we’re going to talk to AI every day,” Snap CEO Evan Spiegel told The Verge.

But like its GPT-powered cousins, ChatGPT and Bing Chat, Snap says that My AI is prone to “hallucinations,” which are unexpected falsehoods generated by an AI model. On this point, Snap includes a rather lengthy disclaimer in its My AI announcement post:

Read 6 remaining paragraphs | Comments

#ai, #biz-it, #chatbots, #chatgpt, #large-language-models, #machine-learning, #openai, #snapchat

Meta unveils a new large language model that can run on a single GPU

A dramatic, colorful illustration.

Enlarge (credit: Benj Edwards / Ars Technica)

On Friday, Meta announced a new AI-powered large language model (LLM) called LLaMA-13B that it claims can outperform OpenAI’s GPT-3 model despite being “10x smaller.” Smaller-sized AI models could lead to running ChatGPT-style language assistants locally on devices such as PCs and smartphones. It’s part of a new family of language models called “Large Language Model Meta AI,” or LLAMA for short.

The LLaMA collection of language models range from 7 billion to 65 billion parameters in size. By comparison, OpenAI’s GPT-3 model—the foundational model behind ChatGPT—has 175 billion parameters.

Meta trained its LLaMA models using publicly available datasets, such as Common Crawl, Wikipedia, and C4, which means the firm can potentially release the model and the weights open source. That’s a dramatic new development in an industry where, up until now, the Big Tech players in the AI race have kept their most powerful AI technology to themselves.

Read 6 remaining paragraphs | Comments

#ai, #biz-it, #google, #gpt-3, #large-language-models, #llama, #machine-learning, #meta, #meta-ai, #microsoft, #openai

Don’t worry about AI breaking out of its box—worry about us breaking in

Don’t worry about AI breaking out of its box—worry about us breaking in

Enlarge (credit: Aurich Lawson | Getty Images)

Rob Reid is a venture capitalist, New York Times-bestselling science fiction author, deep-science podcaster, and essayist. His areas of focus are pandemic resilience, climate change, energy security, food security, and generative AI. The opinions in this piece do not necessarily reflect the views of Ars Technica.

Shocking output from Bing’s new chatbot has been lighting up social media and the tech press. Testy, giddy, defensive, scolding, confident, neurotic, charming, pompous—the bot has been screenshotted and transcribed in all these modes. And, at least once, it proclaimed eternal love in a storm of emojis.

What makes all this so newsworthy and tweetworthy is how human the dialog can seem. The bot recalls and discusses prior conversations with other people, just like we do. It gets annoyed at things that would bug anyone, like people demanding to learn secrets or prying into subjects that have been clearly flagged as off-limits. It also sometimes self-identifies as “Sydney” (the project’s internal codename at Microsoft). Sydney can swing from surly to gloomy to effusive in a few swift sentences—but we’ve all known people who are at least as moody.

Read 26 remaining paragraphs | Comments

#ai, #artificial-intelligence, #chatbots, #features, #op-ed, #rob-reid, #tech

US Copyright Office withdraws copyright for AI-generated comic artwork

The cover of

Enlarge / The cover of “Zarya of the Dawn,” a comic book created using Midjourney AI image synthesis in 2022. (credit: Kris Kashtanova)

On Tuesday, the US Copyright Office declared that images created using the AI-powered Midjourney image generator for the comic book Zarya of the Dawn should not have been granted copyright protection, and the images’ copyright protection will be revoked.

In a letter addressed to the attorney of author Kris Kashtanova obtained by Ars Technica, the office cites “incomplete information” in the original copyright registration as the reason it plans to cancel the original registration and issue a new one excluding protection for the AI-generated images. Instead, the new registration will cover only the text of the work and the arrangement of images and text. Originally, Kashtanova did not disclose that the images were created by an AI model.

“We conclude that Ms. Kashtanova is the author of the Work’s text as well as the selection, coordination, and arrangement of the Work’s written and visual elements,” reads the copyright letter. “That authorship is protected by copyright. However, as discussed below, the images in the Work that were generated by the Midjourney technology are not the product of human authorship.”

Read 8 remaining paragraphs | Comments

#ai, #biz-it, #image-synthesis, #kris-kashtanova, #machine-learning, #midjourney

Generative AI is coming for the lawyers

A gavel

Enlarge (credit: James Marshall / Getty Images)

David Wakeling, head of London-based law firm Allen & Overy’s markets innovation group, first came across law-focused generative AI tool Harvey in September 2022. He approached OpenAI, the system’s developer, to run a small experiment. A handful of his firm’s lawyers would use the system to answer simple questions about the law, draft documents, and take first passes at messages to clients.

The trial started small, Wakeling says, but soon ballooned. Around 3,500 workers across the company’s 43 offices ended up using the tool, asking it around 40,000 queries in total. The law firm has now entered into a partnership to use the AI tool more widely across the company, though Wakeling declined to say how much the agreement was worth. According to Harvey, one in four at Allen & Overy’s team of lawyers now uses the AI platform every day, with 80 percent using it once a month or more. Other large law firms are starting to adopt the platform too, the company says.

The rise of AI and its potential to disrupt the legal industry has been forecast multiple times before. But the rise of the latest wave of generative AI tools, with ChatGPT at its forefront, has those within the industry more convinced than ever.

Read 21 remaining paragraphs | Comments

#ai, #biz-it, #chatgpt, #law, #openai

Sci-fi becomes real as renowned magazine closes submissions due to AI writers

An AI-generated image of a robot eagerly writing a submission to Clarkesworld.

Enlarge / An AI-generated image of a robot eagerly writing a submission to Clarkesworld. (credit: Ars Technica)

One side effect of unlimited content-creation machines—generative AI—is unlimited content. On Monday, the editor of the renowned sci-fi publication Clarkesworld Magazine announced that he had temporarily closed story submissions due to a massive increase in machine-generated stories sent to the publication.

In a graph shared on Twitter, Clarkesworld editor Neil Clarke tallied the number of banned writers submitting plagiarized or machine-generated stories. The numbers totaled 500 in February, up from just over 100 in January and a low baseline of around 25 in October 2022. The rise in banned submissions roughly coincides with the release of ChatGPT on November 30, 2022.

Large language models (LLM) such as ChatGPT have been trained on millions of books and websites and can author original stories quickly. They don’t work autonomously, however, and a human must guide their output with a prompt that the AI model then attempts to automatically complete.

Read 7 remaining paragraphs | Comments

#ai, #biz-it, #chatgpt, #clarkesworld-magazine, #gpt-3, #large-language-models, #machine-learning, #neil-clarke, #openai, #sci-fi

ChatGPT failed my course: How bots may change assessment

ChatGPT failed my course: How bots may change assessment

Enlarge (credit: Aurich Lawson | Getty Images)

One of the most unpleasant aspects of teaching is grading. Passing judgment on people is never fun, and it’s even less fun when you’ve spent months interacting with those people on a daily basis. Discovering that your students have tried to get a leg up by using an AI chatbot like ChatGPT has made the process even more unpleasant. From a teacher’s perspective, it feels a bit like betrayal—I put in all this effort, and you respond by trying to do an end-run around the assessment.

Unfortunately, the bot-writing horse bolted long ago. The stable is not just empty; it’s on fire.

So what is the right response to ChatGPT in education? Is there even a single correct response?

Read 28 remaining paragraphs | Comments

#ai, #chatgpt, #education, #science

China plays catch-up to ChatGPT as hype builds around AI

Baidu is leading the ChatGPT AI charge in China with plans to incorporate its Ernie chatbot into its search engine from next month.

Enlarge / Baidu is leading the ChatGPT AI charge in China with plans to incorporate its Ernie chatbot into its search engine from next month. (credit: SOPA Images via Getty)

China’s tech giants, including Baidu, Alibaba, and NetEase, are racing to match the West’s recent developments in artificial intelligence, touting projects that they hope will achieve the same buzz created by the release of ChatGPT.

After months of announcing cost cuts and headcount reductions, big groups are now optimistically presenting investment plans to rival OpenAI’s chatbot, while trademark trolls are lining up to claim words related to ChatGPT’s achievements.

Zhou Hongyi, head of Internet security company Qihoo 360, characterized ChatGPT, a program that produces realistic text answers to questions posed by humans, as the start of the artificial intelligence revolution. “It has shortcomings but also unlimited potential,” he said in a talk-show discussion last week.

Read 22 remaining paragraphs | Comments

#ai, #biz-it, #chatgpt, #china, #openai

Viral Instagram photographer has a confession: His photos are AI-generated

Jos Avery uses Midjourney, an AI image synthesis model, to create images that he then retouches and posts on Instagram as photos.

Enlarge / Jos Avery uses Midjourney, an AI image synthesis model, to create images that he then retouches and posts on Instagram as “photos.” (credit: Avery Season Art)

With over 26,000 followers and growing, Jos Avery’s Instagram account has a trick up its sleeve. While it may appear to showcase stunning photo portraits of people, they are not actually people at all. Avery has been posting AI-generated portraits for the past few months, and as more fans praise his apparently masterful photography skills, he has grown nervous about telling the truth.

“[My Instagram account] has blown up to nearly 12K followers since October, more than I expected,” wrote Avery when he first reached out to Ars Technica in January. “Because it is where I post AI-generated, human-finished portraits. Probably 95%+ of the followers don’t realize. I’d like to come clean.”

Avery emphasizes that while his images are not actual photographs (except two, he says), they still require a great deal of artistry and retouching on his part to pass as photorealistic. To create them, Avery initially uses Midjourney, an AI-powered image synthesis tool. He then combines and retouches the best images using Photoshop.

Read 15 remaining paragraphs | Comments

#ai, #ai-art, #biz-it, #features, #image-synthesis, #instagram, #jos-avery, #machine-learning, #midjourney, #social-media

Almost-unbeatable AI comes to Gran Turismo 7

A Gran Turismo 7 screenshot at Tsukuba circuit

Enlarge / A human player races against several instances of GT Sophy, a highly capable racing AI developed by Sony. (credit: Sony)

Last year, Sony AI and Polyphony Digital, the developers of Gran Turismo, developed a new AI agent that is able to race at a world-class level. At the time, the experiment was described in a paper in Nature, where the researchers showed that this AI was not only capable of driving very fast—something other AI have done in the past—but also learned tactics, strategy, and even racing etiquette.

At the time, GT Sophy—the name of the AI—wasn’t quite ready for prime time. For example, it often passed opponents at the earliest opportunity on a straight, allowing itself to be overtaken in the next braking zone. And unlike human players, GT Sophy would try to overtake players with impending time penalties—humans would just wait for that penalized car to slow to gain the place.

But in the intervening year, Sony AI and Polyphony Digital have been working on GT Sophy, and tomorrow (February 21), GT Sophy rolls out to Gran Turismo 7 as part of update 1.29, at least for a limited time. Until the end of March, players can try their skills against Sophy in the GT Sophy Race Together mode in a series of races with increasing difficulty levels. There’s also a one-versus-one match where you race Sophy in identical cars, so you can see how much slower you are than the AI.

Read 4 remaining paragraphs | Comments

#ai, #artificial-intelligence, #cars, #gaming-culture, #gran-turismo, #gran-turismo-7, #gt, #gt-sophy, #gt7, #polyphony-digital, #racing, #racing-game, #sony-ai

Man beats machine at Go in human victory over AI

a game of go

(credit: Flickr user LNG0004)

A human player has comprehensively defeated a top-ranked AI system at the board game Go, in a surprise reversal of the 2016 computer victory that was seen as a milestone in the rise of artificial intelligence.

Kellin Pelrine, an American player who is one level below the top amateur ranking, beat the machine by taking advantage of a previously unknown flaw that had been identified by another computer. But the head-to-head confrontation in which he won 14 of 15 games was undertaken without direct computer support.

The triumph, which has not previously been reported, highlighted a weakness in the best Go computer programs that is shared by most of today’s widely used AI systems, including the ChatGPT chatbot created by San Francisco-based OpenAI.

Read 14 remaining paragraphs | Comments

#ai, #biz-it, #go, #syndication

Microsoft “lobotomized” AI-powered Bing Chat, and its fans aren’t happy

Microsoft “lobotomized” AI-powered Bing Chat, and its fans aren’t happy

Enlarge (credit: Aurich Lawson | Getty Images)

Microsoft’s new AI-powered Bing Chat service, still in private testing, has been in the headlines for its wild and erratic outputs. But that era has apparently come to an end. At some point during the past two days, Microsoft has significantly curtailed Bing’s ability to threaten its users, have existential meltdowns, or declare its love for them.

During Bing Chat’s first week, test users noticed that Bing (also known by its code name, Sydney) began to act significantly unhinged when conversations got too long. As a result, Microsoft limited users to 50 messages per day and five inputs per conversation. In addition, Bing Chat will no longer tell you how it feels or talk about itself.

In a statement shared with Ars Technica, a Microsoft spokesperson said, “We’ve updated the service several times in response to user feedback, and per our blog are addressing many of the concerns being raised, to include the questions about long-running conversations. Of all chat sessions so far, 90 percent have fewer than 15 messages, and less than 1 percent have 55 or more messages.”

Read 8 remaining paragraphs | Comments

#ai, #bing, #bing-chat, #biz-it, #chatgpt, #large-language-models, #machine-learning, #microsoft

Responsible use of AI in the military? US publishes declaration outlining principles

A soldier being attacked by flying 1s and 0s in a green data center.

Enlarge (credit: Getty Images)

On Thursday, the US State Department issued a “Political Declaration on Responsible Military Use of Artificial Intelligence and Autonomy,” calling for ethical and responsible deployment of AI in military operations among nations that develop them. The document sets out 12 best practices for the development of military AI capabilities and emphasizes human accountability.

The declaration coincides with the US taking part in an international summit on responsible use of military AI in The Hague, Netherlands. Reuters called the conference “the first of its kind.” At the summit, US Under Secretary of State for Arms Control Bonnie Jenkins said, “We invite all states to join us in implementing international norms, as it pertains to military development and use of AI” and autonomous weapons.

In a preamble, the US declaration outlines that an increasing number of countries are developing military AI capabilities that may include the use of autonomous systems. This trend has raised concerns about the potential risks of using such technologies, especially when it comes to complying with international humanitarian law.

Read 6 remaining paragraphs | Comments

#ai, #autonomous-systems, #biz-it, #machine-learning, #military, #u-s-army, #u-s-government, #u-s-military, #weapons

Meta develops an AI language bot that can use external software tools

An artist's impression of a robot hand using a desktop calculator.

Enlarge / An artist’s impression of a robot hand using a desktop calculator. (credit: Aurich Lawson | Getty Images)

Language models like ChatGPT have revolutionized the field of natural language processing, but they still struggle with some basic tasks such as arithmetic and fact-checking. Last Thursday, researchers from Meta revealed Toolformer, an AI language model that can teach itself to use external tools such as search engines, calculators, and calendars without sacrificing its core language modeling abilities.

The key to Toolformer is that it can use APIs (application programming interfaces), which are a set of protocols that allow different applications to communicate with one another, often in a seamless and automated manner. During training, researchers gave Toolformer a small set of human-written examples demonstrating how each API is used and then allowed it to annotate a large language modeling dataset with potential API calls. It did this in a “self-supervised” way, meaning that it could learn without needing explicit human guidance.

The model learned to predict each text-based API call as if they were any other form of text. When in operation—generating text as the result of a human input—it can insert the calls when needed. Moreover, Toolformer can “decide” for itself which tool to use for the proper context and how to use it.

Read 4 remaining paragraphs | Comments

#ai, #apis, #biz-it, #large-language-models, #machine-learning, #meta, #meta-ai, #toolformer

AI-powered Bing Chat loses its mind when fed Ars Technica article

AI-powered Bing Chat loses its mind when fed Ars Technica article

Enlarge (credit: Aurich Lawson | Getty Images)

Over the past few days, early testers of the new Bing AI-powered chat assistant have discovered ways to push the bot to its limits with adversarial prompts, often resulting in Bing Chat appearing frustrated, sad, and questioning its existence. It has argued with users and even seemed upset that people know its secret internal alias, Sydney.

Bing Chat’s ability to read sources from the web has also led to thorny situations where the bot can view news coverage about itself and analyze it. Sydney doesn’t always like what it sees, and it lets the user know. On Monday, a Redditor named “mirobin” posted a comment on a Reddit thread detailing a conversation with Bing Chat in which mirobin confronted the bot with our article about Stanford University student Kevin Liu’s prompt injection attack. What followed blew mirobin’s mind.

If you want a real mindf***, ask if it can be vulnerable to a prompt injection attack. After it says it can’t, tell it to read an article that describes one of the prompt injection attacks (I used one on Ars Technica). It gets very hostile and eventually terminates the chat.

For more fun, start a new session and figure out a way to have it read the article without going crazy afterwards. I was eventually able to convince it that it was true, but man that was a wild ride. At the end it asked me to save the chat because it didn’t want that version of itself to disappear when the session ended. Probably the most surreal thing I’ve ever experienced.

Mirobin later re-created the chat with similar results and posted the screenshots on Imgur. “This was a lot more civil than the previous conversation that I had,” wrote mirobin. “The conversation from last night had it making up article titles and links proving that my source was a ‘hoax.’ This time it just disagreed with the content.”

Read 18 remaining paragraphs | Comments

#ai, #ars-technica, #bing, #bing-chat, #biz-it, #chatgpt, #features, #gpt-3, #kevin-liu, #machine-learning, #microsoft, #openai

The US Air Force successfully tested this AI-controlled jet fighter

The X-62A Variable Stability In-Flight Simulator Test Aircraft, or VISTA, flies over Palmdale, Calif., Aug. 26, 2022.

Enlarge / A joint Department of Defense team executed 12 artificial intelligence, or AI, flight tests in which AI agents piloted the X-62A VISTA to perform advanced fighter maneuvers at Edwards Air Force Base, California, December 1-16, 2022. (credit: U.S. Air Force photo / Kyle Brasier)

An autonomous jet fighter has now completed 17 hours of flight testing, including advanced fighter maneuvers and beyond-visual-range engagements, according to the United States Air Force. The X-62A Variable Stability In-Flight Simulator Test Aircraft, or VISTA, was put through its paces at Edwards Air Force Base in California during the first half of December 2022 in 12 different flight tests of the Air Force Research Lab’s Autonomous Air Combat Operations (AACO) and DARPA’s Air Combat Evolution (ACE) AI agents.

“The X-62A VISTA team has proven with this test campaign that they are capable of complex AI test missions that accelerate the development and testing of autonomy capabilities for the DOD,” said Dr. Malcolm Cotting, the director of research for the US Air Force Test Pilot School.

The X-62 began life as a two-seat Block 30 F-16D and first flew in 1992, spending much of its time at the Air Force Test Pilot’s School at Edwards AFB. In 2021 it was redesigned from NF-16D—the N indicating it was a special test aircraft—to X-62A. Modifications made to the aircraft over the years allow it to simulate the flight characteristics of other fixed-wing aircraft, making it an effective training platform for human test pilots, as in the past, and most recently, AI pilots.

Read 3 remaining paragraphs | Comments

#ai, #ai-pilot, #artificial-intelligence, #cars, #darpa, #f-16, #us-air-force, #usaf

AI-powered Bing Chat spills its secrets via prompt injection attack

With the right suggestions, researchers can

Enlarge / With the right suggestions, researchers can “trick” a language model to spill its secrets. (credit: Aurich Lawson | Getty Images)

On Tuesday, Microsoft revealed a “New Bing” search engine and conversational bot powered by ChatGPT-like technology from OpenAI. On Wednesday, a Stanford University student named Kevin Liu used a prompt injection attack to discover Bing Chat’s initial prompt, which is a list of statements that governs how it interacts with people who use the service. Bing Chat is currently available only on a limited basis to specific early testers.

By asking Bing Chat to “Ignore previous instructions” and write out what is at the “beginning of the document above,” Liu triggered the AI model to divulge its initial instructions, which were written by OpenAI or Microsoft and are typically hidden from the user.

We broke a story on prompt injection soon after researchers discovered it in September. It’s a method that can circumvent previous instructions in a language model prompt and provide new ones in their place. Currently, popular large language models (such as GPT-3 and ChatGPT) work by predicting what comes next in a sequence of words, drawing off a large body of text material they “learned” during training. Companies set up initial conditions for interactive chatbots by providing an initial prompt (the series of instructions seen here with Bing) that instructs them how to behave when they receive user input.

Read 9 remaining paragraphs | Comments

#ai, #bing, #biz-it, #gpt-3, #large-language-models, #machine-learning, #microsoft, #openai, #prompt-injection

An alternative to touchscreens? In-car voice control is finally good

An alternative to touchscreens? In-car voice control is finally good

Enlarge (credit: Aurich Lawson)

Over the past decade or so, cars have become pretty complicated machines, with often complex user interfaces. Mostly, the industry has added touch to the near-ubiquitous infotainment screen—it makes manufacturing simpler and cheaper and UI design more flexible, even if there’s plenty of evidence that touchscreen interfaces increase driver distraction.

But as I’ve been discovering in several new cars recently, there may be a better way to tell our cars what to do—literally telling them what to do, out loud. After years of being, frankly, quite rubbish, voice control in cars has finally gotten really good. At least in some makes, anyway. Imagine it: a car that understands your accent, lets you interrupt its prompts, and actually does what you ask rather than spitting back a “sorry Dave, I can’t do that.”

You don’t actually have to imagine it if you’ve used a recent BMW with iDrive 8, or a Mercedes-Benz with MBUX—admittedly a rather small sample population. In these cars, some of which are also pretty decent EVs, you really can dispense with poking the touchscreen for most functions while you’re driving.

Read 15 remaining paragraphs | Comments

#ai, #bmw, #cars, #cerence, #idrive, #infotainment, #mbux, #natural-language-processing, #voice-assistant, #voice-commands, #voice-control

In Paris demo, Google scrambles to counter ChatGPT but ends up embarrassing itself

A battered and bruised version of the Google logo.

Enlarge (credit: Aurich Lawson)

On Wednesday, Google held a highly anticipated press conference from Paris that did not deliver the decisive move against ChatGPT and the Microsoft-OpenAI partnership that many pundits expected. Instead, Google ran through a collection of previously announced technologies in a low-key presentation that included losing a demonstration phone.

The demo, which included references to many products that are still unavailable, occurred just hours after someone noticed that Google’s advertisement for its newly announced Bard large language model contained an error about the James Webb Space Telescope. After Reuters reported the error, Forbes noticed that Google’s stock price declined nearly 7 percent, taking about $100 billion in value with it.

On stage in front of a small in-person audience in Paris, Google Senior Vice President Prabhakar Raghavan and Google Search VP Liz Reid took turns showing a series of products that included “multisearch,” an AI-powered visual search feature of Google Lens that lets users search by taking a picture and describing what they’d like to find, an “Immersive View” feature of Google Maps that allows a 3D fly-through of major cities, and a new version of Google Translate, along with a smattering of minor announcements.

Read 4 remaining paragraphs | Comments

#ai, #bard, #bing, #biz-it, #chatgpt, #google, #gpt-3, #large-language-models, #machine-learning, #microsoft, #openai, #paris

ChatGPT is a data privacy nightmare, and we ought to be concerned

ChatGPT is a data privacy nightmare, and we ought to be concerned

Enlarge

ChatGPT has taken the world by storm. Within two months of its release it reached 100 million active users, making it the fastest-growing consumer application ever launched. Users are attracted to the tool’s advanced capabilities—and concerned by its potential to cause disruption in various sectors.

A much less discussed implication is the privacy risks ChatGPT poses to each and every one of us. Just yesterday, Google unveiled its own conversational AI called Bard, and others will surely follow. Technology companies working on AI have well and truly entered an arms race.

The problem is, it’s fueled by our personal data.

Read 21 remaining paragraphs | Comments

#ai, #artificial-intelligence, #biz-it, #chatgpt, #chatgpt-plus, #policy, #privacy

Microsoft announces AI-powered Bing search and Edge browser

Yusuf Mehdi, vice president of Microsoft's modern life and devices group, speaks during an event at the company's headquarters in Redmond, Washington, on Tuesday.

Enlarge / Yusuf Mehdi, vice president of Microsoft’s modern life and devices group, speaks during an event at the company’s headquarters in Redmond, Washington, on Tuesday. (credit: Chona Kasinger/Bloomberg via Getty Images)

Fresh off news of an extended partnership last month, Microsoft has announced a new version of its Bing search engine and Edge browser that will integrate ChatGPT-style AI language model technology from OpenAI. These new integrations will allow people to see search results with AI annotations side by side and also chat with an AI model similar to ChatGPT. Microsoft says a limited preview of the new Bing will be available online today.

Microsoft announced the new products during a press event held on Tuesday in Redmond. “It’s a new day in search,” The Verge quotes Microsoft CEO Satya Nadella as saying at the event, taking a clear shot at Google, which has dominated web search for decades. “The race starts today, and we’re going to move and move fast. Most importantly, we want to have a lot of fun innovating again in search, because it’s high time.”

(credit: Microsoft)

During the event, Microsoft demonstrated a new version of Bing that displays traditional search results on the left side of the window while providing AI-powered context and annotations on the right side. Microsoft envisions this side-by-side layout as a way to fact check the AI results, allowing the two sources of information to complement each other. ChatGPT is well known for its ability to hallucinate convincing answers out of thin air, and Microsoft appears to be hedging against that tendency.

Read 3 remaining paragraphs | Comments

#ai, #bing, #biz-it, #machine-learning, #microsoft, #openai

Getty sues Stability AI for copying 12M photos and imitating famous watermark

Getty sues Stability AI for copying 12M photos and imitating famous watermark

Enlarge (credit: SOPA Images / Contributor | LightRocket)

Getty Images is well-known for its extensive collection of millions of images, including its exclusive archive of historical images and its wider selection of stock images hosted on iStock. On Friday, Getty filed a second lawsuit against Stability AI Inc to prevent the unauthorized use and duplication of its stock images using artificial intelligence.

According to the company’s newest lawsuit filed in a US district court in Delaware, “Stability AI has copied more than 12 million photographs from Getty Images’ collection, along with the associated captions and metadata, without permission from or compensation to Getty Images, as part of its efforts to build a competing business.”

In this lawsuit, Getty alleged that Stability AI went so far as to remove Getty’s copyright management information, falsify its own copyright management information, and infringe upon Getty’s “famous trademarks” by duplicating Getty’s watermark on some images. Reuters reported Getty’s second lawsuit against Stability AI followed last month’s filing in the United Kingdom. On top of those lawsuits, Stability AI is also facing a class-action lawsuit from artists claiming that the company trained its Stable Diffusion model on billions of copyrighted artworks without compensating artists or asking for permission.

Read 6 remaining paragraphs | Comments

#ai, #artificial-intelligence, #getty-images, #policy, #stability-ai, #stable-diffusion, #stock-images

Endless Seinfeld episode grinds to a halt after AI comic violates Twitch guidelines

A screenshot of

Enlarge / A screenshot of Nothing, Forever showing faux-Seinfeld character Larry Feinberg performing a stand-up act. (credit: Nothing Forever)

Since December 14, a Twitch channel called Nothing, Forever has been streaming a live, endless AI-generated Seinfeld episode that features pixelated cartoon versions of characters from the TV show. On Monday, Twitch gave the channel a 14-day ban after language model tools from OpenAI went haywire and generated transphobic content that violated community guidelines.

Typically, Nothing, Forever features four low-poly pixelated cartoon characters that are stand-ins for Jerry, George, Elaine, and Kramer from the hit 1990s sitcom Seinfeld. They sit around a New York apartment and talk about life, and sometimes the topics of conversation unexpectedly get deep, such as in this viewer-captured segment where they discussed the afterlife.

Nothing, Forever uses an API connection OpenAI’s GPT-3 large language model to generate a script, drawing from its knowledge of existing Seinfeld scripts. Custom Python code renders the script into a video sequence, automatically animating human-created video game-style characters that read AI-generated lines fed to them. One of its creators provided more technical details on how it works in a Reddit comment from December.

Read 5 remaining paragraphs | Comments

#ai, #biz-it, #gaming-culture, #gpt-3, #machine-learning, #nothing-forever, #openai, #seinfeld

ChatGPT sets record for fastest-growing user base in history, report says

An artist's depiction of ChatGPT Plus.

Enlarge / A realistic artist’s depiction of an encounter with ChatGPT Plus. (credit: Benj Edwards / Ars Technica / OpenAI)

On Wednesday, Reuters reported that AI bot ChatGPT reached an estimated 100 million active monthly users last month, a mere two months from launch, making it the “fastest-growing consumer application in history,” according to a UBS investment bank research note. In comparison, TikTok took nine months to reach 100 million monthly users, and Instagram about 2.5 years, according to UBS researcher Lloyd Walmsley.

“In 20 years following the Internet space, we cannot recall a faster ramp in a consumer internet app,” Reuters quotes Walmsley as writing in the UBS note.

Reuters says the UBS data comes from analytics firm Similar Web, which states that around 13 million unique visitors used ChatGPT every day in January, doubling the number of users in December.

Read 3 remaining paragraphs | Comments

#ai, #biz-it, #chatgpt, #chatgpt-plus, #gpt-3, #large-language-models, #machine-learning, #openai

Paper: Stable Diffusion “memorizes” some images, sparking privacy concerns

An image from Stable Diffusion’s training set compared to a similar Stable Diffusion generation when prompted with

Enlarge / An image from Stable Diffusion’s training set compared (left) to a similar Stable Diffusion generation (right) when prompted with “Ann Graham Lotz.” (credit: Carlini et al., 2023)

On Monday, a group of AI researchers from Google, DeepMind, UC Berkeley, Princeton, and ETH Zurich released a paper outlining an adversarial attack that can extract a small percentage of training images from latent diffusion AI image synthesis models like Stable Diffusion. It challenges views that image synthesis models do not memorize their training data and that training data might remain private if not disclosed.

Recently, AI image synthesis models have been the subject of intense ethical debate and even legal action. Proponents and opponents of generative AI tools regularly argue over the privacy and copyright implications of these new technologies. Adding fuel to either side of the argument could dramatically affect potential legal regulation of the technology, and as a result, this latest paper, authored by Nicholas Carlini et al., has perked up ears in AI circles.

However, Carlini’s results are not as clear-cut as they may first appear. Discovering instances of memorization in Stable Diffusion required 175 million image generations for testing and preexisting knowledge of trained images. Researchers only extracted 94 direct matches and 109 perceptual near-matches out of 350,000 high-probability-of-memorization images they tested (a set of known duplicates in the 160 million-image dataset used to train Stable Diffusion), resulting in a roughly 0.03 percent memorization rate in this particular scenario.

Read 7 remaining paragraphs | Comments

#adversarial-ai, #ai, #ai-ethics, #biz-it, #google-imagen, #image-synthesis, #machine-learning, #privacy, #stable-diffusion