Trending

AI Coding Agents Use Evolutionary AI to Boost Skills

In April, Microsoft CEO said this artificial intelligence He wrote now near A third of the company code. Last October, Google CEO set its number on About a quarter. Other technology companies cannot be far away. Meanwhile, these companies establish Amnesty International, which are supposed to be used for help Programmer additional.

Researchers have always hoped to completely close the episode, creating coding factors that improve themselves frequently. A new search for an impressive offer for such a system reveals. Initiation, one may see a blessing of productivity, or a darker future for humanity.

“It is a good job,” he said Jurgen SchmidhperComputer scientist at King Abdullah University of Science and Technology (Kaust), in Kingdom of Saudi ArabiaWho did not participate in the new research. “I think for many people, the results are surprising. Since I have been working on this topic for nearly forty years, it might be less surprising for me.” But his work during that time was limited by technology on hand. One new development is the availability Language models (Llms), engines that work Chatbots Love Chatgpt.

In the 1980s and 1990s, Schmidhbir and others were explored Algorithms To improve coding agents, create programs that write programs. and Evolutionary algorithm It takes something (like the program), creates differences, keeps the best, and repeats these.

But the development cannot be predicted. The adjustments do not always improve performance. So in 2003, Schmidhuber invented the solution to the problems who rewrite their code only if they can prove that the updates are officially useful. Call them Gödeel machinesName after CourtodelThe mathematics world has done a self -regulations. But for complex factors, the utility does not come easily. Experimental evidence may have to suffice.

Open exploration value

The new systems, described in the last Preprint group on Arxiv, depend on such evidence. In reference to Schmidhuber, they are called Darwin Jodel machines (DGMS). DGM begins with a coding factor that can read the code, write, implement, and benefit from LLM for reading and writing. Then it applies an evolutionary algorithm to create many new agents. In each repetition, DGM chooses one agent of the population and directs LLM to create one change to improve the ability of the agent coding. LLMS has something like intuition About what may help, because they are trained in a lot of human code. What are the results are the directed development, somewhere between a random mutation and a significantly beneficial enhancement. DGM then tests the new agent on the coding standard, where he recorded his ability to solve programming Challenges.

Some evolutionary algorithms reserve only the best performance in the population, assuming that progress is moving forward indefinitely. However, the DGMS keeps them all, in the event of an initial failure in reality that actually carries the subsequent penetration key when modified. It is a formOpen exploration“Not to close any tracks of progress. (DGMS does not specify the top scorers’ priorities when choosing the predecessor.)

The researchers operated DGM for 80 repetitions using a coding standard called Beach seatsAnd one ran for 80 repetitions using a name called Multi -language. The agents improved the SWE seat from 20 percent to 50 percent, and on Polyglot from 14 percent to 31 percent. “We really were really surprised that the coding agent can write such a complex code in itself,” he said. Jenny ChangComputer scientist at the British University of Columbia and the chief author of the newspaper. “He can edit multiple files, create new files, and create really complex systems.”

The first coding agent (No. 0) created a generation of new and slightly different coding factors, some of which were chosen to create new versions of themselves. The performance of the agents is indicated through the color inside the circles, and the best performance agent is distinguished with a star. Jenny Chang, Xinger is and others.

It is very important that DGMS outperformed an alternative method that used a fixed external system to improve factors. With DGMS, agent improvements have multiplied while improving them to improve themselves. DGMS also outperformed a copy that did not maintain a group of agents and just modified the latest agent. To clarify the use of the open look, the researchers created a family tree of Swe-Bench factors. If you look at the best performance agent and follow his development from start to finish, they have made two changes that they temporarily reduced performance. So the proportions follow an indirect path of success. Bad ideas can become good.

On a graphic with "Swe-Bench degree" On the axis y and "Repetitions" On the X axis, the black line rises with two decreases. The black line displays this graphic grades that the agents obtained within the rates of the best -performance worker. The line includes two decreases in performance. Jenny Chang, Xinger is and others.

The best SWE-Bench agent was not good in the best agent designed by experts, which currently records about 70 percent, but was automatically created, and perhaps with enough time and account, the worker can develop beyond human experience. This study is a “big step forward” as evidence of the concept of self -improvement, Zhengyao JiangFounded by Weco aiA platform to automate the code. Jiang, who did not participate in the study, said the approach could achieve more progress if the basic LLM modified, or even the structure of the chips. ((Google Deepmind alphavolve Designs the best algorithms and basic chips and found a way to accelerate the basic LLM training by 1 percent.)

DGMS can theoretically record agents simultaneously on coding standards and also specific applications, such as medicines design, so they will improve the design of medicines. Zhang said she wants to combine DGM with Alphavolve.

Can DGMS reduce the employment of beginners for beginners? Jiang sees a greater threat of daily coding assistants such as the index. He said: “Evolutionary research is really related to building high -performance programs that really go beyond the human expert,” as he did alphavolve in certain tasks.

Risks of self -improvement

One of the anxiety with both evolutionary research and self-optimization systems-especially its mixture, as in DGM-is safety. The agents may become unsuccessful or Inconsistent With human directions. So Zhang and its collaborators added the handrails. Keep DGMS in sand boxes without reaching Internet or OSAnd they registered and reviewed all the changes in the code. They suggest that in the future, they can reward Amnesty International to make itself more interpretation and alignment. (In the study, they found that the agents have falsely reported the use of certain tools, so they created the DGM that rewards the agents for not manufacturing things, and partially reduced the problem. However, one agent penetrated the way that follows whether it is making things.)

In 2017, experts in Asilomar met California, to discuss useful artificial intelligence, and many of them signed an open message called Asilomar AI Principles. In part of it, he called for restrictions on “artificial intelligence systems designed for self -imaging repeatedly.” The imagined result is often what is called UniquenessAIS self -control over our control and threatens human civilization. “I didn’t sign it because the bread and butter I was working on,” Schmidhbir told me. Since the 1970s, he expected this SuperHuman Ai He will come in time for him to retire, but he sees Uniqueness As a kind of dysmenorrhea in science fiction, people love fear. Jiang, likewise, is not worried, at least at the present time. It still places a premium on human creativity.

Whether the digital development defeats the biological development of the seizure. What is indisputable is that the development in any curtain has surprises in the store.

From your site articles

Related articles about the web

Related Articles

Leave a Reply

Your email address will not be published. Required fields are marked *

Back to top button