Nouvelles et reportages
Computer-use agents are AI systems that autonomously navigate and interact with software applications through graphical user interfaces (GUIs), and they are emerging as a new capability in artificial intelligence. By navigating and manipulating the same visual interfaces that people use,…
In recent years, as the shift toward agentic AI has accelerated, automation has advanced to handle increasingly complex tasks, from document and code generation to image creation, visual understanding, and mathematical reasoning. This trend points to the growing need to…
When industry knowledge meets PIKE-RAG: The innovation behind Signify’s customer service boost
| Industry Innovation Center
A collaboration between Signify and Microsoft Research shows how PIKE-RAG improves enterprise knowledge systems, delivering a 12% increase in accuracy and faster, more reliable answers.
Large vision-language models are improving at describing images, yet hallucinations still erode trust by introducing contradictions and fabricated details that limit practical applications. In response, Microsoft Research Asia has developed On-Policy Alignment DPO (OPA-DPO), a new algorithm that aligns expert…
Developers who are blind or have low vision have historically been limited to back-end programming, but new research suggests AI programming assistants are changing that in remarkable ways. A Microsoft Research Asia study found that developers who use screen readers…
Prix | The Hong Kong University of Science and Technology
Lidong Zhou awarded Honorary Fellowship by HKUST
Dans l’actualité | The Korea Times
IITP, Microsoft collaborate to build global network for fostering AI workers
Dans l’actualité | VentureBeat
Microsoft’s new AI framework trains powerful reasoning models with a fraction of the cost
RenderFormer: How neural networks are reshaping 3D rendering
| Yue Dong
RenderFormer, from Microsoft Research, is the first model to show that a neural network can learn a complete graphics rendering pipeline. It’s designed to support full-featured 3D rendering using only machine learning—no traditional graphics computation required.