13 October 2024

From GUI to CUI: Exploring the Optimal Human-Machine Interaction Paradigm in the Age of AI

Most AI-powered applications have adopted the Conversational User Interface (CUI) for interaction, making it seem as though human-AI interaction is synonymous with conversation. 

We’ve even heard some SaaS vendors claim they will switch their products entirely to CUI. 

So, what advantages does CUI offer over the traditional Graphical User Interface (GUI)?


From GUI to CUI: 
Exploring the Optimal Human-Machine Interaction Paradigm in the Age of AI


Hzongquan
4 min read
## CUI vs GUI
The primary advantage of CUI is its **low learning curve**. Users do not need to learn all the functionalities displayed on the screen (as in GUIs). Instead, they only need to express their needs clearly through language.
CUI is also more **flexible**. GUIs can respond to fixed user services, and the development cost for long-tail custom requirements is high. Typically, GUIs serve as a proxy for API service calls and returns, with this proxy being static. On the other hand, large language models (LLMs) can call tools (APIs) flexibly (though the current accuracy of these calls still leaves much to be desired).
**Despite its longevity, GUIs naturally have their own advantages:**
GUIs display information in a structured manner (layout and formatting), multimodally (visual charts, images, videos), which is more human-friendly. For example, most people prefer watching movies to reading books.
Additionally, GUIs inherently provide rich contextual information. Sometimes, clicking a button can accomplish tasks that are difficult to express through language.## Integrating GUI and CUI
As AI-powered applications advance, many products are exploring interaction paradigms that leverage the strengths of both CUI and GUI. Here are a few examples:

### AI Search
The AI search space is highly competitive. After Perplexity was replicated with just 500 lines of code, GPT Search is entering the fray. Let’s see how GPT Search innovates in AI search interactions:
#### 1. Richer Information Formats: Visual charts, images, and even videos.



#### 2. Structured Information Presentation.


#### 3. More Interactive Information
For example, hovering over a link displays a summary of the linked page.

In summary, GPT Search uses CUI for queries but employs GUI for responses.
### AI Content Generation
AI content generation applications abound. A recent product called mymap generates mind maps from text prompts, helping users expand their thinking. It achieved an MRR of over $10,000 upon launch.
Compared to plain text output, mymap dynamically draws mind maps from scratch, making it more engaging.

https://www.youtube.com/watch?v=-j1TBwBb6zw

### AI Programming
This next product is currently a demo. In this application, you can collaborate with AI to design a product’s PRD, generate ER diagrams, then proceed to database and API designs, ultimately generating front-end and back-end code.



In this example, CUI is simplified into a microphone or text input box integrated into the GUI. If the GUI is the cockpit for operating the platform, the AI operates using the same cockpit. You know how it interacts with the platform and can interrupt it at any time. This interaction paradigm leverages the low learning curve and flexibility of CUI while utilizing the advantages of GUI.
Recent hits like Cursor and OpenAI canvas exemplify this interaction paradigm. Under this approach, each business team can integrate AI capabilities more deeply into their operations.

## Conclusion
CUI allows for simple and flexible expression of user needs, while GUI provides rich and intuitive information presentation. Embedding microphones (or text boxes) within GUIs for queries and allowing GUIs to provide animated responses can combine the strengths of both CUI and GUI, making AI act more like a co-pilot working in the same workspace.
From a technical perspective, this integrated interaction form differs from pure CUI by transforming text streams into UI streams.

How is this transformation achieved? What are the development costs?
Stay tuned for our next article:
“Decoding the AI Co-Pilot: Transforming Text Streams into Cool GUI Streams.”

No comments:

Sergei Rachmaninov - Russian Rhapsody {Русская рапсодия} (1891) Vladimir Ashkenazy and André Previn

  Sergei Vasilievich Rachmaninoff (Russian: Серге́й Васи́льевич Рахма́нинов, tr. Sergéj Vasíl'evič Rahmáninov; 1 April [O.S. 20 March] 1...