Qwen LLM

The Qwen LLM process enables local inference of Qwen language models for text generation, conversation, and creative applications within score.

Overview

Qwen (通义千问) is a series of large language models that can be run locally for:

Text generation and completion
Interactive conversations
Code generation
Creative writing
Real-time text processing in performances

Inputs

Port	Type	Description	Default	Range
Prompt	String	Input text or question	-	-
Model	File Selector	Path to Qwen ONNX model	-	.onnx files
Tokenizer	File Selector	Path to Qwen tokenizer	-	.json files
Temperature	Float	Randomness control	-	-
Top P	Float	Nucleus sampling	0.9	0.0-1.0
Max Tokens	Integer	Maximum response length	-	-
Top K	Integer	Top-k sampling	50	1-100

Outputs

Port	Type	Description
Response	String	Generated text
Tokens/second	Float	Number of tokens processed
Generation Info	Float	Generation statistics

Model Variants

There are multiple variants of Qwen, all with different tradeoffs. For instance, you can download the smallest, 0.6B, on HuggingFace.

Qwen-1.8B

Lightweight, fast inference
Suitable for real-time applications
Lower memory requirements

Qwen-7B

Better quality responses
More creative capabilities
Higher resource usage

Qwen-14B

Best quality
Advanced reasoning
Requires significant resources

Usage Examples

Interactive Text Generation

Create dynamic text based on sensor input:

[Sensor Data] → [Prompt Composer] → [Qwen LLM] → [Text Display]
                                          ↓
                                  [Text to Speech]

Live Performance Assistant

Generate performance cues and text:

[MIDI Input] → [Note to Text] → [Qwen LLM] → [Projection]
                                      ↑
                            [Performance Context]

Automated Storytelling

Create evolving narratives:

[Scene Analysis] → [Qwen LLM] → [Story Text]
        ↑               ↓
    [FastVLM]    [Memory Buffer]

Prompt Engineering

System Messages

Set the model’s behavior and context:

“You are a poetic narrator describing visual scenes”
“Generate short, rhythmic responses suitable for music”
“Respond only with stage directions”

Effective Prompting

Be specific about desired output format
Provide examples when possible
Use consistent formatting
Include relevant context

Creative Applications

Poetry generation from sensor data
Dynamic subtitles for performances
Interactive narrative experiences
Generative dialogue systems

Performance Optimization

Model Loading

Load models at startup to avoid delays
Use quantized models for better performance
Consider model size vs. quality tradeoffs

Streaming Mode

Enable streaming for:

Real-time text display
Reduced latency perception
Progressive text revelation

Caching

Cache common responses
Implement context windowing
Reuse computation where possible

Integration Examples

Combine with other AI processes:

[Camera] → [FastVLM] → [Scene Description]
                              ↓
                         [Qwen LLM]
                              ↓
              [Poetic Interpretation] → [Display]

Code Generation

Generate live code for other processes:

[Musical Input] → [Analysis] → [Qwen LLM] → [JavaScript Code]
                                                    ↓
                                            [JS Process]

Best Practices

Model Selection : Choose model size based on quality/performance needs
Temperature Tuning : Lower for consistency, higher for creativity
Context Length : Balance context with performance
Error Handling : Implement fallbacks for failed generations
Resource Monitoring : Watch memory and CPU/GPU usage

Troubleshooting

Slow Generation

Use smaller models
Reduce max token length

Poor Quality Output

Adjust temperature and sampling parameters
Improve prompt clarity
Provide better examples
Check model compatibility

Memory Issues

Use quantized models
Reduce batch size
Clear context regularly
Monitor system resources

Creative Use Cases

Performance

Generate live subtitles
Create dynamic poetry
Produce stage directions
Generate character dialogue

Installation

Visitor interaction systems
Generative text displays
Responsive narratives
Educational experiences

Experimentation

Text-based compositions
Language exploration
Conceptual art generation
Hybrid human-AI creation

AI Prompt Composer - Build complex prompts
Text - Text display
JavaScript - Process generated code

ossia