Qwen LLM
The Qwen LLM process enables local inference of Qwen language models for text generation, conversation, and creative applications within score.
Overview
Qwen (通义千问) is a series of large language models that can be run locally for:
- Text generation and completion
- Interactive conversations
- Code generation
- Creative writing
- Real-time text processing in performances
Inputs
Port | Type | Description | Default | Range |
---|---|---|---|---|
Prompt | String | Input text or question | - | - |
Model | File Selector | Path to Qwen ONNX model | - | .onnx files |
Tokenizer | File Selector | Path to Qwen tokenizer | - | .json files |
Temperature | Float | Randomness control | - | - |
Top P | Float | Nucleus sampling | 0.9 | 0.0-1.0 |
Max Tokens | Integer | Maximum response length | - | - |
Top K | Integer | Top-k sampling | 50 | 1-100 |
Outputs
Port | Type | Description |
---|---|---|
Response | String | Generated text |
Tokens/second | Float | Number of tokens processed |
Generation Info | Float | Generation statistics |
Model Variants
There are multiple variants of Qwen, all with different tradeoffs. For instance, you can download the smallest, 0.6B, on HuggingFace.
Qwen-1.8B
- Lightweight, fast inference
- Suitable for real-time applications
- Lower memory requirements
Qwen-7B
- Better quality responses
- More creative capabilities
- Higher resource usage
Qwen-14B
- Best quality
- Advanced reasoning
- Requires significant resources
Usage Examples
Interactive Text Generation
Create dynamic text based on sensor input:
[Sensor Data] → [Prompt Composer] → [Qwen LLM] → [Text Display]
↓
[Text to Speech]
Live Performance Assistant
Generate performance cues and text:
[MIDI Input] → [Note to Text] → [Qwen LLM] → [Projection]
↑
[Performance Context]
Automated Storytelling
Create evolving narratives:
[Scene Analysis] → [Qwen LLM] → [Story Text]
↑ ↓
[FastVLM] [Memory Buffer]
Prompt Engineering
System Messages
Set the model’s behavior and context:
- “You are a poetic narrator describing visual scenes”
- “Generate short, rhythmic responses suitable for music”
- “Respond only with stage directions”
Effective Prompting
- Be specific about desired output format
- Provide examples when possible
- Use consistent formatting
- Include relevant context
Creative Applications
- Poetry generation from sensor data
- Dynamic subtitles for performances
- Interactive narrative experiences
- Generative dialogue systems
Performance Optimization
Model Loading
- Load models at startup to avoid delays
- Use quantized models for better performance
- Consider model size vs. quality tradeoffs
Streaming Mode
Enable streaming for:
- Real-time text display
- Reduced latency perception
- Progressive text revelation
Caching
- Cache common responses
- Implement context windowing
- Reuse computation where possible
Integration Examples
Multi-Modal Performance
Combine with other AI processes:
[Camera] → [FastVLM] → [Scene Description]
↓
[Qwen LLM]
↓
[Poetic Interpretation] → [Display]
Code Generation
Generate live code for other processes:
[Musical Input] → [Analysis] → [Qwen LLM] → [JavaScript Code]
↓
[JS Process]
Best Practices
- Model Selection : Choose model size based on quality/performance needs
- Temperature Tuning : Lower for consistency, higher for creativity
- Context Length : Balance context with performance
- Error Handling : Implement fallbacks for failed generations
- Resource Monitoring : Watch memory and CPU/GPU usage
Troubleshooting
Slow Generation
- Use smaller models
- Reduce max token length
Poor Quality Output
- Adjust temperature and sampling parameters
- Improve prompt clarity
- Provide better examples
- Check model compatibility
Memory Issues
- Use quantized models
- Reduce batch size
- Clear context regularly
- Monitor system resources
Creative Use Cases
Performance
- Generate live subtitles
- Create dynamic poetry
- Produce stage directions
- Generate character dialogue
Installation
- Visitor interaction systems
- Generative text displays
- Responsive narratives
- Educational experiences
Experimentation
- Text-based compositions
- Language exploration
- Conceptual art generation
- Hybrid human-AI creation
Related Processes
- AI Prompt Composer - Build complex prompts
- Text - Text display
- JavaScript - Process generated code