Voice Input
Enable voice-to-text functionality so users can speak their messages instead of typing.
Voice Input Benefits
Enable Voice Input
Basic Configuration
Enable the microphone button in the input area:
Supported APIs
Web Speech API (Default)
Built into modern browsers. Free, privacy-friendly, works offline.
Browser Support:
Chrome, Edge, Safari (limited). Not Firefox.
Google Cloud Speech-to-Text
More accurate, supports more languages. Requires API key and billing.
OpenAI Whisper
Excellent accuracy, multilingual. Requires OpenAI API key.
Language Configuration
Recognition Language
Set the language for speech recognition:
Supported Languages
Multi-Language Support
Allow users to select their preferred language:
Recording Behavior
Recording Modes
Push-to-Talk
Hold button to record, release to stop. Quick and intuitive.
🎤 Hold to speak...
Click-to-Toggle
Click once to start, click again to stop. Better for longer messages.
● Recording... Click to stop
Auto-Stop
Automatically stop after silence detected. Hands-free experience.
Visual Feedback
Provide clear feedback during recording:
- Animated Microphone Icon: Pulsing or waveform animation
- Recording Timer: Show elapsed time
- Waveform Visualization: Real-time audio level display
- Status Text: "Listening...", "Processing...", "Done!"
Recording...
0:05
Transcription Display
Real-Time Transcription
Show text as it's being transcribed:
Confidence Levels
Handle low-confidence transcriptions:
Permissions & Privacy
Microphone Permission
Handle browser microphone permissions gracefully:
First Time
Show explanation before requesting permission: "We need microphone access for voice input."
Permission Denied
Provide instructions to enable: "Please allow microphone access in your browser settings."
Permission Granted
Show success indicator and enable voice button.
Privacy Considerations
Respect user privacy:
- Clear Indicator: Always show when microphone is active
- Easy Cancellation: Allow stopping recording anytime
- No Background Recording: Only record when explicitly activated
- Data Retention: Don't store audio files unless necessary
- Privacy Policy: Disclose voice data handling in privacy policy
GDPR Compliance
Error Handling
Handle common voice input errors:
No Speech Detected
"We didn't hear anything. Please try again."
Network Error
"Connection lost. Please check your internet and try again."
Language Not Recognized
"We couldn't understand. Try speaking more clearly."
Accessibility
Make voice input accessible to all users:
- Keyboard Shortcuts: Spacebar to activate voice (when focused)
- Screen Reader Support: Announce recording status and transcription
- Visual Feedback: Don't rely solely on audio cues
- Alternative Methods: Always keep text input available
- Clear Labels: "Voice Input" button with descriptive ARIA labels
Next: File Upload
Learn how to enable file attachments so users can share images and documents.
File Upload Setup →