Frequently Asked Questions
Find answers to the most common questions about SpeakToWords
SpeakToWords supports a variety of common audio formats including:
- MP3
- WAV
- M4A
- AAC
If you have a file in a different format, we recommend converting it using a free online converter before uploading.
We currently support file uploads up to 500 MB in size. For larger files, we recommend splitting them into smaller segments before uploading. Smaller segments upload more reliably and can also improve transcription accuracy.
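If you upload files from a script, a quick local check can catch unsupported formats and oversized files before you start. The following is a minimal sketch, not part of any official SpeakToWords tooling: the function name `check_upload` is hypothetical, the format is guessed from the file extension only, and "500MB" is assumed to mean 500 × 1024 × 1024 bytes.

```python
from pathlib import Path

# Formats and size limit taken from the answers above.
SUPPORTED_EXTENSIONS = {".mp3", ".wav", ".m4a", ".aac"}
# Assumption: "500MB" is read as 500 * 1024 * 1024 bytes.
MAX_UPLOAD_BYTES = 500 * 1024 * 1024


def check_upload(filename: str, size_bytes: int) -> list:
    """Return a list of problems; an empty list means the file looks uploadable."""
    problems = []
    ext = Path(filename).suffix.lower()
    if ext not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported format: {ext or 'no extension'}")
    if size_bytes > MAX_UPLOAD_BYTES:
        problems.append("file exceeds 500 MB; split it into smaller segments")
    return problems
```

For example, `check_upload("interview.MP3", 10_000_000)` returns an empty list, while a 600 MB `.flac` file is flagged for both format and size.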
Our transcription service typically achieves 90-95% accuracy for clear audio with minimal background noise. Factors that can affect accuracy include:
- Audio quality and clarity
- Background noise levels
- Speaker accents
- Technical or specialized vocabulary
Premium users have access to our enhanced-accuracy models, which can improve results for challenging audio.
Our standard rate is $1 per minute of audio. We offer volume discounts for larger uploads:
- 30+ minutes: 10% discount
- 60+ minutes: 15% discount
- 120+ minutes: 20% discount
Premium subscribers receive 30% off all transcriptions. For more details, visit our pricing page.
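To see how these rates combine, here is a rough cost estimator. It is a sketch under stated assumptions, not our billing logic: it assumes the volume discount applies to the whole upload once a threshold is reached, and that the premium 30% discount replaces the volume discount rather than stacking with it. The function names are illustrative only; the pricing page remains the authoritative source.

```python
RATE_PER_MINUTE = 1.00  # USD, standard rate from the answer above

# (minimum minutes, discount), checked from the largest threshold down.
VOLUME_DISCOUNTS = [(120, 0.20), (60, 0.15), (30, 0.10)]

# Assumption: the premium discount replaces, and does not stack with,
# the volume discount.
PREMIUM_DISCOUNT = 0.30


def volume_discount(minutes: float) -> float:
    """Return the volume discount for an upload of the given length."""
    for threshold, discount in VOLUME_DISCOUNTS:
        if minutes >= threshold:
            return discount
    return 0.0


def estimate_cost(minutes: float, premium: bool = False) -> float:
    """Estimate the price in USD, rounded to cents."""
    discount = PREMIUM_DISCOUNT if premium else volume_discount(minutes)
    return round(minutes * RATE_PER_MINUTE * (1 - discount), 2)
```

Under these assumptions, a 45-minute file costs 45 × $1 × 0.90 = $40.50 at the standard rate and 45 × $1 × 0.70 = $31.50 for a premium subscriber.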
Transcription time depends on the length of your audio and current system load. As a general guideline:
- Files under 10 minutes: usually complete within 5-10 minutes
- Files of 10-30 minutes: usually complete within 15-30 minutes
- Files over 30 minutes: may take 30 minutes or more
Premium users receive priority processing that can reduce these times by up to 50%.
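If you schedule work around transcription jobs, the guideline above can be expressed as a small lookup. This is only a sketch of the published ranges, not a guarantee of processing time. The function name is hypothetical, a file of exactly 30 minutes is assumed to fall in the longest band, and premium priority processing is not modeled because "up to 50%" does not fix an exact figure.

```python
from typing import Optional, Tuple


def turnaround_guideline(audio_minutes: float) -> Tuple[int, Optional[int]]:
    """Return (low, high) expected processing minutes; high is None when open-ended."""
    if audio_minutes < 10:
        return (5, 10)
    # Assumption: exactly 30 minutes of audio counts as the longest band.
    if audio_minutes < 30:
        return (15, 30)
    return (30, None)
```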
SpeakToWords supports transcription in over 30 languages, including:
- English (US, UK, AU variants)
- Spanish
- French
- German
- Italian
- Japanese
- Mandarin Chinese
- Russian
- Portuguese
- Hindi
- Turkish
- Bulgarian
- Czech
- Arabic
- Belarusian
- Ukrainian
- And many more...
Our system automatically detects the primary language, but for best results, we recommend specifying it during upload.
We take data security very seriously:
- All uploads and downloads are encrypted using TLS/SSL
- Files are stored encrypted at rest
- Audio files are automatically deleted 30 days after transcription
- We never sell your data or share it with third parties
For more information, please review our Privacy Policy.
Yes, our service includes automatic speaker diarization, which identifies and labels different speakers in your audio. This feature works best when:
- The audio quality is good
- Speakers don't talk over each other frequently
- There are distinct vocal differences between speakers
Premium users have access to enhanced speaker identification that can better distinguish between similar voices.
We offer refunds in the following circumstances:
- If our system fails to process your file due to technical issues on our end
- If the transcription quality is significantly below our standard accuracy rates for clear audio
Refund requests are evaluated on a case-by-case basis. To request a refund, please contact our support team through the Contact page.
Becoming a premium member is easy:
- Log in to your account
- Visit the Upgrade page
- Select your preferred subscription plan
- Complete the secure payment process
Premium benefits are activated immediately after payment. You can cancel your subscription at any time from your account settings.