Frequently Asked Questions

What file formats do you support?

SpeakToWords supports a variety of common audio formats including:

MP3
WAV
M4A
AAC

If you have a file in a different format, we recommend converting it using a free online converter before uploading.

What is the maximum file size I can upload?

We currently support file uploads up to 500MB in size. For larger files, we recommend splitting them into smaller segments before uploading. This not only helps with the upload process but can also improve transcription accuracy.

How accurate is the transcription?

Our transcription service typically achieves 90-95% accuracy for clear audio with minimal background noise. Factors that can affect accuracy include:

Audio quality and clarity
Background noise levels
Speaker accents
Technical or specialized vocabulary

Premium users have access to our enhanced accuracy models which can improve results for challenging audio.

How much does transcription cost?

Our standard rate is $1 per minute of audio. We offer volume discounts for larger uploads:

30+ minutes: 10% discount
60+ minutes: 15% discount
120+ minutes: 20% discount

Premium subscribers receive 30% off all transcriptions. For more details, visit our pricing page.

How long does transcription take?

Transcription time depends on the length of your audio and current system load. As a general guideline:

Files under 10 minutes: Usually complete within 5-10 minutes
Files 10-30 minutes: Usually complete within 15-30 minutes
Files 30+ minutes: May take 30+ minutes

Premium users receive priority processing that can reduce these times by up to 50%.

What languages do you support?

SpeakToWords supports transcription in over 30 languages, including:

English (US, UK, AU variants)
Spanish
French
German
Italian
Japanese
Mandarin Chinese
Russian
Portuguese
Hindi
Turkish
Bulgarian
Czech
Arabic
Belarusian
Ukrainian
And many more...

Our system automatically detects the primary language, but for best results, we recommend specifying it during upload.

How secure is my data?

We take data security very seriously:

All uploads and downloads are encrypted using TLS/SSL
Files are stored encrypted at rest
Audio files are automatically deleted 30 days after transcription
We never share or sell your data to third parties

For more information, please review our Privacy Policy.

Can you identify different speakers in a conversation?

Yes, our service includes automatic speaker diarization, which identifies and labels different speakers in your audio. This feature works best when:

The audio quality is good
Speakers don't talk over each other frequently
There are distinct vocal differences between speakers

Premium users have access to enhanced speaker identification that can better distinguish between similar voices.

Do you offer refunds?

We offer refunds in the following circumstances:

If our system fails to process your file due to technical issues on our end
If the transcription quality is significantly below our standard accuracy rates for clear audio

Refund requests are evaluated on a case-by-case basis. To request a refund, please contact our support team through the Contact page.

How do I become a premium member?

Becoming a premium member is easy:

Log in to your account
Visit the Upgrade page
Select your preferred subscription plan
Complete the secure payment process

Premium benefits are activated immediately after payment. You can cancel your subscription at any time from your account settings.