Frequently Asked Questions

Find answers to the most common questions about SpeakToWords.

What file formats do you support?

SpeakToWords supports a variety of common audio formats including:

  • MP3
  • WAV
  • M4A
  • AAC

If your file is in a different format, we recommend converting it to one of the formats above with a free online converter before uploading.

What is the maximum file size I can upload?

We currently support uploads of up to 500 MB per file. For larger files, we recommend splitting them into smaller segments before uploading. This not only makes uploading easier but can also improve transcription accuracy.
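
As a rough illustration, the supported formats and the size limit could be checked locally before uploading. This is only a sketch: the function name `check_upload` is hypothetical, it judges the format by file extension alone, and it assumes the 500 MB limit means 500 × 1024 × 1024 bytes, which the service may define differently.

```python
from pathlib import Path

# Formats listed in this FAQ; matching on the extension alone is an assumption.
SUPPORTED_EXTENSIONS = {".mp3", ".wav", ".m4a", ".aac"}

# Assumes 500 MB means 500 * 1024 * 1024 bytes.
MAX_SIZE_BYTES = 500 * 1024 * 1024


def check_upload(filename: str, size_bytes: int) -> list[str]:
    """Return a list of problems; an empty list means the file looks uploadable."""
    problems = []
    ext = Path(filename).suffix.lower()
    if ext not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported format: {ext or 'no extension'}")
    if size_bytes > MAX_SIZE_BYTES:
        problems.append("file is larger than 500 MB; consider splitting it")
    return problems
```

For a file on disk, the size could be read with `Path(filename).stat().st_size` and passed in as `size_bytes`.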

How accurate is the transcription?

Our transcription service typically achieves 90-95% accuracy for clear audio with minimal background noise. Factors that can affect accuracy include:

  • Audio quality and clarity
  • Background noise levels
  • Speaker accents
  • Technical or specialized vocabulary

Premium users have access to our enhanced-accuracy models, which can improve results for challenging audio.

How much does transcription cost?

Our standard rate is $1 per minute of audio. We offer volume discounts for larger uploads:

  • 30+ minutes: 10% discount
  • 60+ minutes: 15% discount
  • 120+ minutes: 20% discount

Premium subscribers receive 30% off all transcriptions. For more details, visit our pricing page.
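
To make the arithmetic concrete, here is a small sketch of how a price might be estimated from the rates above. It rests on several assumptions that this FAQ does not confirm: that the volume tiers apply to the length of a single upload, that the premium discount replaces the volume discount rather than stacking with it, and that partial minutes are billed proportionally. The function name `estimate_cost` is hypothetical, and the pricing page remains the authoritative source.

```python
RATE_PER_MINUTE = 1.00  # standard rate in USD

# (minimum minutes, discount) pairs from the list above, largest tier first.
VOLUME_TIERS = [(120, 0.20), (60, 0.15), (30, 0.10)]
PREMIUM_DISCOUNT = 0.30


def estimate_cost(minutes: float, premium: bool = False) -> float:
    """Estimate the price in USD for one upload of the given length."""
    if premium:
        # Assumption: the premium discount replaces any volume discount.
        discount = PREMIUM_DISCOUNT
    else:
        # Pick the first (largest) tier whose minimum the upload reaches.
        discount = next((d for floor, d in VOLUME_TIERS if minutes >= floor), 0.0)
    return round(minutes * RATE_PER_MINUTE * (1 - discount), 2)
```

Under these assumptions, a 45-minute file would come to $40.50 at the standard rate with the 10% volume discount, and $31.50 for a premium subscriber.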

How long does transcription take?

Transcription time depends on the length of your audio and current system load. As a general guideline:

  • Files under 10 minutes: Usually complete within 5-10 minutes
  • Files 10-30 minutes: Usually complete within 15-30 minutes
  • Files over 30 minutes: May take 30 minutes or more

Premium users receive priority processing that can reduce these times by up to 50%.

What languages do you support?

SpeakToWords supports transcription in over 30 languages, including:

  • English (US, UK, AU variants)
  • Spanish
  • French
  • German
  • Italian
  • Japanese
  • Mandarin Chinese
  • Russian
  • Portuguese
  • Hindi
  • Turkish
  • Bulgarian
  • Czech
  • Arabic
  • Belarusian
  • Ukrainian
  • And many more...

Our system automatically detects the primary language, but for best results, we recommend specifying it during upload.

How secure is my data?

We take data security very seriously:

  • All uploads and downloads are encrypted using TLS/SSL
  • Files are stored encrypted at rest
  • Audio files are automatically deleted 30 days after transcription
  • We never sell your data or share it with third parties

For more information, please review our Privacy Policy.

Can you identify different speakers in a conversation?

Yes, our service includes automatic speaker diarization, which identifies and labels different speakers in your audio. This feature works best when:

  • The audio quality is good
  • Speakers don't talk over each other frequently
  • There are distinct vocal differences between speakers

Premium users have access to enhanced speaker identification that can better distinguish between similar voices.

Do you offer refunds?

We offer refunds in the following circumstances:

  • If our system fails to process your file due to technical issues on our end
  • If the transcription quality is significantly below our standard accuracy rates for clear audio

Refund requests are evaluated on a case-by-case basis. To request a refund, please contact our support team through the Contact page.

How do I become a premium member?

Becoming a premium member is easy:

  1. Log in to your account
  2. Visit the Upgrade page
  3. Select your preferred subscription plan
  4. Complete the secure payment process

Premium benefits are activated immediately after payment. You can cancel your subscription at any time from your account settings.