Philo-S

Speech to Text (STT)

Learning voice data through deep learning-based acoustic models and converting it into textual information that advances speech recognition, speaker classification, and voice quality through RNN- based speaker vector generation.

Real-time voice recognition API service

Speaker separation analysis and noise cancellation technology through speaker voice filtering

Speech synthesis though text information

Features

Speech with Noise

Noise elimination
by applying voice filtering

Speech recognition & separation + Quality enhancement

Use Case

Realtime call transcripts to text

Recognizes the voice of each speaker separately and records the call as a text document
Possible integration with video call platforms

AI Speaker

Improved Speech recognition ratio
Collaboration with leading AI speaker manufacturers
STT for AI Speaker's NPL process

Online Classes

Recognize instructor's voice and generates handwritten notes automatically
Natural communication & feedback environment (Q&A)

Video Conferencing

Enhancement of voice quality by eliminating noise for each speaker
Automatically generates meeting minutes by each speaker through feature extraction and separation

Automated Subtitle Generator

Generates subtitles automatically by extracting audios directly from movies or audio files
Automated lyrics generation

Telemedicine

Q&A between doctors & patients
Possible integration with video call platforms

7 Speech Synthesis

Reads text information in a selected voice
AI professional announcer’s voice function

Case Study

Philo-S

Online class service provider

AI Solution for advanced voice quality of

multi-user online lecture platform

7 month Development | 13 Professionals Manpower