AI & ML interests

On the edge of research in automatic speech recognition for telephony.

YehorĀ 
posted an update 17 days ago
view post
Post
153
Updated the demo for the new version of the W2V-BERT model for Ukrainian audio recognition.

This is a classic Automatic Speech Recognition or Speech to Text task.

What's new in version three:

• more data: 1200 hours
• new SentencePiece tokenizer with 512 tokens
• feature extraction is done via a Rust extension

Facts:

• Training was started from the previous model to speed up the learning process.
• Training takes place on two 3090 video cards with 24 GB each.
• It is well suited for fine-tuning because the training data is very diverse and mostly noisy.

You can try it here:

Yehor/w2v-bert-uk-v3

Download weights here:

speech-uk/w2v-bert-v3

If you wish to support the speech-uk initiative with a donation, here is the link to Monobank:

https://send.monobank.ua/jar/3Saxixsdua
YehorĀ 
posted an update 4 months ago
YehorĀ 
posted an update 7 months ago
YehorĀ 
posted an update 7 months ago
YehorĀ 
posted an update 7 months ago
view post
Post
286
Containerized Yehor/kulyk-en-uk and Yehor/kulyk-uk-en so you can just pull an image and run CPU-version to do machine translation:

docker run -p 3000:3000 --rm ghcr.io/egorsmkv/kulyk-rust:latest
YehorĀ 
posted an update 10 months ago
view post
Post
770
A new lightweight model to do machine translation from English to Ukrainian using recently published LFM2 model. Use demo Yehor/en-uk-translator to test it.

Facts:
- Fine-tuned with 40M samples (filtered by quality metric) from ~53.5M for 1.4 epochs
- 354M params
- Requires 1 GB of RAM to run with bf16
- BLEU on FLORES-200: 27.24
- Tokens per second: 229.93 (bs=1), 1664.40 (bs=10), 8392.48 (bs=64)
- License: lfm1.0

Mode page: Yehor/kulyk-en-uk
  • 5 replies
Ā·
YehorĀ 
posted an update about 1 year ago
YehorĀ 
posted an update about 1 year ago
view post
Post
2513
Made a workable program that uses IREE runtime using Rust to inference wav2vec2-bert model for Automatic Speech Recognition.
  • 1 reply
Ā·
YehorĀ 
posted an update about 1 year ago
view post
Post
2709
I have made a Rust project with integration of the latest state-of-the-art model for object detection, it outperforms YOLO!

Check it out: https://github.com/egorsmkv/rf-detr-usls
  • 2 replies
Ā·
YehorĀ 
posted an update about 1 year ago
YehorĀ 
posted an update about 1 year ago
view post
Post
2269
Create spectrogram using Rust!

Slightly improved nice project that creates spectrogram and built binaries for different platform using cross-rs I've mentioned earlier in my channel.

Repo: https://github.com/crs-org/sonogram
  • 1 reply
Ā·
YehorĀ 
posted an update about 1 year ago
YehorĀ 
posted an update about 1 year ago
YehorĀ 
posted an update about 1 year ago
view post
Post
2058
Are you interesting in different runtimes for AI models?

Check out IREE (iree.dev), it convert models to MLIR and then execute on different platforms.

I have tested it in Rust on CPU and CUDA: https://github.com/egorsmkv/eerie-yolo11
YehorĀ 
posted an update about 1 year ago
view post
Post
2259
Extract audio datasets with Rust on blazingly fast speeds!

With this tool you can extract audio files from a parquet or arrow file generated by Hugging Face datasets library.

Repository: https://github.com/egorsmkv/extract-audio
  • 1 reply
Ā·
YehorĀ 
posted an update about 1 year ago
YehorĀ 
posted an update about 1 year ago
YehorĀ 
updated a Space about 1 year ago