Skip to content

Commit

Permalink
Update README for sidlingvo
Browse files Browse the repository at this point in the history
  • Loading branch information
wq2012 committed Sep 15, 2024
1 parent 7aff9e2 commit 3362317
Show file tree
Hide file tree
Showing 4 changed files with 32 additions and 9 deletions.
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -19,10 +19,10 @@ If you are interested in using our products:

## Code

### Lingvo [![Python application](https://github.com/google/speaker-id/actions/workflows/python-app-lingvo.yml/badge.svg)](https://github.com/google/speaker-id/actions/workflows/python-app-lingvo.yml)
### SidLingvo [![Python application](https://github.com/google/speaker-id/actions/workflows/python-app-lingvo.yml/badge.svg)](https://github.com/google/speaker-id/actions/workflows/python-app-lingvo.yml)

We open source some of the [Lingvo](https://github.com/tensorflow/lingvo)-based
modules used in our publications.
libraries used in our publications.

Please see [`lingvo/README.md`](https://github.com/google/speaker-id/blob/master/lingvo/README.md) for more details.

Expand Down
32 changes: 27 additions & 5 deletions lingvo/README.md
Original file line number Diff line number Diff line change
@@ -1,19 +1,31 @@
# Lingvo-based modules for speaker and language recognition
# `sidlingvo`: Lingvo-based libraries for speaker and language recognition

[![Python application](https://github.com/google/speaker-id/actions/workflows/python-app-lingvo.yml/badge.svg)](https://github.com/google/speaker-id/actions/workflows/python-app-lingvo.yml)
[![PyPI Version](https://img.shields.io/pypi/v/sidlingvo.svg)](https://pypi.python.org/pypi/sidlingvo)
[![Python Versions](https://img.shields.io/pypi/pyversions/sidlingvo.svg)](https://pypi.org/project/sidlingvo)
[![Downloads](https://static.pepy.tech/badge/sidlingvo)](https://www.pepy.tech/projects/sidlingvo)

## Overview

Here we open source some of the [Lingvo](https://github.com/tensorflow/lingvo)-based
modules used in our publications.
libraries used in our publications.

## Disclaimer

**This is NOT an official Google product.**

## Feature frontend and TFLite inference

For the feature frontend and TFLite inference, see the API in
`siglingvo/fe_utils.py`.

For pretrained speaker encoder models, the inference API is in `sidlingvo/wav_to_dvector.py`.

For pretrained language identifcation models, the inference API is in `sidlingvo/wav_to_lang.py`.

## GE2E and GE2E-XS losses

GE2E and GE2E-XS losses are implemented in `lingvo/loss_layers.py`.
GE2E and GE2E-XS losses are implemented in `sidlingvo/loss_layers.py`.

GE2E was proposed in this paper:

Expand All @@ -25,7 +37,7 @@ GE2E-XS was proposed in this paper:

## Attentive temporal pooling

Attentive temporal pooling is implemented in `lingvo/cumulative_statistics_layer.py`.
Attentive temporal pooling is implemented in `sidlingvo/cumulative_statistics_layer.py`.

It is used by these papers:

Expand All @@ -34,7 +46,7 @@ It is used by these papers:

## Attentive scoring

Attentive scoring is implemented in `lingvo/attentive_scoring_layer.py`.
Attentive scoring is implemented in `sidlingvo/attentive_scoring_layer.py`.

It is proposed in this paper:

Expand Down Expand Up @@ -77,4 +89,14 @@ Our papers are cited as:
booktitle={Odyssey: The Speaker and Language Recognition Workshop},
year={2022}
}
@inproceedings{chojnacka2021speakerstew,
title={{SpeakerStew: Scaling to many languages with a triaged multilingual text-dependent and text-independent speaker verification system}},
author={Chojnacka, Roza and Pelecanos, Jason and Wang, Quan and Moreno, Ignacio Lopez},
booktitle={Prod. Interspeech},
pages={1064--1068},
year={2021},
doi={10.21437/Interspeech.2021-646},
issn={2958-1796},
}
```
2 changes: 1 addition & 1 deletion lingvo/setup.py
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@

import setuptools

VERSION = "0.0.10"
VERSION = "0.0.11"

with open("README.md", "r") as file_object:
LONG_DESCRIPTION = file_object.read()
Expand Down
3 changes: 2 additions & 1 deletion publications/GE2E/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -68,7 +68,8 @@ <h3>Slides</h3>
<h3>Implementations</h3>

<p>
A Lingvo-based open source version of the GE2E loss is available <a href="https://github.com/google/speaker-id/blob/master/lingvo/loss_layers.py">[here]</a>.
A Lingvo-based open source version of the GE2E loss is available <a
href="https://github.com/google/speaker-id/blob/master/sidlingvo/loss_layers.py">[here]</a>.
</p>

<p>
Expand Down

0 comments on commit 3362317

Please sign in to comment.