SpeechFlow
/

spoken_language_identification

Audio Classification

Model card Files Files and versions Community

spoken_language_identification / README.md

emmazhong's picture

Update README.md

d800dff over 2 years ago

|

history blame contribute delete

1.71 kB

	---
	license: apache-2.0
	metrics:
	- accuracy
	language:
	- en
	- zh
	- ko
	- ja
	- de
	- fr
	- es
	- pt
	- vi
	- tr
	- it
	- ru
	- id
	tags:
	- keras
	- tensorflow
	libraries: TensorBoard
	pipeline_tag: audio-classification
	---

	# Spoken_language_identification

	## Model description

	This is a spoken language recognition model trained on 2k hours of private dataset using Tensorflow. Approximately 150 hours of speech supervision per language.
	the model uses the CRNN-Attention architecture that has previously been used for extracting utterance-level feature representations.
	The system is trained with recordings sampled at 16kHz, single channel, and 16-bit Signed Integer PCM encoding.

	More details can be found here: [GitHub](https://github.com/SpeechFlow-io/Spoken_language_identification)

	The model can classify a speech utterance according to the language spoken. It covers 13 different languages.

	\| Molde Parameters \| Supported Languages \|
	\|----------\|--------------------------\|
	\| 1 M \| chinese, english, french, german, indonesian, italian, japanese, korean, portuguese, russian, spanish, turkish, vietnamese\|

	## Example
	[![ Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/16-Nre8aDvn0wN2dsgGa3xUsZ7S61e1h8#scrollTo=Is60zUMuPqSi)
	Please see the provided Colab for details for runing an example.
	#### How to use

	```python

	import librosa
	from huggingface_hub import from_pretrained_keras
	from featurizers.speech_featurizers import TFSpeechFeaturizer,
	model = from_pretrained_keras("SpeechFlow/spoken_language_identification")
	signal, _ = librosa.load(wav_path, sr=16000)
	output, prob = model.predict_pb(signal)
	print(output)

	```