Requirements:

```
librosa
soundfile
accelerate
ffmpeg
torchaudio
transformers==4.45.1
```

2.1.1 Download the LibriSpeech dataset from LibriSpeech.
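LibriSpeech extracts into a fixed speaker/chapter directory tree that any data-loading code has to navigate. As a minimal illustration of that layout (the helper below is hypothetical, not part of this repository):

```python
from pathlib import Path

def librispeech_flac_path(root: str, split: str, utterance_id: str) -> Path:
    # LibriSpeech utterance IDs have the form "speaker-chapter-index",
    # and each .flac lives under <root>/<split>/<speaker>/<chapter>/.
    speaker, chapter, _ = utterance_id.split("-")
    return Path(root) / split / speaker / chapter / f"{utterance_id}.flac"
```

For example, `librispeech_flac_path("LibriSpeech", "train-clean-100", "1089-134686-0001")` resolves to `LibriSpeech/train-clean-100/1089/134686/1089-134686-0001.flac`.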
By default, the pre-trained model laion/clap-htsat-unfused is used. To train, run:

```
python train.py
```

Note: remember to modify the data path and other parameters in train.py before running. The trained model will be saved in checkpoints/model.
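CLAP is trained with a symmetric contrastive objective over paired audio and text embeddings, and train.py presumably optimizes something similar when fine-tuning. A dependency-free sketch of that objective (illustrative only; the actual loss in train.py may differ):

```python
import math

def clap_contrastive_loss(sim):
    """Symmetric cross-entropy over an audio-text similarity matrix.

    sim[i][j] is the (scaled) similarity between audio i and text j;
    matched pairs sit on the diagonal. This is a sketch of the
    CLAP-style objective, not the exact loss used by train.py.
    """
    n = len(sim)

    def mean_cross_entropy(matrix):
        total = 0.0
        for i, row in enumerate(matrix):
            m = max(row)  # stabilize the log-sum-exp
            log_z = m + math.log(sum(math.exp(x - m) for x in row))
            total += log_z - row[i]  # -log softmax at the matched index
        return total / n

    text_to_audio = [list(col) for col in zip(*sim)]  # transpose
    return 0.5 * (mean_cross_entropy(sim) + mean_cross_entropy(text_to_audio))
```

A well-trained model drives matched similarities far above mismatched ones, pushing this loss toward zero.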
```
cd local
pip install -r requirements.txt
python generate.py
python clap_opt_1_minut.py
```

The sample code is in local/vote.py.
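clap_opt_1_minut.py is not documented here; assuming it re-ranks candidate outputs by their CLAP audio-text similarity, the core scoring step reduces to cosine similarity between embeddings (function names below are hypothetical):

```python
import math

def cosine_similarity(a, b):
    # Cosine similarity between two embedding vectors.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def best_candidate(audio_emb, text_embs):
    # Return the index of the candidate text embedding closest to the
    # audio embedding, plus all scores. Embeddings are assumed to be
    # precomputed, e.g. by a CLAP model.
    scores = [cosine_similarity(audio_emb, t) for t in text_embs]
    return max(range(len(scores)), key=scores.__getitem__), scores
```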
```
cd local
python vote.py
```
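vote.py's exact scheme is not spelled out here; a common baseline for combining multiple candidate outputs is simple majority voting, which can be sketched as (illustrative only):

```python
from collections import Counter

def majority_vote(candidates):
    # Return the candidate that occurs most often; ties are broken by
    # first appearance in the input list. Sketch only — local/vote.py
    # may combine candidates differently.
    counts = Counter(candidates)
    top = max(counts.values())
    for candidate in candidates:
        if counts[candidate] == top:
            return candidate
```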