Dataset Generator that uses the TIMIT dataset to generate audio with splicing and copy-move forgery used for my Bachelor's Degree Final Project.

JoseRuiz01/SplicingAndCopyMoveDatasetGenerator

🎧 Splicing and Copy-Move Audio Forgery Dataset Generator

This project contains two audio forgery dataset generators based on the TIMIT speech corpus. It simulates splicing and copy-move forgeries for use in training and evaluating audio forensic systems.


🛠️ Overview

The dataset generation process involves applying transformations to authentic audio files from TIMIT using two distinct methods:

🔀 1. RandomPosition Method

Simulates forgeries by:

  • Selecting a random segment from the original audio.
  • Inserting that segment at a random new position.
  • Reconstructing the audio so that the inserted segment appears naturally within the waveform.
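
The three steps above can be sketched as follows. This is a minimal illustration, not the repository's code: the function name `copy_move`, its parameters, and the choice to insert the copy (lengthening the signal) rather than overwrite existing samples are all assumptions. Samples are held in a plain Python list to keep the sketch dependency-free.

```python
import random


def copy_move(samples, segment_len, rng=None):
    """Copy a random segment of `samples` and insert it at a random position.

    Hypothetical helper: returns the forged signal together with the source
    start index and the insertion index, which can serve as ground truth.
    """
    rng = rng or random.Random()
    if not 0 < segment_len <= len(samples):
        raise ValueError("segment_len must be between 1 and len(samples)")
    # Step 1: select a random segment from the original audio.
    start = rng.randint(0, len(samples) - segment_len)
    segment = samples[start:start + segment_len]
    # Step 2: choose a random insertion point (any boundary in the original).
    insert_at = rng.randint(0, len(samples))
    # Step 3: rebuild the signal around the inserted copy.
    forged = samples[:insert_at] + segment + samples[insert_at:]
    return forged, start, insert_at


# Stand-in for 1 s of audio at 16 kHz, the TIMIT sampling rate.
original = list(range(16000))
forged, start, insert_at = copy_move(original, 4000, random.Random(0))
```

Returning `start` and `insert_at` alongside the forged signal makes it possible to label exactly where the tampering happened, which may be useful when evaluating localization rather than plain detection.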
📌 Forgery Sample Generation

Original A: ---[Original Audio A]---
Original B: ---[Original Audio B]---
Forgery:    ---[Segment from A]---[Segment from B]---[Remaining A]---
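
One reading of the diagram above is that recording A is cut at a random point and a random segment of recording B is placed at the cut. The sketch below follows that reading. The function `splice` and its parameters are hypothetical names chosen for illustration, and plain lists again stand in for audio arrays.

```python
import random


def splice(samples_a, samples_b, segment_len, rng=None):
    """Insert a random segment of B into A at a random cut point.

    Hypothetical helper: returns the forged signal, the cut index in A,
    and the start index of the segment taken from B.
    """
    rng = rng or random.Random()
    if not 0 < segment_len <= len(samples_b):
        raise ValueError("segment_len must be between 1 and len(samples_b)")
    # Choose which part of B to borrow.
    b_start = rng.randint(0, len(samples_b) - segment_len)
    segment_b = samples_b[b_start:b_start + segment_len]
    # Choose where to cut A; everything after the cut is the "remaining A".
    cut = rng.randint(0, len(samples_a))
    forged = samples_a[:cut] + segment_b + samples_a[cut:]
    return forged, cut, b_start


audio_a = [0.0] * 32000                          # 2 s of silence at 16 kHz
audio_b = [float(i) for i in range(1, 24001)]    # 1.5 s of distinct values
forged, cut, b_start = splice(audio_a, audio_b, 8000, random.Random(1))
```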


🔁 2. Concatenation Method

Based on the paper:
"Autoencoder for Audio Forgery Detection using Spliced and Copy-Move Audio",
📄 Shaikh et al., 2021
Read the paper here

This method simulates forgeries by:

  • Extracting 2-second and 1-second segments from each audio file.
  • Concatenating them in different combinations to simulate forged samples.
  • Producing:
    • 3-second forged audio
    • 2-second forged audio
📌 Forgery Sample Generation
  • Forgery: 2s [Segment from A] + 1s [Segment from B] → 3s [Forged Audio]
  • Forgery: 1s [Segment from A] + 1s [Segment from B] → 2s [Forged Audio]
  • Forgery: 1s [Segment from A] + 1s [Segment from B] + 1s [Segment from A] → 3s [Forged Audio]
  • Forgery: 0.5s [Segment from A] + 1s [Segment from B] + 0.5s [Segment from A] → 2s [Forged Audio]
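
The four combinations above can be expressed as data and assembled with one small function. This is a sketch under assumptions, not the repository's implementation: `RECIPES`, `take`, and `build_forgery` are hypothetical names, and successive segments from the same source are assumed to be read from consecutive positions starting at the beginning of the file. The 16 kHz rate is the TIMIT sampling rate.

```python
SAMPLE_RATE = 16000  # TIMIT recordings are sampled at 16 kHz

# Each recipe is an ordered list of (source, duration in seconds).
RECIPES = {
    "2A+1B": [("A", 2.0), ("B", 1.0)],
    "1A+1B": [("A", 1.0), ("B", 1.0)],
    "1A+1B+1A": [("A", 1.0), ("B", 1.0), ("A", 1.0)],
    "0.5A+1B+0.5A": [("A", 0.5), ("B", 1.0), ("A", 0.5)],
}


def take(samples, seconds, offset_seconds=0.0):
    """Return `seconds` of audio starting at `offset_seconds`."""
    start = int(offset_seconds * SAMPLE_RATE)
    length = int(seconds * SAMPLE_RATE)
    segment = samples[start:start + length]
    if len(segment) < length:
        raise ValueError("audio too short for the requested segment")
    return segment


def build_forgery(recipe, audio_a, audio_b):
    """Concatenate segments of A and B in the order given by `recipe`."""
    sources = {"A": audio_a, "B": audio_b}
    offsets = {"A": 0.0, "B": 0.0}  # read position within each source
    forged = []
    for name, seconds in recipe:
        forged.extend(take(sources[name], seconds, offsets[name]))
        offsets[name] += seconds  # assumption: next segment continues from here
    return forged


audio_a = [0.0] * (3 * SAMPLE_RATE)
audio_b = [1.0] * (3 * SAMPLE_RATE)
forged_by_recipe = {
    name: build_forgery(recipe, audio_a, audio_b)
    for name, recipe in RECIPES.items()
}
```

Keeping the recipes in a dictionary means a new combination can be added without touching the assembly logic.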

📂 Output

From the original audio files, the tool generates three datasets:

  • Original audio dataset
  • Copy-move forgeries dataset
  • Splicing forgeries dataset

📌 Use Cases

  • Training deep learning models for audio forgery detection
  • Evaluating robustness of audio forensic systems
  • Dataset creation for research in speech integrity
