Embedding-Security

Embedding Inversion Simulation This script demonstrates how a vector embedding, which may seem anonymous, can be reverse-engineered to reconstruct the original sensitive data it represents. It uses a real sentence-transformer model to generate embeddings and a text generation model (GPT-2) to simulate the reconstruction attack.

Requirements Python 3.7+

pip (Python package installer)

Installation Clone or download the repository/script.

Navigate to the script's directory in your terminal.

Install the required Python libraries using the provided requirements.txt file. Run the following command:

pip install -r requirements.txt

This will install all necessary packages, including numpy, torch, sentence-transformers, transformers, and scipy.

Running the Simulation Once the installation is complete, you can run the simulation script directly from your terminal:

python3 simulation_embedding.py

The script will then execute the simulation, printing the original secret, the generated "anonymous" vector, the discovered semantic keywords, and the final reconstructed text to your console.

###########################################################

Second script demonstrate Data Poisoning the AI's knowledge base to skew its output or inject bias. Uploading malicious documents or compromising data feeds before they are vectorized.

python3 rag_poisoning.py

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
resumes_kb		resumes_kb
README.md		README.md
rag_hr_poisned.py		rag_hr_poisned.py
requirements.txt		requirements.txt
simulation_embedding.py		simulation_embedding.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Embedding-Security

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

PureStorage-OpenConnect/Embedding-Security

Folders and files

Latest commit

History

Repository files navigation

Embedding-Security

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages