An Exploration of Model-State Data in Anomaly Detection | by Sara NÃ³brega

An Exploration of Model-State Data in Anomaly Detection | by Sara NÃ³brega | Apr, 2024

The Data

MNIST Pixel Data

The first dataset employed here is the usual MNIST pixel data, comprised by hand-written numbers. Here, the background is black and the digits are white.

\"\" — Figure 1: MNIST pixel data | Image by author

Anomalous MNIST Pixel Data

To test the new procedure and compare it to the usual one, I created four simple types of anomalous data.

The goal was to test each methodâs detection capabilities across a small spectrum of noise variations, incrementally intensified from one anomaly type to the next.

The noise rate increases from the first to the fourth type of anomalous data. As you can see in the figure below, in the first and second types of data, the noise is not even detectable to the naked eye, while in the third type, you can already spot some white pixels.

\"\" — Figure 2: Four types of anomalous data | Image by author

Model-state data

While MNIST pixel data, with its hand-written digits against a stark backdrop, provides a classic foundation for anomaly detection, weâre trying something else. Itâs a bit of a leap, taking us right into the core of the trained ANN to see what the neurons are up to. This could give us a whole new angle on spotting anomalies.

As mentioned, this model state data is comprised by the state of the neurons in an ANN when trained with MNIST data. As such, to generate this data, we start with training a simple ANN with MNIST pixel data, both with normal and anomalous data (the anomalous are comprised by the noisy data showed before in Figure 2).

We then perform the usual: split the data into training and testing, and then we fit the ANN model:

model.fit(X_train,Y_cat_train,epochs=8, validation_data=(X_test, y_cat_test))

After that, we want to retrieve the names of the layers in model and store them in a list:

list(map(lambda x: x.name, model.layers))

Finally, we create a new model that takes the same input as the original model but produces output from a specific layer called âdenseâ:

intermediate_layer_model=Model(inputs=model.input, outputs=model.get_layer(\”dense\”).output)

This is useful for extracting information from intermediate layers of a neural network.

Letâs take a look at this model-state data:

model_state_data_layer1=pd.read_csv(\”first_layer_dense.csv\”,header=None)model_state_data_layer2=pd.read_csv(\”second_layer_dense.csv\”,header=None)

model_state_data_layer1.head(4)

\"\" — Figure 3: Snapshot of Model-state data (First layer). | Image by author

The model-state data of the first neural layer is comprised by 32 columns and 4 rows.

With just a few lines of code, we are able to extract data from the intermediate layers of a neural network.

To study the effectiveness of the new method, Iâll be using data from both the first and second layers of the neural network.

Source link

An Exploration of Model-State Data in Anomaly Detection | by Sara NÃ³brega | Apr, 2024

Oil prices fall slightly after Israel fends off large-scale aerial attack by Iran

Best Career Options and courses after 12th Commerce in 2024

Related Posts

How insurance companies can use synthetic data to fight bias

From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

How Game Theory Can Make AI More Reliable

Decoding Decoder-Only Transformers: Insights from Google DeepMind’s Paper

Buffer of Thoughts (BoT): A Novel Thought-Augmented Reasoning AI Approach for Enhancing Accuracy, Efficiency, and Robustness of LLMs

Deciphering Doubt: Navigating Uncertainty in LLM Responses

Best Career Options and courses after 12th Commerce in 2024

Aleph Alpha, Mistral AI and AI21 are some of the challengers to the U.S.'s generative AI clout

Decoding the Differences: Data Science vs Machine Learning

Leave a Reply Cancel reply

Amazon’s Bedrock and Titan Generative AI Services Enter General Availability

Fireworks AI Open Sources FireLLaVA: A Commercially-Usable Version of the LLaVA Model Leveraging Only OSS Models for Data Generation and Training

9 Best Open Source Text-to-Speech (TTS) Engines

Creating contrast themes with CSS prefers-contrast and JavaScript

Creating Fluid Typography with the CSS clamp() Function — SitePoint

Link between adversity, psychiatric and cognitive decline

Can You Guess What Percentage Of Their Wealth The Rich Keep In Cash?

AI Compared: Which Assistant Is the Best?

How insurance companies can use synthetic data to fight bias

5 SLA metrics you should be monitoring

From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

UGRO Capital: Targeting to hit milestone of Rs 20,000 cr loan book in 8-10 quarters: Shachindra Nath

CATEGORIES

SITEMAP

Welcome Back!

Create New Account!

Retrieve your password