Datasets
edgemark.models.datasets.data_template
This module contains a template class that other datasets should inherit from.
DatasetSupervisorTemplate
This class is a template for datasets. In order to create a new dataset, you should inherit from this class and implement its abstract functions.
Source code in edgemark/models/datasets/data_template.py
6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 | |
__init__
__init__(**kwargs)
Initializes the class by setting the following attributes
self.train_x (numpy.ndarray): The training data. self.train_y (numpy.ndarray): The training labels. self.test_x (numpy.ndarray): The test data. self.test_y (numpy.ndarray): The test labels.
For FC and CNN models, the following attributes should also be set: self.feature_shape (tuple): The shape of the features. self.num_labels (int): The number of labels. self.output_activation (str): The name of the activation function of the output layer. self.loss_function (str | tf.keras.losses.Loss): The name of the loss function or the loss function itself. self.metrics (list): The list of metrics.
For RNN models, the following attributes should also be set: self.input_size (int): The size of each element in the sequence. self.output_size (int): The size of the output. self.sequence_length (int): The length of the sequences. self.sequential_output (bool): If True, the output is sequential. self.output_activation (str): The name of the activation function of the output layer. self.loss_function (str | tf.keras.losses.Loss): The name of the loss function or the loss function itself. self.metrics (list): The list of metrics. self.char2index (dict, optional): The dictionary that maps characters to indices. Can be used for text generation. self.index2char (list, optional): The list that maps indices to characters. Can be used for text generation.
Source code in edgemark/models/datasets/data_template.py
12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 | |
load_dataset
load_dataset(**kwargs)
Loads the dataset and returns it in the format ((trainX, trainY), (testX, testY)). The data should be in numpy float32 and shuffled.
Returns:
| Type | Description |
|---|---|
tuple
|
((trainX, trainY), (testX, testY)) |
Source code in edgemark/models/datasets/data_template.py
60 61 62 63 64 65 66 67 68 | |