dataset
andmestik (2)
olemus
ISO/IEC 22989:
ühise vorminguga andmete kogum; näiteks
(i) trellsiltidega #ragbi ja #jalgpall märgistatud ikroblogipostitused juunist 2020
(ii) lillede makrofotod formaadis 256x256 pikslit
Märkus: andmestikke saab kasutada intellektitehnilise mudeli valideerimiseks või testimiseks, masinõppe kontekstis aga ka masinõppe algoritmi treenimiseks.
= collection of data with a shared format
EXAMPLE 1: Micro-blogging posts from June 2020 associated with hashtags #rugby and #football.
EXAMPLE 2: Macro photographs of flowers in 256x256 pixels.
Note. Datasets can be used for validating or testing an AI model. In a machine learning (3.3.5) context, datasets can also be used to train a machine learning algorithm
ülevaateid
https://www.sprinkledata.com/blogs/what-is-a-dataset-types-examples-and-the-techniques-involved
https://unacademy.com/content/csir-ugc/study-material/mathematical-sciences/a-short-note-on-types-of-datasets/
https://www.nimbleway.com/blog/what-are-datasets
näiteid
https://mavenanalytics.io/data-playground
https://www.dataquest.io/blog/free-datasets-for-projects/
https://monolixsuite.slp-software.com/monolix/2024R1/data-set-examples