
Question		Answer
siatka algorytmu k sąsiadów		k-nearest neighbors grid
węzeł jest czysty, gdy wszystkie próbki w węźle należą do tej samej klasy		a node is pure when all samples in it belong to the same class
p_i,k - współczynnik występowania klasy k wśród próbek uczących w i-tym węźle		p_i,k: the proportion of class-k instances among the training samples in the i-th node
nadmierne dopasowanie		overfitting
ostre granice modelu drzew decyzyjnych, lasy losowe wygładzają te różnice		decision tree models have sharp decision boundaries; random forests smooth them out
Wzajemna informacja (MI) między dwiema zmiennymi losowymi jest wartością nieujemną, która mierzy zależność między zmiennymi. Jest równa zero wtedy i tylko wtedy, gdy dwie zmienne losowe są niezależne, a wyższe wartości oznaczają większą zależność.		Mutual information (MI) between two random variables is a non-negative value that measures the dependency between the variables. It is equal to zero if and only if the two random variables are independent, and higher values mean higher dependency.
maszyna wektorów nośnych (SVM)		support vector machine (SVM)
macierz konfuzji		confusion matrix
precyzja - odpowiada na pytanie, jaka proporcja pozytywnych identyfikacji była rzeczywiście poprawna.		precision: the proportion of positive identifications that were actually correct.
przywołanie - odpowiada na pytanie, jaki odsetek rzeczywistych pozytywów został zidentyfikowany poprawnie.		recall: the proportion of actual positives that were identified correctly.
FPR, FNR		False Positive Rate (FPR): the type I error rate. False Negative Rate (FNR): the type II error rate.
F1 score - średnia harmoniczna precyzji i przywołania		F1 score: the harmonic mean of precision and recall
niedopasowanie modelu		underfitting
danych nie da się oddzielić liniowo		the data is not linearly separable
Parquet (format danych)		Parquet: a columnar data file format
NoSQL - nierelacyjne		NoSQL: non-relational. A NoSQL database is non-relational, as opposed to a relational (SQL) one.
NoSQL - rozproszone		NoSQL: distributed. A distributed database designed to manage large-scale data while maintaining high performance.
NoSQL - skalowalność		NoSQL: scalability, throughput, and availability
przekleństwo wymiarowości		curse of dimensionality
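
The confusion-matrix cards above (precision, recall, FPR, FNR, F1 score) can be checked against a small worked example. The sketch below is only an illustration in plain Python: the function name `binary_metrics` and the sample labels are made up for this note, and returning 0.0 when a denominator is zero is an assumed convention, not something the cards specify.

```python
def binary_metrics(y_true, y_pred):
    """Count confusion-matrix cells for 0/1 labels and derive the usual rates."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives (type I errors)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives (type II errors)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives

    def ratio(num, den):
        # Assumed convention: report 0.0 when the denominator is zero.
        return num / den if den else 0.0

    precision = ratio(tp, tp + fp)   # correct share of positive predictions
    recall = ratio(tp, tp + fn)      # share of actual positives that were found
    f1 = ratio(2 * precision * recall, precision + recall)  # harmonic mean
    fpr = ratio(fp, fp + tn)         # false positive rate
    fnr = ratio(fn, fn + tp)         # false negative rate
    return {
        "tp": tp, "fp": fp, "fn": fn, "tn": tn,
        "precision": precision, "recall": recall,
        "f1": f1, "fpr": fpr, "fnr": fnr,
    }


# Hypothetical labels: 4 actual positives, 6 actual negatives.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]
metrics = binary_metrics(y_true, y_pred)
print(metrics)
```

With these made-up labels there are 3 true positives, 1 false positive, 1 false negative and 5 true negatives, so precision, recall and F1 all come out to 0.75, FPR is 1/6 and FNR is 0.25. Note that FNR equals 1 - recall.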

data science
