{ "cells": [ { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [], "source": [ "import matplotlib.pylab as plt\n", "import numpy as np\n", "\n", "from collections import Counter" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Datasets\n", "\n", "Scikit-lego includes several datasets which can be used for testing purposes. Each dataset has different options for returning the data:\n", "\n", "- When setting ``as_frame`` to True, the data, including the target, is returned as a dataframe.\n", "- When setting ``return_X_y`` to True, the data is returned directly as ``(data, target)`` instead of a dict object. \n", "\n", "This notebook describes the different sets included in Scikit-lego:\n", "\n", "- `sklego.datasets.load_abalone` loads in the abalone dataset\n", "- `sklego.datasets.load_arrests` loads in a dataset with fairness concerns\n", "- `sklego.datasets.load_chicken` loads in the joyful chickweight dataset\n", "- `sklego.datasets.load_heroes` loads a heroes of the storm dataset\n", "- `sklego.datasets.load_hearts` loads a dataset about hearts\n", "- `sklego.datasets.load_penguins` loads a lovely dataset about penguins\n", "- `sklego.datasets.fetch_creditcard` fetch a fraud dataset from openml\n", "- `sklego.datasets.make_simpleseries` make a simulated timeseries" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Abalone\n", "\n", "Loads the abalone dataset where the goal is to predict the gender of the creature." ] }, { "cell_type": "code", "execution_count": 2, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
sexlengthdiameterheightwhole_weightshucked_weightviscera_weightshell_weightrings
0M0.4550.3650.0950.51400.22450.10100.15015
1M0.3500.2650.0900.22550.09950.04850.0707
2F0.5300.4200.1350.67700.25650.14150.2109
3M0.4400.3650.1250.51600.21550.11400.15510
4I0.3300.2550.0800.20500.08950.03950.0557
\n", "
" ], "text/plain": [ " sex length diameter height whole_weight shucked_weight viscera_weight \\\n", "0 M 0.455 0.365 0.095 0.5140 0.2245 0.1010 \n", "1 M 0.350 0.265 0.090 0.2255 0.0995 0.0485 \n", "2 F 0.530 0.420 0.135 0.6770 0.2565 0.1415 \n", "3 M 0.440 0.365 0.125 0.5160 0.2155 0.1140 \n", "4 I 0.330 0.255 0.080 0.2050 0.0895 0.0395 \n", "\n", " shell_weight rings \n", "0 0.150 15 \n", "1 0.070 7 \n", "2 0.210 9 \n", "3 0.155 10 \n", "4 0.055 7 " ] }, "execution_count": 2, "metadata": {}, "output_type": "execute_result" } ], "source": [ "from sklego.datasets import load_abalone\n", "\n", "df_abalone = load_abalone(as_frame=True)\n", "df_abalone.head()" ] }, { "cell_type": "code", "execution_count": 3, "metadata": {}, "outputs": [ { "data": { "image/png": "iVBORw0KGgoAAAANSUhEUgAAAX0AAAEICAYAAACzliQjAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADh0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uMy4yLjIsIGh0dHA6Ly9tYXRwbG90bGliLm9yZy+WH4yJAAAXQ0lEQVR4nO3de7hddX3n8fenRFCLJVyOjCSBUI1Wht6YFOk4tbRYAVHDH9bCqESLT9oRtRY7Ct5ra4u1lUpH7UTJAOqAlOoQFauIOtR55BKsIhfRU25J5HK4ykVF9Dt/rF9geziXnLNPzgms9+t59nPW+q3fWuu7987z2Su/tdbeqSokSf3wcwtdgCRp/hj6ktQjhr4k9YihL0k9YuhLUo8Y+pLUI4a+ppXkH5O8bY62tXeSe5Ps0Oa/kuRVc7Httr3PJVk9V9ubwX7/MsltSW6e730P1PBHSf5+ofY/nSQvTPKJha6j7+J1+v2W5HpgT+BB4CfAVcAZwNqq+ukstvWqqvriDNb5CvCxqvrITPbV1n0n8LSqetlM151LSfYGrgH2qapbF6iGHYF/Bw6qqs1JlgPXAY+rqgcXoJ4J95/kCuC/VtXl812TOh7pC+CFVfUkYB/gJOBNwKlzvZMki+Z6m9uJvYHbFyrwm1XAt6tq81xsbBu+V2cCa7bRtrUVDH09pKrurqr1wB8Aq5PsD5DktCR/2ab3SPKZJHcluSPJvyb5uSQfpQu/T7fhmzcmWZ6kkhyb5EbgSwNtg6Hy1CSXJPl+knOT7Nb2dXCSTYM1Jrk+yXOTHAa8GfiDtr9vtuUPDRe1ut6a5IYktyY5I8kubdmWOlYnubENzbxlstcmyS5t/bG2vbe27T8XOB/Yq9Vx2gTrTviatWV7Jfnntt3rkryute+WZFOSF7b5nZOMJjlmkhIPB/7vwPyF7e9dra7fTPLUJF9Kcnt7vh9Psnjca/umJJcD9yVZlOSY9nxvT/K2La//wOt7QpJ/b8vP3vLeTbT/Nv8V4IjJXmdte4a+HqGqLgE2Ab81weI3tGUjdMNCb+5WqZcDN9L9r2HnqvqbgXV+G3gmcOgkuzwG+EPgKXTDTKdsRY3/AvwV8Im2v1+doNsr2uN3gF8Edgb+x7g+/wV4BnAI8PYkz5xkl/8A7NK289ut5le2oazDge+1Ol4xwboTvmYt+D8NfBNY0mp4fZJDq+oOutfkw0meDJwMfKOqzpikvl+mG2La4jnt7+JW19eAAH8N7EX3fiwD3jluO0fThfJi4OnAB4GX0r03u7Q6t3gtcGR7PfYC7gQ+MMX+Aa4Glif5hUmeh7YxQ1+T+R6w2wTtP6YLgH2q6sdV9a81/Ymhd1bVfVX1g0mWf7Sqrqiq+4C3AS/ZcqJ3SC8F3ldV11bVvcCJwFHj/pfx51X1g6r6Jl34PuLDo9VyFHBiVd1TVdcDfwe8fCvrmOw1+w1gpKreVVUPVNW1wIfbvqiqLwD/BFwAPB/4oyn2sRi4Z6oiqmq0qs6vqh9V1RjwPrrAHnRKVW1s79WLgU9X1Ver6gHg7cDge/3HwFuqalNV/YjuA+TF0wwNbalx8RR9tA0Z+prMEuCOCdrfC4wCX0hybZITtmJbG2ew/AbgccAeW1Xl1PZq2xvc9iK6o+0tBq+2uZ/ufwPj7dFqGr+tJRP0nchkr9k+dMNCd2150P0vYLC+tcD+wGlVdfsU+7gTeNJURSTZM8lZSTYn+T7wMR75Og++F3sNzlfV/cBgDfsAnxqo/Wq6iwEG6x9vS413TVWrth1DX4+Q5DfoAu2r45e1I903VNUvAi8Cjk9yyJbFk2xyuv8JLBuY3pvuyPg24D7giQN17UA3RLK12/0eXTANbvtB4JZp1hvvtlbT+G1t1UnTKV6zjcB1VbV44PGkqno+PPR819JdTfXqJE+bYjeX0w3HPLTbCfr8VWv/5ar6BeBldEM+P1PuwPRNwNItM0meAOw+sHwjcPi4+h/fTiZP9t48E7i+qr4/xXPRNmTo6yFJfiHJC4Cz6C6j/NYEfV6Q5GlJAtxNd2S35dLOW+jGvGfqZUn2S/JE4F3AOVX1E+A7wOOTHJHkccBbgZ0G1ruFbnx4sn/HZwJ/mmTfJDvz8DmAGV3C2Go5G3h3kicl2Qc4nu5IeVpTvGaXAPe0k6dPSLJDkv3bhy60sX+6sf33AmdMMex1Hj87VDPW9jH4fjwJuBe4O8kS4L9PU/o5wAuT/Od0l4S+k5/9kPhHutdkn/Y8R5KsmmL/tBo/N81+tQ0Z+oLuipt76I7c3kI31vvKSfquAL5IFx5fAz5YVV9uy/4aeGv77/6fzWD/HwVOoxtqeTzwOuiuJgJeDXyE7qj6ProTolv8U/t7e5KvT7DddW3bF9JdM/5DupOPs/Hatv9r6f4H9L/b9rfGhK9Z+zB5AfBrrb7b6J7rLkn+E90HyzGt33voPgAmG077NPBLSfaCh4Zi3g38v/Z+HAT8OXAA3QfPZ4FPTlV0VV3ZnvdZdEf99wK3Aj9qXd4PrKcbtroHuAh41hT7h+5E8f+c9hXTNuPNWdJjRJI1wH5V9fpttP2d6cbiV1TVdbNY/4XAy6vqJXNenLaaoS9pUi2oL6Ab1vk7uiP5A7biii1tpxzekTSVVXQnxL9HN0x1lIH/6DZt6CdZl+5uxivGtb82ybeTXJnkbwbaT2x3Dl6T5NCB9sNa2+hWXuYnaYFV1avaVTm7VNUhVXXN9Gtpezbt8E6S59CdwDmjqrbclv87dCf8jqiqHyV5clXdmmQ/uismDqS7xveLPHwZ2XeA36M7EXcpcHRVXbUNnpMkaRLTfqlSVV2Y7hvzBv034KR2Fx4DXzS1CjirtV+XZJTuAwBgtN1xSJKzWt8pQ3+PPfao5cvH71qSNJXLLrvstqoamWjZbL9J7+nAbyV5N91lcH9WVZfS3dBz0UC/TTx81+LGce3PmmjD7QqENQB77703GzZsmGWJktRPSW6YbNlsT+QuovteloPobvA4u914MrSqWltVK6tq5cjIhB9UkqRZmu2R/ibgk+0s/iVJfkr3HR6b+dlb6pfy8K3qk7VLkubJbI/0/w/d19WS5OnAjnR3E66n+xbDnZLsS3eJ1yV0J25XtNvhd6T7FsH1wxYvSZqZaY/0k5wJHAzske4HLd5Bd/v5unYZ5wPA6nbUf2WSs+lO0D4IHNduISfJa4DPAzsA69ot3pKkebRd35G7cuXK8kSuJM1MksuqauVEy7wjV5J6xNCXpB4x9CWpRwx9SeqR2V6n/6iw/ITPLnQJj1nXn3TEQpcgaRY80pekHjH0JalHDH1J6hFDX5J6xNCXpB4x9CWpRwx9SeoRQ1+SesTQl6QeMfQlqUcMfUnqEUNfknrE0JekHpk29JOsS3Jr+z3c8cvekKSS7NHmk+SUJKNJLk9ywEDf1Um+2x6r5/ZpSJK2xtYc6Z8GHDa+Mcky4HnAjQPNhwMr2mMN8KHWdze6H1R/FnAg8I4kuw5TuCRp5qYN/aq6ELhjgkUnA28EBn9ZfRVwRnUuAhYneQpwKHB+Vd1RVXcC5zPBB4kkadua1Zh+klXA5qr65rhFS4CNA/ObWttk7RNte02SDUk2jI2NzaY8SdIkZhz6SZ4IvBl4+9yXA1W1tqpWVtXKkZGRbbELSeqt2RzpPxXYF/hmkuuBpcDXk/wHYDOwbKDv0tY2WbskaR7NOPSr6ltV9eSqWl5Vy+mGag6oqpuB9cAx7Sqeg4C7q+om4PPA85Ls2k7gPq+1SZLm0dZcsnkm8DXgGUk2JTl2iu7nAdcCo8CHgVcDVNUdwF8Al7bHu1qbJGkeLZquQ1UdPc3y5QPTBRw3Sb91wLoZ1idJmkPekStJPWLoS1KPTDu8I0mTWX7CZxe6hMes6086Ypts1yN9SeoRQ1+SesTQl6QeMfQlqUc8kavthicFt51tdVJQjz4e6UtSjxj6ktQjhr4k9YihL0k9YuhLUo8Y+pLUI4a+JPWIoS9JPWLoS1KPGPqS1CNb8xu565LcmuSKgbb3Jvl2ksuTfCrJ4oFlJyYZTXJNkkMH2g9rbaNJTpj7pyJJms7WHOmfBhw2ru18YP+q+hXgO8CJAEn2A44C/mNb54NJdkiyA/AB4HBgP+Do1leSNI+mDf2quhC4Y1zbF6rqwTZ7EbC0Ta8CzqqqH1XVdcAocGB7jFbVtVX1AHBW6ytJmkdzMab/h8Dn2vQSYOPAsk2tbbL2R0iyJsmGJBvGxsbmoDxJ0hZDhX6StwAPAh+fm3KgqtZW1cqqWjkyMjJXm5UkMcT36Sd5BfAC4JCqqta8GVg20G1pa2OKdknSPJnVkX6Sw4A3Ai+qqvsHFq0HjkqyU5J9gRXAJcClwIok+ybZke5k7/rhSpckzdS0R/pJzgQOBvZIsgl4B93VOjsB5ycBuKiq/riqrkxyNnAV3bDPcVX1k7ad1wCfB3YA1lXVldvg+UiSpjBt6FfV0RM0nzpF/3cD756g/TzgvBlVJ0maU96RK0k9YuhLUo8Y+pLUI4a+JPWIoS9JPWLoS1KPGPqS1COGviT1iKEvST1i6EtSjxj6ktQjhr4k9YihL0k9YuhLUo8Y+pLUI4a+JPWIoS9JPWLoS1KPTBv6SdYluTXJFQNtuyU5P8l3299dW3uSnJJkNMnlSQ4YWGd16//dJKu3zdORJE1la470TwMOG9d2AnBBVa0ALmjzAIcDK9pjDfAh6D4k6H5Q/VnAgcA7tnxQSJLmz7ShX1UXAneMa14FnN6mTweOHGg/ozoXAYuTPAU4FDi/qu6oqjuB83nkB4kkaRub7Zj+nlV1U5u+GdizTS8BNg7029TaJmt/hCRrkmxIsmFsbGyW5UmSJjL0idyqKqDmoJYt21tbVSurauXIyMhcbVaSxOxD/5Y2bEP7e2tr3wwsG+i3tLVN1i5JmkezDf31wJYrcFYD5w60H9Ou4jkIuLsNA30eeF6SXdsJ3Oe1NknSPFo0XYckZwIHA3sk2UR3Fc5JwNlJjgVuAF7Sup8HPB8YBe4HXglQVXck+Qvg0tbvXVU1/uSwJGkbmzb0q+roSRYdMkHfAo6bZDvrgHUzqk6SNKe8I1eSesTQl6QeMfQlqUcMfUnqEUNfknrE0JekHjH0JalHDH1J6hFDX5J6xNCXpB4x9CWpRwx9SeoRQ1+SesTQl6QeMfQlqUcMfUnqEUNfknrE0JekHhkq9JP8aZIrk1yR5Mwkj0+yb5KLk4wm+USSHVvfndr8aFu+fC6egCRp68069JMsAV4HrKyq/YEdgKOA9wAnV9XTgDuBY9sqxwJ3tvaTWz9J0jwadnhnEfCEJIuAJwI3Ab8LnNOWnw4c2aZXtXna8kOSZMj9S5JmYNahX1Wbgb8FbqQL+7uBy4C7qurB1m0TsKRNLwE2tnUfbP13H7/dJGuSbEiyYWxsbLblSZImMMzwzq50R+/7AnsBPw8cNmxBVbW2qlZW1cqRkZFhNydJGjDM8M5zgeuqaqyqfgx8Eng2sLgN9wAsBTa36c3AMoC2fBfg9iH2L0maoWFC/0bgoCRPbGPzhwBXAV8GXtz6rAbObdPr2zxt+ZeqqobYvyRphoYZ07+Y7oTs14FvtW2tBd4EHJ9klG7M/tS2yqnA7q39eOCEIeqWJM3Coum7TK6q3gG8Y1zztcCBE/T9IfD7w+xPkjQc78iVpB4x9CWpRwx9SeoRQ1+SesTQl6QeMfQlqUcMfUnqEUNfknrE0JekHjH0JalHDH1J6hFDX5J6xNCXpB4x9CWpRwx9SeoRQ1+SesTQl6QeMfQlqUeGCv0ki5Ock+TbSa5O8ptJdktyfpLvtr+7tr5JckqS0SSXJzlgbp6CJGlrDXuk/37gX6rql4BfBa6m+8HzC6pqBXABD/8A+uHAivZYA3xoyH1LkmZo1qGfZBfgOcCpAFX1QFXdBawCTm/dTgeObNOrgDOqcxGwOMlTZl25JGnGhjnS3xcYA/5Xkn9L8pEkPw/sWVU3tT43A3u26SXAxoH1N7W2n5FkTZINSTaMjY0NUZ4kabxhQn8RcADwoar6deA+Hh7KAaCqCqiZbLSq1lbVyqpaOTIyMkR5kqTxhgn9TcCmqrq4zZ9D9yFwy5Zhm/b31rZ8M7BsYP2lrU2SNE9mHfpVdTOwMckzWtMhwFXAemB1a1sNnNum1wPHtKt4DgLuHhgGkiTNg0VDrv9a4ONJdgSuBV5J90FydpJjgRuAl7S+5wHPB0aB+1tfSdI8Gir0q+obwMoJFh0yQd8Cjhtmf5Kk4XhHriT1iKEvST1i6EtSjxj6ktQjhr4k9YihL0k9YuhLUo8Y+pLUI4a+JPWIoS9JPWLoS1KPGPqS1COGviT1iKEvST1i6EtSjxj6ktQjhr4k9YihL0k9MnToJ9khyb8l+Uyb3zfJxUlGk3yi/X4uSXZq86Nt+fJh9y1Jmpm5ONL/E+Dqgfn3ACdX1dOAO4FjW/uxwJ2t/eTWT5I0j4YK/SRLgSOAj7T5AL8LnNO6nA4c2aZXtXna8kNaf0nSPBn2SP/vgTcCP23zuwN3VdWDbX4TsKRNLwE2ArTld7f+kqR5MuvQT/IC4NaqumwO6yHJmiQbkmwYGxuby01LUu8Nc6T/bOBFSa4HzqIb1nk/sDjJotZnKbC5TW8GlgG05bsAt4/faFWtraqVVbVyZGRkiPIkSePNOvSr6sSqWlpVy4GjgC9V1UuBLwMvbt1WA+e26fVtnrb8S1VVs92/JGnmtsV1+m8Cjk8ySjdmf2prPxXYvbUfD5ywDfYtSZrCoum7TK+qvgJ8pU1fCxw4QZ8fAr8/F/uTJM2Od+RKUo8Y+pLUI4a+JPWIoS9JPWLoS1KPGPqS1COGviT1iKEvST1i6EtSjxj6ktQjhr4k9YihL0k9YuhLUo8Y+pLUI4a+JPWIoS9JPWLoS1KPGPqS1COGviT1yKxDP8myJF9OclWSK5P8SWvfLcn5Sb7b/u7a2pPklCSjSS5PcsBcPQlJ0tYZ5kj/QeANVbUfcBBwXJL9gBOAC6pqBXBBmwc4HFjRHmuADw2xb0nSLMw69Kvqpqr6epu+B7gaWAKsAk5v3U4HjmzTq4AzqnMRsDjJU2ZduSRpxuZkTD/JcuDXgYuBPavqprboZmDPNr0E2Diw2qbWNn5ba5JsSLJhbGxsLsqTJDVDh36SnYF/Bl5fVd8fXFZVBdRMtldVa6tqZVWtHBkZGbY8SdKAoUI/yePoAv/jVfXJ1nzLlmGb9vfW1r4ZWDaw+tLWJkmaJ8NcvRPgVODqqnrfwKL1wOo2vRo4d6D9mHYVz0HA3QPDQJKkebBoiHWfDbwc+FaSb7S2NwMnAWcnORa4AXhJW3Ye8HxgFLgfeOUQ+5YkzcKsQ7+qvgpkksWHTNC/gONmuz9J0vC8I1eSesTQl6QeMfQlqUcMfUnqEUNfknrE0JekHjH0JalHDH1J6hFDX5J6xNCXpB4x9CWpRwx9SeoRQ1+SesTQl6QeMfQlqUcMfUnqEUNfknrE0JekHpn30E9yWJJrkowmOWG+9y9JfTavoZ9kB+ADwOHAfsDRSfabzxokqc/m+0j/QGC0qq6tqgeAs4BV81yDJPXWonne3xJg48D8JuBZgx2SrAHWtNl7k1wzT7UttD2A2xa6iK2V9yx0BduFR8175vv1kL68Z/tMtmC+Q39aVbUWWLvQdcy3JBuqauVC16Gt53v26ON7Nv/DO5uBZQPzS1ubJGkezHfoXwqsSLJvkh2Bo4D181yDJPXWvA7vVNWDSV4DfB7YAVhXVVfOZw3bsd4NaT0G+J49+vT+PUtVLXQNkqR54h25ktQjhr4k9Yihv4CSVJKPDcwvSjKW5DMLWZemluQnSb4x8Fi+0DVpeknuXegatgfb3XX6PXMfsH+SJ1TVD4Dfw0tYHw1+UFW/ttBFSLPhkf7COw84ok0fDZy5gLVIeowz9BfeWcBRSR4P/Apw8QLXo+k9YWBo51MLXYw0Ew7vLLCquryNCR9Nd9Sv7Z/DO3rUMvS3D+uBvwUOBnZf2FIkPZYZ+tuHdcBdVfWtJAcvdDGSHrsM/e1AVW0CTlnoOiQ99vk1DJLUI169I0k9YuhLUo8Y+pLUI4a+JPWIoS9JPWLoS1KPGPqS1CP/H4Gq92f8u2YZAAAAAElFTkSuQmCC\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "X, y = load_abalone(return_X_y=True)\n", "\n", "plt.bar(Counter(y).keys(), Counter(y).values())\n", "plt.title('Distribution of sex (target)');" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Arrests\n", "\n", "Loads the arrests dataset which can serve as a benchmark for fairness. It is data on\n", "the police treatment of individuals arrested in Toronto for simple possession of small\n", "quantities of marijuana. The goal is to predict whether or not the arrestee was released\n", "with a summons while maintaining a degree of fairness." ] }, { "cell_type": "code", "execution_count": 4, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
releasedcolouryearagesexemployedcitizenchecks
0YesWhite200221MaleYesYes3
1NoBlack199917MaleYesYes3
2YesWhite200024MaleYesYes3
3NoBlack200046MaleYesYes1
4YesBlack199927FemaleYesYes1
\n", "
" ], "text/plain": [ " released colour year age sex employed citizen checks\n", "0 Yes White 2002 21 Male Yes Yes 3\n", "1 No Black 1999 17 Male Yes Yes 3\n", "2 Yes White 2000 24 Male Yes Yes 3\n", "3 No Black 2000 46 Male Yes Yes 1\n", "4 Yes Black 1999 27 Female Yes Yes 1" ] }, "execution_count": 4, "metadata": {}, "output_type": "execute_result" } ], "source": [ "from sklego.datasets import load_arrests\n", "\n", "df_arrests = load_arrests(as_frame=True)\n", "df_arrests.head()" ] }, { "cell_type": "code", "execution_count": 5, "metadata": {}, "outputs": [ { "data": { "image/png": "iVBORw0KGgoAAAANSUhEUgAAAX0AAAEICAYAAACzliQjAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADh0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uMy4yLjIsIGh0dHA6Ly9tYXRwbG90bGliLm9yZy+WH4yJAAAU5ElEQVR4nO3de5RlZX3m8e9DN4gKcpEOgebSEBjHJl5wEdCJEwk4XLzBygTFZaCDZNAVwpiEJEBwhCAYzCyvM/HCCgQUFRnGDHiJhHCJOgaxFSUCi6FFke4gNHSDoIiCv/ljv2WORVVXFVRXQb3fz1pn1d7v3vvdv33Oqefs8+5d3akqJEl92GS+C5AkzR1DX5I6YuhLUkcMfUnqiKEvSR0x9CWpI4Z+B5J8KMl/m6W+dknyYJJFbf6aJL83G323/v4+yYrZ6m8G+z0zyT1Jvj8LfS1LUkkWz0ZtsyHJd5O8fAPLD07yf+aypplI8vwkX57vOhYCQ/8prv0yP5TkgST3Jflykjcn+flrW1Vvrqq3T7OvSYOh9fW9qtqiqh6dhdpPT3LhuP4PraoLnmjfM6xjF+BEYHlV/fJc7vtJ5Czg7LGZ9qG1x3wVM37/VXUDcF+SV89XTQuFob8wvLqqtgR2ZfjFPQk4d7Z38mQ6c51luwD3VtXd01l5oT0PSX4N2Kqqrp2l/jJ60jGLPga8aSP02xVDfwGpqvur6jLgdcCKJL8KkOT8JGe26e2SfKZ9K1iX5ItJNknyUYbw+3QbvvmzkWGKY5N8D7hqkqGLX0lyXZIfJLk0ybZtX/snWT1a49i3iSSHAH8OvK7t75tt+c+Hi1pdb01ye5K7k3wkyVZt2VgdK5J8rw3NnDrZc5Nkq7b92tbfW1v/LweuAHZsdZw/wbb7J1md5KQ2/PO3bduTk3w7yb1JLh477kn2fW6SO5OsaUNJY8Njv5LkqtbHPUk+lmTrkW1Pats8kOSWJAeOPDeT7j/JUe04793Q89IcCvzTyLZfaJPfbM/J65Js0943a5Osb9M7jWxzTZKzkvxf4EfA7kkOajXfn+QDSf4pI0OBSd6Y5ObW3+VJdp1s/23+GuDAJE+b4ni0AYb+AlRV1wGrgf84weIT27IlwPYMwVtVdRTwPYZvDVtU1V+NbPMy4LnAwZPs8mjgjcAOwCPA+6dR4+eBdwCfbPt7wQSr/W57/CawO7AF8D/HrfNS4DnAgcDbkjx3kl3+D2Cr1s/LWs3HVNU/MoTev7Y6fneS7X8Z2Jbh29RxwAnA4a2vHYH1wF9Psu35DM/LHsDewEHAWPgF+MvWx3OBnYHTAZI8B/gD4NfaN7mDge+27Sbdf5LlwAeBo9qyZwM/D+gJPA+4ZWymqn6jTb6gPSefZMiKv23HvwvwEI99LY5qz82WwP3AJcApbf+3AP9hbMUkhzG8936L4b34ReATG9g/VbUG+CnD663Hq6p8PIUfDCHw8gnarwVObdPnA2e26TOAS4E9puoLWAYUsPsEbYvb/DXA2SPLlwM/ARYB+wOrJ9sHQ7hdOG75NcDvtekrgd8fWfYchl/6xSN17DSy/DrgyAmOa1GraflI25uAa9r0Y+oct/3+bfvNR9puBg4cmd9hgtoWM3ywPgw8fWTd1wNXT7Kvw4Hr2/QewN3Ay4FNx623of2/DbhoZNkzW/2PeZ+05VcAbx7XVhO9R0aWvxBYP+51O2Nk/mjgn0fmA9wx8tr+PXDsyPJNGL4h7Lqh/QNrgN+Y79+7p/LDM/2FaymwboL2/w6sAv4hyW1JTp5GX3fMYPntwKbAdtOqcsN2bP2N9j0WpGNG77b5EcO3gfG2azWN72vpDGpZW1U/HpnfFfi7Nkx2H0MIPzqutrH1NgXuHFn3w8AvASTZPslFbQjnB8CFrV6qahXwhwwfjne39Xacxv53ZOQ1qaofAvdu4NjWM5ydTyrJM5J8uA0Z/QD4ArD12DBVM/o+GF9DMXzDHH1e3jdS/zqGD4apXpMtgfumWEcbYOgvQBkuzC0FvjR+WVU9UFUnVtXuwGuAPx4bJ2Y4u5rIVP8U684j07swnHHeA/wQeMZIXYsYvspPt99/ZQiH0b4fAe6aYrvx7mk1je9rzQz6GF/rHcChVbX1yGPzGoYgxq/3MLDdyHrPqqq92vJ3tL6fV1XPAn6HIfyGnVZ9vKpe2mov4J3T2P+djLwmSZ7BMMQymRuAfzfF8Z/I8E1rv1bn2BBMRtYZfY7uZGRIKUn4xSGmO4A3jav/6VU16W2ZSZYCmzEyFKWZM/QXkCTPSvIq4CKGYZN/mWCdVyXZo/0S3s9wdviztvguhjHvmfqdJMtbuJwBXFLDLZ3/D9g8ySuTbAq8FRi9CHcXsCyT3+nxCeCPkuyWZAv+7RrAIzMprtVyMXBWki3bBcM/Zjirfrw+1Pobu/i4pI1Tj9/3ncA/AO9qr88m7eLty9oqWwIPAve3UPvTsW2TPCfJAe3C5Y8ZxtHHXqsN7f8S4FVJXppkM4bXZEO/659juDYwavx7Ycu2//vaBePTNtAfwGeB5yU5PMNF/+MZrouM+RBwSpK9Wv1bJTliA/un1XhVVT08xb61AYb+wvDpJA8wnD2dCrwbOGaSdfcE/pEhaP4Z+EBVXd2W/SXw1vaV+09msP+PMlw3+D6wOfBfYbibCPh94G8Yzqp/yC9+xf9f7ee9Sb4+Qb/ntb6/AHyHIfhOmEFdo05o+7+N4RvQx1v/j9f7gMsYhskeYLiGst8k6x7NcIZ6E8NQyiUMY/AAfwG8iOED+LPAp0a2exrDLbj3MDy3v8RwYXSD+6+qGxlC9uMMZ9zr+cXn/RdU1dcZPnRG6z8duKC9F14LvBd4eqvlWuDzk/XX+rwHOAL4K4ahpeXASoZvPVTV3zF8a7moDRd9i+GC+mT7B3gDw4eFnoC0iyOSOpbkIIaL5odvpP43YfjgecPIScZMtn8+8OGqesmsF9cZQ1/SRpHkYOArDMNCf8rw7WP3qnpoXgvrnMM7kjaWlwDfZhgSejVwuIE//zzTl6SOeKYvSR15Uv/DUdttt10tW7ZsvsuQpKeUr33ta/dU1ZKJlj2pQ3/ZsmWsXLlyvsuQpKeUJLdPtszhHUnqiKEvSR0x9CWpI4a+JHXE0Jekjhj6ktQRQ1+SOmLoS1JHDH1J6siT+i9yn6hlJ392vkvQk9R3z37lfJcgzQvP9CWpI4a+JHXE0Jekjhj6ktQRQ1+SOmLoS1JHDH1J6oihL0kdMfQlqSOGviR1xNCXpI4Y+pLUkWmHfpJFSa5P8pk2v1uSryRZleSTSTZr7U9r86va8mUjfZzS2m9JcvBsH4wkacNmcqb/FuDmkfl3Au+pqj2A9cCxrf1YYH1rf09bjyTLgSOBvYBDgA8kWfTEypckzcS0Qj/JTsArgb9p8wEOAC5pq1wAHN6mD2vztOUHtvUPAy6qqoer6jvAKmDf2TgISdL0TPdM/73AnwE/a/PPBu6rqkfa/GpgaZteCtwB0Jbf39b/efsE2/xckuOSrEyycu3atTM4FEnSVKYM/SSvAu6uqq/NQT1U1TlVtU9V7bNkyZK52KUkdWM6/3PWrwOvSfIKYHPgWcD7gK2TLG5n8zsBa9r6a4CdgdVJFgNbAfeOtI8Z3UaSNAemPNOvqlOqaqeqWsZwIfaqqnoDcDXw2221FcClbfqyNk9bflVVVWs/st3dsxuwJ3DdrB2JJGlKT+T/yD0JuCjJmcD1wLmt/Vzgo0lWAesYPiioqhuTXAzcBDwCHF9Vjz6B/UuSZmhGoV9V1wDXtOnbmODum6r6MXDEJNufBZw10yIlSbPDv8iVpI4Y+pLUEUNfkjpi6EtSRwx9SeqIoS9JHTH0Jakjhr4kdcTQl6SOGPqS1BFDX5I6YuhLUkcMfUnqiKEvSR0x9CWpI4a+JHXE0Jekjhj6ktQRQ1+SOmLoS1JHDH1J6oihL0kdMfQlqSOGviR1xNCXpI4Y+pLUEUNfkjpi6EtSRwx9SeqIoS9JHTH0Jakjhr4kdcTQl6SOGPqS1BFDX5I6YuhLUkcMfUnqiKEvSR2ZMvSTbJ7kuiTfTHJjkr9o7bsl+UqSVUk+mWSz1v60Nr+qLV820tcprf2WJAdvrIOSJE1sOmf6DwMHVNULgBcChyR5MfBO4D1VtQewHji2rX8ssL61v6etR5LlwJHAXsAhwAeSLJrNg5EkbdiUoV+DB9vspu1RwAHAJa39AuDwNn1Ym6ctPzBJWvtFVfVwVX0HWAXsOytHIUmalmmN6SdZlOQbwN3AFcC3gfuq6pG2ympgaZteCtwB0JbfDzx7tH2CbUb3dVySlUlWrl27duZHJEma1LRCv6oeraoXAjsxnJ3/+41VUFWdU1X7VNU+S5Ys2Vi7kaQuzejunaq6D7gaeAmwdZLFbdFOwJo2vQbYGaAt3wq4d7R9gm0kSXNgOnfvLEmydZt+OvCfgJsZwv+322orgEvb9GVtnrb8qqqq1n5ku7tnN2BP4LrZOhBJ0tQWT70KOwAXtDttNgEurqrPJLkJuCjJmcD1wLlt/XOBjyZZBaxjuGOHqroxycXATcAjwPFV9ejsHo4kaUOmDP2qugHYe4L225jg7puq+jFwxCR9nQWcNfMyJUmzwb/IlaSOGPqS1BFDX5I6YuhLUkcMfUnqiKEvSR0x9CWpI4a+JHXE0Jekjhj6ktQRQ1+SOmLoS1JHDH1J6oihL0kdMfQlqSOGviR1xNCXpI4Y+pLUEUNfkjpi6EtSRwx9SeqIoS9JHTH0Jakjhr4kdcTQl6SOGPqS1BFDX5I6YuhLUkcMfUnqiKEvSR0x9CWpI4a+JHXE0Jekjhj6ktQRQ1+SOmLoS1JHDH1J6oihL0kdMfQlqSNThn6SnZNcneSmJDcmeUtr3zbJFUlubT+3ae1J8v4kq5LckORFI32taOvfmmTFxjssSdJEpnOm/whwYlUtB14MHJ9kOXAycGVV7Qlc2eYBDgX2bI/jgA/C8CEBnAbsB+wLnDb2QSFJmhtThn5V3VlVX2/TDwA3A0uBw4AL2moXAIe36cOAj9TgWmDrJDsABwNXVNW6qloPXAEcMqtHI0naoBmN6SdZBuwNfAXYvqrubIu+D2zfppcCd4xstrq1TdY+fh/HJVmZZOXatWtnUp4kaQrTDv0kWwD/G/jDqvrB6LKqKqBmo6CqOqeq9qmqfZYsWTIbXUqSmmmFfpJNGQL/Y1X1qdZ8Vxu2of28u7WvAXYe2Xyn1jZZuyRpjkzn7p0A5wI3V9W7RxZdBozdgbMCuHSk/eh2F8+LgfvbMNDlwEFJtmkXcA9qbZKkObJ4Guv8OnAU8C9JvtHa/hw4G7g4ybHA7cBr27LPAa8AVgE/Ao4BqKp1Sd4OfLWtd0ZVrZuVo5AkTcuUoV9VXwIyyeIDJ1i/gOMn6es84LyZFChJmj3+Ra4kdcTQl6SOGPqS1BFDX5I6YuhLUkcMfUnqiKEvSR0x9CWpI4a+JHXE0Jekjhj6ktQRQ1+SOmLoS1JHDH1J6oihL0kdMfQlqSOGviR1xNCXpI4Y+pLUEUNfkjpi6EtSRwx9SeqIoS9JHTH0Jakjhr4kdcTQl6SOGPqS1BFDX5I6YuhLUkcMfUnqiKEvSR0x9CWpI4a+JHXE0Jekjhj6ktQRQ1+SOmLoS1JHDH1J6oihL0kdmTL0k5yX5O4k3xpp2zbJFUlubT+3ae1J8v4kq5LckORFI9usaOvfmmTFxjkcSdKGTOdM/3zgkHFtJwNXVtWewJVtHuBQYM/2OA74IAwfEsBpwH7AvsBpYx8UkqS5M2XoV9UXgHXjmg8DLmjTFwCHj7R/pAbXAlsn2QE4GLiiqtZV1XrgCh77QSJJ2sge75j+9lV1Z5v+PrB9m14K3DGy3urWNln7YyQ5LsnKJCvXrl37OMuTJE3kCV/IraoCahZqGevvnKrap6r2WbJkyWx1K0ni8Yf+XW3Yhvbz7ta+Bth5ZL2dWttk7ZKkOfR4Q/8yYOwOnBXApSPtR7e7eF4M3N+GgS4HDkqyTbuAe1BrkyTNocVTrZDkE8D+wHZJVjPchXM2cHGSY4Hbgde21T8HvAJYBfwIOAagqtYleTvw1bbeGVU1/uKwJGkjmzL0q+r1kyw6cIJ1Czh+kn7OA86bUXXSArfs5M/Odwl6kvru2a/cKP36F7mS1BFDX5I6YuhLUkcMfUnqiKEvSR0x9CWpI4a+JHXE0Jekjhj6ktQRQ1+SOmLoS1JHDH1J6oihL0kdMfQlqSOGviR1xNCXpI4Y+pLUEUNfkjpi6EtSRwx9SeqIoS9JHTH0Jakjhr4kdcTQl6SOGPqS1BFDX5I6YuhLUkcMfUnqiKEvSR0x9CWpI4a+JHXE0Jekjhj6ktQRQ1+SOmLoS1JHDH1J6oihL0kdMfQlqSOGviR1ZM5DP8khSW5JsirJyXO9f0nq2ZyGfpJFwF8DhwLLgdcnWT6XNUhSz+b6TH9fYFVV3VZVPwEuAg6b4xokqVuL53h/S4E7RuZXA/uNrpDkOOC4NvtgklvmqLaFbjvgnvku4ski75zvCjQB36MjnuB7dNfJFsx16E+pqs4BzpnvOhaaJCurap/5rkOajO/RuTHXwztrgJ1H5ndqbZKkOTDXof9VYM8kuyXZDDgSuGyOa5Ckbs3p8E5VPZLkD4DLgUXAeVV141zW0DGHzPRk53t0DqSq5rsGSdIc8S9yJakjhr4kdcTQXyAy+FKSQ0fajkjy+fmsS5pIkkryrpH5P0ly+jyW1A1Df4Go4eLMm4F3J9k8yRbAO4Dj57cyaUIPA7+VZLv5LqQ3hv4CUlXfAj4NnAS8DbgQODXJdUmuT3IYQJK9Wts3ktyQZM95LFt9eoThbp0/Gr8gybIkV7X35pVJdpn78hYu795ZYJI8E/g68BPgM8CNVXVhkq2B64C9gbOBa6vqY+3vJRZV1UPzVrS6k+RBYEfgBuAFwH8Btqiq05N8Grikqi5I8kbgNVV1+DyWu6AY+gtQkjOAB4HXApsznFUBbAsczBD8pwIfAT5VVbfOR53qV5IHq2qL9l79KfAQ/xb69wA7VNVPk2wK3FlVDgPNEod3FqaftUeA/1xVL2yPXarq5qr6OPAahl+0zyU5YD6LVdfeCxwLPHO+C+mFob+wXQ6ckCQASfZuP3cHbquq9wOXAs+fvxLVs6paB1zMEPxjvszwT7QAvAH44lzXtZAZ+gvb24FNgRuS3NjmYRj2+VaSbwC/yjDMI82XdzH8s8pjTgCOSXIDcBTwlnmpaoFyTF+SOuKZviR1xNCXpI4Y+pLUEUNfkjpi6EtSRwx9SeqIoS9JHfn/Y3LwSBaKtsoAAAAASUVORK5CYII=\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "X, y = load_arrests(return_X_y=True)\n", "\n", "plt.bar(Counter(y).keys(), Counter(y).values())\n", "plt.title('Distribution of released (target)');" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Chicken\n", "Loads the chicken dataset. The chicken data has 578 rows and 4 columns\n", "from an experiment on the effect of diet on early growth of chicks.\n", "The body weights of the chicks were measured at birth and every second\n", "day thereafter until day 20. They were also measured on day 21.\n", "There were four groups on chicks on different protein diets." ] }, { "cell_type": "code", "execution_count": 6, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
weighttimechickdiet
042011
151211
259411
364611
476811
\n", "
" ], "text/plain": [ " weight time chick diet\n", "0 42 0 1 1\n", "1 51 2 1 1\n", "2 59 4 1 1\n", "3 64 6 1 1\n", "4 76 8 1 1" ] }, "execution_count": 6, "metadata": {}, "output_type": "execute_result" } ], "source": [ "from sklego.datasets import load_chicken\n", "\n", "df_chicken = load_chicken(as_frame=True)\n", "df_chicken.head()" ] }, { "cell_type": "code", "execution_count": 7, "metadata": {}, "outputs": [ { "data": { "image/png": "iVBORw0KGgoAAAANSUhEUgAAAXcAAAEICAYAAACktLTqAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADh0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uMy4yLjIsIGh0dHA6Ly9tYXRwbG90bGliLm9yZy+WH4yJAAAYGUlEQVR4nO3de7xcZX3v8c/XgKCCBMxuGiBhE4wXtDbSLaWnyMFiKxcx2lKaHOVisYFWevRoTw1iNfoSibboqS1CY0kBLwEEKXjAVkQk2oq4oyGEmwTYSGJINkEuAYoEfv3jeUYWk5m9Z++ZvWd48n2/XvPaaz3r9ps1e76z5llrZhQRmJlZWV7Q7QLMzKzzHO5mZgVyuJuZFcjhbmZWIIe7mVmBHO5mZgVyuG8HJJ0r6W86tK5ZkrZImpLHvyvpPZ1Yd17fNyWd0Kn1jWG7n5T0gKT7J3AbWyTNbnHekPTyNrd3sqT/1846JpKkoyVd3O06SiVf5/78JmkImA5sBZ4GbgUuBJZGxDPjWNd7IuLbY1jmu8CXI+Kfx7KtvOxi4OUR8a6xLttJkmYBdwD7RMSmbtZSIymAORGxtsG0E0mP08EjLP9C4C7goIhYL6kfuAfYMSK2TkjRI2i2fUlrgP8VEasnu6bS+ci9DEdHxK7APsAS4EPAeZ3eiKQdOr3OHjEL2Nwrwd4h84DbI2J9J1Y2gY/9cmDhBK17+xYRvj2Pb8AQ8Oa6tgOBZ4DX5vHzgU/m4WnA/wceAh4Evkd6kf9SXuYJYAvw10A/EMBJwM+AFZW2HfL6vgucCdwIPAJcAeyRpx0KrGtUL3A48Evgqby9myrre08efgHwEeBeYBPpHclueVqtjhNybQ8Ap4+wn3bLyw/n9X0kr//N+T4/k+s4v8Gy1wN/lId/N2/3qDx+GLCqMu+fArcBvwD+nfRuoDYtSO9UAF4GfCPvsx8BnwS+XzfvKcCd+bE6GxDwauC/SO/StgAPNbm/y4CPVMZ/lte5Jd9+B9gP+A6wOe+/rwBT6x6rDwGrgSeBHYDj8/7bDPwNlf+/vD8Xkd4xbAYuqfwvbLP9yv68p9vPoxJvPnIvUETcCKwD3thg8gfztD5Sd86H0yJxHOkJeHRE7BIRn6ks8z9JofKWJps8nhRqM0jdQ59vocZ/Az4FXJy395sNZjsx394EzAZ2Af6xbp6DgVeSQvajkl7dZJP/QAr42fn+HA+8O1IX1BHAz3MdJzZY9nrSCxV52buBQyrj1wNImkfan39I2r/fIx2ZNnI28Bjw66QXqEbnGd4KvAF4HXAs8JaIuI0U+j/I9U5tsv7fIHU11dTqnZqX+wHpxeJMYE/S4zsTWFy3ngXAUcBU4BXAF4B3kh7r3YC9KvP+JfB20j7Zk/QCd/YI24f0Qtgv6aVN7oeNk8O9XD8H9mjQ/hTpiblPRDwVEd+LfAg1gsUR8VhEPNFk+pciYk1EPEY6mju2dsK1Te8EPhsRd0fEFuA0YH5dF8HHI+KJiLgJuAnY5kUi1zIfOC0iHo2IIeAs4LgW67ieFFiQQurMyvivwp0UumdGxG2R+pU/BcyVtE+Dev4I+FhEPB4RtwIXNNjukoh4KCJ+BlwHzG2xXkhh/OhIM0TE2oi4JiKejIhh4LOV+1Xz+Yi4Lz/2xwDfiIjvR8QvgY+SjsZrTiG9e1oXEU+SXiiOGaVLp1ZjsxcpGyeHe7n2InW71PtbYC3wLUl3S1rUwrruG8P0e4EdSd0/7dozr6+67h1I7zhqqle3PE46uq83LddUv669GszbyA+AV0iaTgrYC4GZkqaRusBW5Pn2Af5e0kOSat1earCdvnw/qvut0T5u5b418wtg15FmkDRd0kWS1kt6BPgy2z5u1br2rI5HxOOk7peafYDLK/f/NlL3UfXxqler8aGRarWxc7gXSNIbSIHy/fpp+cj1gxExG3gb8AFJh9UmN1nlaEf2MyvDs0jvDh4gdTu8uFLXFFKwtbren5MCo7rurcDGUZar90CuqX5dLZ1szCG2EngfsCYftf4n8AHgroh4IM96H3ByREyt3F4UEf9Zt8rhfD/2rrTNpHWtXOK2mtSNMtIyn8rtvxERLwXeRXoxaratDVRqlvQi0rmDmvuAI+ru/86RTuo2q/nVwFBEPNLCfbIxcLgXRNJLJb0VuIh0eeLNDeZ5q6SXSxLwMOnIqnbJ5EZSn/RYvUvS/pJeDHwCuDQingZ+Cuws6ShJO5JOYu5UWW4jqb+12f/hcuD/SNpX0i4820c/pkv5ci2XAGdI2jV3k3yAdKTaquuBU3m2C+a7deMA5wKnSXoNgKTdJP1xk3q+DiyW9GJJryKdA2jVRmDvfLljM1fz3C6WYdLjXH18dyWd3HxY0l7A/x1lu5cCR0v6H3nbi3nui8G5pH28D4Ckvnweotn2yTV+c5Tt2jg43MvwDUmPko6cTif1nb67ybxzgG+TntQ/AL4QEdflaWcCH8lvq/9qDNv/EumKnPuBnYH/DRARDwN/Afwz6Sj5MdLJ3Jqv5b+bJf24wXqX5XWvIF0j/V+kk3bj8Zd5+3eT3tF8Na+/VdeTwnBFk3Ei4nLg08BFuZtjDelkbSOnkk5I3k+6j8tJV6S04jvALcD9kh5oMs83gFdJ2jPX9jhwBvAf+fE9CPg4cADpRf4q0gtOUxFxC2k/XkQ6it9CuoqpVvffA1eSuvweBW4AfnuE7UM6YftPLd5vGwN/iMmsB0j6NPDrEdGxT+dKWgjsHxHv79Q669a/C6mvfE5E3DOO5Y8GjouIYztenDnczbohd8W8ELiZdLnj1aTr+/+1q4WNIgfytaTumLNIR+YHtHDFlU0yd8uYdceupG6Qx4CLSUF5RVcras080onun5O6+OY72HuTj9zNzArkI3czswL1xBdBTZs2Lfr7+7tdhpnZ88rKlSsfiIi+RtN6Itz7+/sZHBzsdhlmZs8rku5tNs3dMmZmBXK4m5kVyOFuZlYgh7uZWYFGDXdJyyRtyr91WGu7WNKqfBuStCq390t6ojLt3Iks3szMGmvlapnzSb9+c2GtISL+pDYs6SzSFw/V3BURY/lRATMz67BRwz0iVuRfLt9G/trYY4Hf62xZZmbWjnb73N8IbIyIOytt+0r6iaTrJTX6DU8zM5tg7X6IaQHP/QHgDcCsiNgs6beAf5X0mka/spK/jnQhwKxZs9osw8zMqsYd7vlHb/8Q+K1aW/5R3Cfz8EpJd5F+6mubj59GxFJgKcDAwEBb317Wv+iqdhYft6ElR3Vlu2Zmo2mnW+bNwO0R8atf1sk/qzUlD88mfSXo3e2VaGZmY9XKpZDLST/H9kpJ6ySdlCfN57ldMgCHAKvzpZGXAqdExIOdLNjMzEbXytUyC5q0n9ig7TLgsvbLMjOzdvgTqmZmBXK4m5kVyOFuZlYgh7uZWYEc7mZmBXK4m5kVyOFuZlYgh7uZWYEc7mZmBXK4m5kVyOFuZlYgh7uZWYEc7mZmBXK4m5kVyOFuZlYgh7uZWYEc7mZmBXK4m5kVyOFuZlYgh7uZWYEc7mZmBRo13CUtk7RJ0ppK22JJ6yWtyrcjK9NOk7RW0h2S3jJRhZuZWXOtHLmfDxzeoP1zETE3364GkLQ/MB94TV7mC5KmdKpYMzNrzajhHhErgAdbXN884KKIeDIi7gHWAge2UZ+ZmY1DO33up0panbttds9tewH3VeZZl9u2IWmhpEFJg8PDw22UYWZm9cYb7ucA+wFzgQ3AWWNdQUQsjYiBiBjo6+sbZxlmZtbIuMI9IjZGxNMR8QzwRZ7telkPzKzMunduMzOzSTSucJc0ozL6DqB2Jc2VwHxJO0naF5gD3NheiWZmNlY7jDaDpOXAocA0SeuAjwGHSpoLBDAEnAwQEbdIugS4FdgKvDcinp6Y0s3MrJlRwz0iFjRoPm+E+c8AzminKDMza48/oWpmViCHu5lZgRzuZmYFcribmRXI4W5mViCHu5lZgRzuZmYFcribmRXI4W5mViCHu5lZgRzuZmYFcribmRXI4W5mViCHu5lZgRzuZmYFcribmRXI4W5mViCHu5lZgRzuZmYFcribmRXI4W5mVqBRw13SMkmbJK2ptP2tpNslrZZ0uaSpub1f0hOSVuXbuRNZvJmZNdbKkfv5wOF1bdcAr42I1wE/BU6rTLsrIubm2ymdKdPMzMZi1HCPiBXAg3Vt34qIrXn0BmDvCajNzMzGqRN97n8KfLMyvq+kn0i6XtIbmy0kaaGkQUmDw8PDHSjDzMxq2gp3SacDW4Gv5KYNwKyIeD3wAeCrkl7aaNmIWBoRAxEx0NfX104ZZmZWZ9zhLulE4K3AOyMiACLiyYjYnIdXAncBr+hAnWZmNgbjCndJhwN/DbwtIh6vtPdJmpKHZwNzgLs7UaiZmbVuh9FmkLQcOBSYJmkd8DHS1TE7AddIArghXxlzCPAJSU8BzwCnRMSDDVdsZmYTZtRwj4gFDZrPazLvZcBl7RZlZmbt8SdUzcwK5HA3MyuQw93MrEAOdzOzAjnczcwK5HA3MyuQw93MrEAOdzOzAjnczcwK5HA3MyuQw93MrECjfreMNde/6KqubHdoyVFd2a6ZPX/4yN3MrEAOdzOzAjnczcwK5HA3MyuQw93MrEAOdzOzAjnczcwK5HA3MyuQw93MrEAthbukZZI2SVpTadtD0jWS7sx/d8/tkvR5SWslrZZ0wEQVb2ZmjbV65H4+cHhd2yLg2oiYA1ybxwGOAObk20LgnPbLNDOzsWgp3CNiBfBgXfM84II8fAHw9kr7hZHcAEyVNKMTxZqZWWva6XOfHhEb8vD9wPQ8vBdwX2W+dbntOSQtlDQoaXB4eLiNMszMrF5HTqhGRAAxxmWWRsRARAz09fV1ogwzM8vaCfeNte6W/HdTbl8PzKzMt3duMzOzSdJOuF8JnJCHTwCuqLQfn6+aOQh4uNJ9Y2Zmk6ClH+uQtBw4FJgmaR3wMWAJcImkk4B7gWPz7FcDRwJrgceBd3e4ZjMzG0VL4R4RC5pMOqzBvAG8t52izMysPf6EqplZgRzuZmYFcribmRXI4W5mViCHu5lZgRzuZmYFcribmRXI4W5mViCHu5lZgRzuZmYFcribmRXI4W5mViCHu5lZgRzuZmYFcribmRWope9zt97Sv+iqbpcw6YaWHNXtEsyeV3zkbmZWIIe7mVmBHO5mZgVyuJuZFWjcJ1QlvRK4uNI0G/goMBX4M2A4t384Iq4ed4VmZjZm4w73iLgDmAsgaQqwHrgceDfwuYj4u45UaGZmY9apbpnDgLsi4t4Orc/MzNrQqXCfDyyvjJ8qabWkZZJ279A2zMysRW2Hu6QXAm8DvpabzgH2I3XZbADOarLcQkmDkgaHh4cbzWJmZuPUiSP3I4AfR8RGgIjYGBFPR8QzwBeBAxstFBFLI2IgIgb6+vo6UIaZmdV0ItwXUOmSkTSjMu0dwJoObMPMzMagre+WkfQS4PeBkyvNn5E0FwhgqG6amZlNgrbCPSIeA15W13ZcWxWZmVnb/AlVM7MCOdzNzArkcDczK5DD3cysQA53M7MCOdzNzArkcDczK5DD3cysQA53M7MCOdzNzArkcDczK5DD3cysQA53M7MCOdzNzArkcDczK5DD3cysQA53M7MCOdzNzArkcDczK5DD3cysQA53M7MC7dDuCiQNAY8CTwNbI2JA0h7AxUA/MAQcGxG/aHdbZmbWmk4dub8pIuZGxEAeXwRcGxFzgGvzuJmZTZKJ6paZB1yQhy8A3j5B2zEzswY6Ee4BfEvSSkkLc9v0iNiQh+8HptcvJGmhpEFJg8PDwx0ow8zMatrucwcOjoj1kn4NuEbS7dWJERGSon6hiFgKLAUYGBjYZrqZmY1f20fuEbE+/90EXA4cCGyUNAMg/93U7nbMzKx1bR25S3oJ8IKIeDQP/wHwCeBK4ARgSf57RbuF2vatf9FVXdv20JKjurZts/Fqt1tmOnC5pNq6vhoR/ybpR8Alkk4C7gWObXM7ZmY2Bm2Fe0TcDfxmg/bNwGHtrNvMzMbPn1A1MyuQw93MrEAOdzOzAjnczcwK5HA3MyuQw93MrEAOdzOzAjnczcwK5HA3MytQJ74V0qxo3fpeG3+njbXDR+5mZgVyuJuZFcjhbmZWIIe7mVmBHO5mZgVyuJuZFcjhbmZWIIe7mVmBHO5mZgVyuJuZFcjhbmZWoHGHu6SZkq6TdKukWyS9L7cvlrRe0qp8O7Jz5ZqZWSva+eKwrcAHI+LHknYFVkq6Jk/7XET8XfvlmZnZeIw73CNiA7AhDz8q6TZgr04VZmZm49eRPndJ/cDrgR/mplMlrZa0TNLuTZZZKGlQ0uDw8HAnyjAzs6ztcJe0C3AZ8P6IeAQ4B9gPmEs6sj+r0XIRsTQiBiJioK+vr90yzMysoq1wl7QjKdi/EhFfB4iIjRHxdEQ8A3wROLD9Ms3MbCzauVpGwHnAbRHx2Ur7jMps7wDWjL88MzMbj3aulvld4DjgZkmrctuHgQWS5gIBDAEnt1WhmZmNWTtXy3wfUINJV4+/HDMz6wR/QtXMrEAOdzOzArXT525mE6h/0VVd2/bQkqO6tm3rDB+5m5kVyOFuZlYgh7uZWYEc7mZmBfIJVTPbRrdO5vpEbuf4yN3MrEAOdzOzAjnczcwK5HA3MyuQw93MrEAOdzOzAvlSSDPrGb4Es3N85G5mViCHu5lZgRzuZmYFcribmRXI4W5mViBfLWNm270Sf/Vqwo7cJR0u6Q5JayUtmqjtmJnZtiYk3CVNAc4GjgD2BxZI2n8itmVmZtuaqCP3A4G1EXF3RPwSuAiYN0HbMjOzOhPV574XcF9lfB3w29UZJC0EFubRLZLumKBampkGPDDJ22yXa54crnlyuGZAn25r8X2aTejaCdWIWAos7db2JQ1GxEC3tj8ernlyuObJ4Zon1kR1y6wHZlbG985tZmY2CSYq3H8EzJG0r6QXAvOBKydoW2ZmVmdCumUiYqukU4F/B6YAyyLilonYVhu61iXUBtc8OVzz5HDNE0gR0e0azMysw/z1A2ZmBXK4m5kVaLsJd0lDkm6WtErSYG7bQ9I1ku7Mf3fvco3LJG2StKbS1rBGJZ/PX++wWtIBPVTzYknr875eJenIyrTTcs13SHpLF+qdKek6SbdKukXS+3J7z+7nEWru2f2ca9hZ0o2Sbsp1fzy37yvph7m+i/NFF0jaKY+vzdP7e6Te8yXdU9nPc3N71/83RhQR28UNGAKm1bV9BliUhxcBn+5yjYcABwBrRqsROBL4JiDgIOCHPVTzYuCvGsy7P3ATsBOwL3AXMGWS650BHJCHdwV+muvq2f08Qs09u59zHQJ2ycM7Aj/M+/ASYH5uPxf48zz8F8C5eXg+cHGP1Hs+cEyD+bv+vzHSbbs5cm9iHnBBHr4AeHsXayEiVgAP1jU3q3EecGEkNwBTJc2YnEqf1aTmZuYBF0XEkxFxD7CW9FUVkyYiNkTEj/Pwo8BtpE9U9+x+HqHmZrq+nwHyPtuSR3fMtwB+D7g0t9fv69pjcClwmCRNUrkj1dtM1/83RrI9hXsA35K0Mn/1AcD0iNiQh+8HpnentBE1q7HRVzyM9ISfbKfmt6rLKt1dPVVzftv/etIR2vNiP9fVDD2+nyVNkbQK2ARcQ3oX8VBEbG1Q26/qztMfBl7WzXojorafz8j7+XOSdqqvN+up5+D2FO4HR8QBpG+qfK+kQ6oTI73P6unrQp8PNWbnAPsBc4ENwFndLWdbknYBLgPeHxGPVKf16n5uUHPP7+eIeDoi5pI+pX4g8KoulzSi+nolvRY4jVT3G4A9gA91scSWbTfhHhHr899NwOWkf7SNtbdR+e+m7lXYVLMae/YrHiJiY36SPAN8kWe7BHqiZkk7kkLyKxHx9dzc0/u5Uc29vp+rIuIh4Drgd0jdF7UPUFZr+1XdefpuwOZJLhV4Tr2H526xiIgngX+hh/dz1XYR7pJeImnX2jDwB8Aa0lcinJBnOwG4ojsVjqhZjVcCx+cz9gcBD1e6Fbqqrt/xHaR9Danm+fmqiH2BOcCNk1ybgPOA2yLis5VJPbufm9Xcy/s519cnaWoefhHw+6TzBdcBx+TZ6vd17TE4BvhOfhfVzXpvr7zoi3R+oLqfe/I5CGwfV8sAs0lXD9wE3AKcnttfBlwL3Al8G9ijy3UuJ729forUf3dSsxpJZ+jPJvVh3gwM9FDNX8o1rSY9AWZU5j8913wHcEQX6j2Y1OWyGliVb0f28n4eoeae3c+5htcBP8n1rQE+mttnk15s1gJfA3bK7Tvn8bV5+uweqfc7eT+vAb7Ms1fUdP1/Y6Sbv37AzKxA20W3jJnZ9sbhbmZWIIe7mVmBHO5mZgVyuJuZFcjhbmZWIIe7mVmB/hvF+DkMqRYlQQAAAABJRU5ErkJggg==\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "X, y = load_chicken(return_X_y=True)\n", "\n", "plt.hist(y)\n", "plt.title('Distribution of weight (target)');" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Heroes\n", "\n", "A dataset from a video game: \"heroes of the storm\". The goal of the dataset\n", "is to predict the attack type. Note that the pandas dataset returns more information." ] }, { "cell_type": "code", "execution_count": 8, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
nameattack_typerolehealthattackattack_spd
0ArtanisMeleeBruiser2470.0111.01.00
1ChenMeleeBruiser2473.090.01.11
2DehakaMeleeBruiser2434.0100.01.11
3ImperiusMeleeBruiser2450.0122.00.83
4LeoricMeleeBruiser2550.0109.00.77
\n", "
" ], "text/plain": [ " name attack_type role health attack attack_spd\n", "0 Artanis Melee Bruiser 2470.0 111.0 1.00\n", "1 Chen Melee Bruiser 2473.0 90.0 1.11\n", "2 Dehaka Melee Bruiser 2434.0 100.0 1.11\n", "3 Imperius Melee Bruiser 2450.0 122.0 0.83\n", "4 Leoric Melee Bruiser 2550.0 109.0 0.77" ] }, "execution_count": 8, "metadata": {}, "output_type": "execute_result" } ], "source": [ "from sklego.datasets import load_heroes\n", "\n", "df_heroes = load_heroes(as_frame=True)\n", "df_heroes.head()" ] }, { "cell_type": "code", "execution_count": 9, "metadata": {}, "outputs": [ { "data": { "image/png": "iVBORw0KGgoAAAANSUhEUgAAAXAAAAEICAYAAABGaK+TAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADh0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uMy4yLjIsIGh0dHA6Ly9tYXRwbG90bGliLm9yZy+WH4yJAAAUd0lEQVR4nO3de5RlZX3m8e9jN4rInW4Z5NZkQCM6Cq4WncVEUdGloyOOMopRAzMkaMZ4iWQlGjXB8QLGxNskGUVAWk28o3jBKFHQqARtAVHBCyGNgggNNAiKKPCbP/Zbcjhd1XWqu6qr3/b7WeusPvv27t85tevZ73n3PtWpKiRJ/bnHYhcgSdo4BrgkdcoAl6ROGeCS1CkDXJI6ZYBLUqcM8C1IknckefU8tbVPkluSLGnT5yb5/flou7X3mSRHz1d7c9jv65Jcl+Qnm3vfG5LksCRXLnYdc5HkwCSrk2Sxa5lOkt2TXJrkXotdy5bKAN9MkqxJcmuSm5PcmOSrSV6Q5Nc/g6p6QVW9dsK2Dt/QOlX1w6ravqrumIfaT0jyvrH2n1RVqza17TnWsQ9wPHBgVf2Hjdh+vZNYkkqy/3zVOMd6Fjv0Xwv8dbUvg0xyXC2k8f1X1TXAOcBxi1XTls4A37z+W1XtAOwLnAT8GXDqfO8kydL5bnMLsQ9wfVVdu9iF9C7JHsBjgI/PY5tL5qutEf8APH8B2t06VJWPzfAA1gCHj807BLgTeHCbPh14XXu+DPgUcCNwA/AvDCfc97ZtbgVuAf4UWAEUcCzwQ+BLI/OWtvbOBU4Evgb8FDgT2LUtOwy4crp6gScCvwR+1fb3zZH2fr89vwfwKuAK4FrgPcBObdlUHUe32q4DXrmB92mntv3a1t6rWvuHt9d8Z6vj9Gm23aW9Z2uBde35Xm3Z64E7gF+07f+2vU8F/KzNe9aG2mjt7Aq8G/hxW/7x6d5D4MXAJaPbjtV6n7HXcwtwP+DnwG4j6z2s1bINcAzwlVb7TcB3gceNvXenAlcDVwGvA5bMsP/fA/55ZHq946rN/zDwk7a/LwEPGtnmdOD/AWe19/DwVu+FwM1t2w/Sjum2zVOAixiO668CD5ll/0vbe7LvYv8Ob4mPRS/gN+XBNAHe5v8Q+MP2/HTuCvATgXe0X9xtgN8BMl1b3BWS72nBcG+mD/CrgAe3dT4KvK8tu1v4jO8DOGFq3ZHl53JXgP8v4DLgt4DtgTOA947V9q5W10OB24AHzvA+vYfh5LJD2/b7wLEz1Tm27W7AM4Dt2vYfpgXseM0j8wrYfw5tfLqF0i7t5/Lo8dqAvwAuAJbPckxM976fNXU8tOm3AP+3PT8GuB3447bvZzEE69SJ+GPAO9vP974MJ+vnz7DvNwF/N9sx2n62OwD3At4KXDSy7PS2/0MZTrI7Mpx0X9LqezrDyX/qmD6Y4QT/CGAJw0l9DXCvWX5HLgaeuti/w1viwyGUxfdjhl7duF8BezD0PH5VVf9S7WjegBOq6mdVdesMy99bVd+uqp8BrwaeOU8fe58DvLmqLq+qW4BXAEeNDeW8pqpurapvAt9kCPK7abUcBbyiqm6uqjXA3wDPm6SIqrq+qj5aVT+vqpsZet2PnssL2VAbbdjhScALqmpd+7l88e4vIW8GngA8pqrWzmXfzSrgua2xJcCzGXqnU64F3tr2/UHge8CTk+wO/Ffgpe0YuJYh/I+aYT87M/SSN6iqTms/i9sYTuQPTbLTyCpnVtVXqupO4CCGHvPbW31nMJxEphwHvLOqzq+qO2q4hnIb8MhZyri51asxBvji25NhiGTcmxh6tZ9LcnmSl0/Q1o/msPwKhl7Ssomq3LD7tfZG214K7D4yb/SukZ8z9NTHLWs1jbe15yRFJNkuyTuTXJHkpwwf+Xeey0lqljb2Bm6oqnUzbL4zQ0idWFU3TbrPMWcCBybZD3g8cFNVjYbgVWMn8isY3v99Gd67q9tF8hsZeuP3nWE/6xh61jNKsiTJSUn+rb0Xa9qi0WNm9Ji63zT1jS7fFzh+qr5W495tuw3ZgWHIRWMM8EWU5OEM4fTl8WWt13N8Vf0W8FTgZUkeN7V4hiZn66HvPfJ8H4Ze/nUM45fbjdS1BFg+h3Z/zPDLOdr27cA1s2w37rpW03hbV024/fHAA4BHVNWOwKPa/Knb5Cb505sbauNHwK5JZuoNrmMY4313kkMn2Nd69VTVL4APMfTCn8fde98Ae47d9rcPw/v/I4be7LKq2rk9dqyqB82w74uB+89Sz+8CRzCMbe/EMKQFd72f49tcPU19o8fcj4DXj9S3c1VtV1Xvn2H/Uxfk92f41KYxBvgiSLJjkqcAH2AYW/7WNOs8Jcn+7ZfhJoYLcHe2xdcwjDfP1XPbvb/bAf8H+EgNtxl+H9g2yZOTbMNw4XD03ttrgBWjtzyOeT/wx0n2S7I98Abgg1V1+1yKa7V8CHh9kh2S7Au8DHjfhrf8tR0YLoLdmGRX4C/Hlk/3vo3Pm7GNqroa+Azw90l2SbJNkkeNNlZV5zIMKZ2R5JBZ6r0G2G1sSAKG6wDHMJy4xwP8vsCL277/B/BA4KxW2+eAv2nH1z2S/MckMw0hnQ08LMm2Y/WMvxe3AdcznODfMMvrOY/hOP2jJEuTHMFwoX7Ku4AXJHlEBvdpx9zUJ4Hpfj6HAGuq6gq0HgN88/pkkpsZeiKvBN4M/M8Z1j0A+GeGK/LnAX9fVee0ZScCr2ofQ/9kDvt/L8OFp58A2zLcKUH7uP+/gVMYers/A0bvT/5w+/f6JBdM0+5pre0vAf/OcKfHi+ZQ16gXtf1fzvDJ5B9b+5N4K8OF0uuAfwX+aWz524Ajk6xL8vY27wRgVXsvnzlBG89j+JTwXYbx6JeOF1FVZzNc/PtkkofNVGxVfZfh5Hd52//92vyvMJysL5gmuM5nODauYxifP7Kqrm/Lfg+4J8PdL+uAjzBcR5lu39cAX2DoYU8ZP67ewzBEc1Vr819nei2tzV8yXLg8lmHI47kMd/Hc1pavBv6A4S6adQxDhMdsYP8wnAzfsaH9/iabuqtB0hYkyReAf6yqU0bmHcNwF81/mad9HMhw0fSQCS6Qb+w+zgfeUVXv3oht7wt8ETi4DS1pzNb6hQ+pW+3ayMO4e+943lXVJcDD57PNNmTzPYZPCM8BHsL6n2Im0u6keeD8Vbf1cQhFWkBJ/jzD36QZf3xmhvVXMQydvbTdxtibBzBccLyR4YLwkW18XgvAIRRJ6pQ9cEnq1GYdA1+2bFmtWLFic+5Skrr3jW9847qqWj4+f7MG+IoVK1i9evXm3KUkdS/JtPfBO4QiSZ0ywCWpUwa4JHXKAJekThngktQpA1ySOmWAS1KnDHBJ6pQBLkmd8s/JSvNkxcs/vdglaAu15qQnL0i79sAlqVMGuCR1ygCXpE4Z4JLUKQNckjplgEtSpwxwSeqUAS5JnTLAJalTBrgkdcoAl6ROGeCS1CkDXJI6ZYBLUqcmDvAkS5JcmORTbXq/JOcnuSzJB5Pcc+HKlCSNm0sP/CXApSPTbwTeUlX7A+uAY+ezMEnShk0U4En2Ap4MnNKmAzwW+EhbZRXwtIUoUJI0vUl74G8F/hS4s03vBtxYVbe36SuBPee5NknSBsz6X6oleQpwbVV9I8lhc91BkuOA4wD22WefORc4xf+uSjNZqP+uStrSTdIDPxR4apI1wAcYhk7eBuycZOoEsBdw1XQbV9XJVbWyqlYuX758HkqWJMEEAV5Vr6iqvapqBXAU8IWqeg5wDnBkW+1o4MwFq1KStJ5NuQ/8z4CXJbmMYUz81PkpSZI0iVnHwEdV1bnAue355cAh81+SJGkSfhNTkjplgEtSpwxwSeqUAS5JnTLAJalTBrgkdcoAl6ROGeCS1CkDXJI6ZYBLUqcMcEnqlAEuSZ0ywCWpUwa4JHXKAJekThngktQpA1ySOmWAS1KnDHBJ6pQBLkmdMsAlqVMGuCR1ygCXpE4Z4JLUKQNckjplgEtSpwxwSeqUAS5JnTLAJalTBrgkdcoAl6ROGeCS1CkDXJI6ZYBLUqcMcEnqlAEuSZ0ywCWpUwa4JHVq1gBPsm2SryX5ZpLvJHlNm79fkvOTXJbkg0nuufDlSpKmTNIDvw14bFU9FDgIeGKSRwJvBN5SVfsD64BjF65MSdK4WQO8Bre0yW3ao4DHAh9p81cBT1uQCiVJ05poDDzJkiQXAdcCZwP/BtxYVbe3Va4E9lyYEiVJ05kowKvqjqo6CNgLOAT47Ul3kOS4JKuTrF67du1GlilJGjenu1Cq6kbgHOA/AzsnWdoW7QVcNcM2J1fVyqpauXz58k0qVpJ0l0nuQlmeZOf2/N7A44FLGYL8yLba0cCZC1WkJGl9S2dfhT2AVUmWMAT+h6rqU0kuAT6Q5HXAhcCpC1inJGnMrAFeVRcDB08z/3KG8XBJ0iLwm5iS1CkDXJI6ZYBLUqcMcEnqlAEuSZ0ywCWpUwa4JHXKAJekThngktQpA1ySOmWAS1KnDHBJ6pQBLkmdMsAlqVMGuCR1ygCXpE4Z4JLUKQNckjplgEtSpwxwSeqUAS5JnTLAJalTBrgkdcoAl6ROGeCS1CkDXJI6ZYBLUqcMcEnqlAEuSZ0ywCWpUwa4JHXKAJekThngktQpA1ySOmWAS1KnDHBJ6pQBLkmdMsAlqVMGuCR1atYAT7J3knOSXJLkO0le0ubvmuTsJD9o/+6y8OVKkqZM0gO/HTi+qg4EHgm8MMmBwMuBz1fVAcDn27QkaTOZNcCr6uqquqA9vxm4FNgTOAJY1VZbBTxtoYqUJK1vTmPgSVYABwPnA7tX1dVt0U+A3WfY5rgkq5OsXrt27SaUKkkaNXGAJ9ke+Cjw0qr66eiyqiqgptuuqk6uqpVVtXL58uWbVKwk6S4TBXiSbRjC+x+q6ow2+5oke7TlewDXLkyJkqTpTHIXSoBTgUur6s0jiz4BHN2eHw2cOf/lSZJmsnSCdQ4Fngd8K8lFbd6fAycBH0pyLHAF8MyFKVGSNJ1ZA7yqvgxkhsWPm99yJEmT8puYktQpA1ySOmWAS1KnDHBJ6pQBLkmdMsAlqVMGuCR1ygCXpE4Z4JLUKQNckjplgEtSpwxwSeqUAS5JnTLAJalTBrgkdcoAl6ROGeCS1CkDXJI6ZYBLUqcMcEnqlAEuSZ0ywCWpUwa4JHXKAJekThngktQpA1ySOmWAS1KnDHBJ6pQBLkmdMsAlqVMGuCR1ygCXpE4Z4JLUKQNckjplgEtSpwxwSeqUAS5JnTLAJalTswZ4ktOSXJvk2yPzdk1ydpIftH93WdgyJUnjJumBnw48cWzey4HPV9UBwOfbtCRpM5o1wKvqS8ANY7OPAFa156uAp81zXZKkWWzsGPjuVXV1e/4TYPeZVkxyXJLVSVavXbt2I3cnSRq3yRcxq6qA2sDyk6tqZVWtXL58+abuTpLUbGyAX5NkD4D277XzV5IkaRIbG+CfAI5uz48GzpyfciRJk5rkNsL3A+cBD0hyZZJjgZOAxyf5AXB4m5YkbUZLZ1uhqp49w6LHzXMtkqQ58JuYktQpA1ySOmWAS1KnDHBJ6pQBLkmdMsAlqVMGuCR1ygCXpE4Z4JLUKQNckjplgEtSpwxwSeqUAS5JnTLAJalTBrgkdcoAl6ROGeCS1CkDXJI6ZYBLUqcMcEnqlAEuSZ0ywCWpUwa4JHXKAJekThngktQpA1ySOmWAS1KnDHBJ6pQBLkmdMsAlqVMGuCR1ygCXpE4Z4JLUKQNckjplgEtSpwxwSeqUAS5JnTLAJalTmxTgSZ6Y5HtJLkvy8vkqSpI0u40O8CRLgL8DngQcCDw7yYHzVZgkacM2pQd+CHBZVV1eVb8EPgAcMT9lSZJms3QTtt0T+NHI9JXAI8ZXSnIccFybvCXJ9zZhn7rLMuC6xS5iS5A3LnYFmoHHaDMPx+i+083clACfSFWdDJy80Pv5TZNkdVWtXOw6pJl4jC68TRlCuQrYe2R6rzZPkrQZbEqAfx04IMl+Se4JHAV8Yn7KkiTNZqOHUKrq9iR/BHwWWAKcVlXfmbfKNBuHpbSl8xhdYKmqxa5BkrQR/CamJHXKAJekThngiyxJJXnfyPTSJGuTfGqW7Q6bbR1pYyS5I8lFSb6d5JNJdl7EWtYkWbZY+9/SGeCL72fAg5Pcu00/Hm/H1OK6taoOqqoHAzcAL1zsgjQ9A3zLcBbw5Pb82cD7pxYkuU+S05J8LcmFSdb7cwUzrZNkSZI3Jfl6kouTPH+zvBptTc5j+NY1SQ5Jcl47xr6a5AFt/jFJzkjyT0l+kOSvpjZOcmyS77dj811J/rbNX57ko+3Y/HqSQ9v83ZJ8Lsl3kpwCZPO/5H4Y4FuGDwBHJdkWeAhw/siyVwJfqKpDgMcAb0pyn7HtZ1rnWOCmqno48HDgD5Lst8CvRVuJ9gfrHsdd3+/4LvA7VXUw8BfAG0ZWPwh4FvCfgGcl2TvJ/YBXA48EDgV+e2T9twFvacfmM4BT2vy/BL5cVQ8CPgbssxCvbWux4F+l1+yq6uIkKxh632eNLX4C8NQkf9Kmt2X9g3qmdZ4APCTJkW3+TsABwL/P6wvQ1ubeSS5i6HlfCpzd5u8ErEpyAFDANiPbfL6qbgJIcgnD3+5YBnyxqm5o8z8M3L+tfzhwYPLrDvaOSbYHHgU8HaCqPp1k3cK8xK2DAb7l+ATw18BhwG4j8wM8o6ru9kfAkuw+wToBXlRVn12QirW1urWqDkqyHcMX9V4IvB14LXBOVf331uE4d2Sb20ae38Hs2XIP4JFV9YvRmSOBrgk4hLLlOA14TVV9a2z+Z4EXtTAmycHTbDvTOp8F/jDJNm3+/acZfpGmVVU/B14MHJ9kKUMPfOoC+zETNPF14NFJdmnbP2Nk2eeAF01NJDmoPf0S8Ltt3pOAXTblNWztDPAtRFVdWVVvn2bRaxk+ql6c5DttetJ1TgEuAS5I8m3gnfipS3NQVRcCFzMM7/0VcGKSC5ngOKqqqxjGyb8GfAVYA9zUFr8YWNkurl8CvKDNfw3wqHYcPx344fy9mq2PX6WXtGCSbF9Vt7Qe+McY/mbSxxa7rq2FPXBJC+mEdkH02wwXzz++yPVsVeyBS1Kn7IFLUqcMcEnqlAEuSZ0ywCWpUwa4JHXq/wNX6aAW24ku1gAAAABJRU5ErkJggg==\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "X, y = load_heroes(return_X_y=True)\n", "\n", "plt.bar(Counter(y).keys(), Counter(y).values())\n", "plt.title('Distribution of attack_type (target)');" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Hearts\n", "\n", "Loads the Cleveland Heart Diseases dataset. The goal is to predict the presence of a heart disease (target values 1, 2, 3, and 4).\n", "The data originates from research to heart diseases by four institutions and originally contains 76 attributes.\n", "Yet, all published experiments refer to using a subset of 13 features and one target.\n", "This implementation loads the Cleveland dataset of the research which is the only set used by ML researchers to this date." ] }, { "cell_type": "code", "execution_count": 10, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
agesexcptrestbpscholfbsrestecgthalachexangoldpeakslopecathaltarget
063111452331215002.330fixed0
167141602860210811.523normal1
267141202290212912.622reversible0
337131302500018703.530normal0
441021302040217201.410normal0
\n", "
" ], "text/plain": [ " age sex cp trestbps chol fbs restecg thalach exang oldpeak slope \\\n", "0 63 1 1 145 233 1 2 150 0 2.3 3 \n", "1 67 1 4 160 286 0 2 108 1 1.5 2 \n", "2 67 1 4 120 229 0 2 129 1 2.6 2 \n", "3 37 1 3 130 250 0 0 187 0 3.5 3 \n", "4 41 0 2 130 204 0 2 172 0 1.4 1 \n", "\n", " ca thal target \n", "0 0 fixed 0 \n", "1 3 normal 1 \n", "2 2 reversible 0 \n", "3 0 normal 0 \n", "4 0 normal 0 " ] }, "execution_count": 10, "metadata": {}, "output_type": "execute_result" } ], "source": [ "from sklego.datasets import load_hearts\n", "\n", "df_hearts = load_hearts(as_frame=True)\n", "df_hearts.head()" ] }, { "cell_type": "code", "execution_count": 11, "metadata": {}, "outputs": [ { "data": { "image/png": "iVBORw0KGgoAAAANSUhEUgAAAXcAAAEICAYAAACktLTqAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADh0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uMy4yLjIsIGh0dHA6Ly9tYXRwbG90bGliLm9yZy+WH4yJAAAVT0lEQVR4nO3ce9RddX3n8feHRJBbKTQpAyEkKkFBpqVOFLtql0zxwkUbnRkR22pELNJi1Rm8oNLRNUqlq2NZ4zjiQKWAtGgctVDvSEVqkWKw1HKRGiOQxAQSUAxeqoHf/LF/j2wO57nfwo/3a61nPefs396//T378jn7+e1znpRSkCS1ZZf5LkCSNPMMd0lqkOEuSQ0y3CWpQYa7JDXIcJekBj2mwz3JB5P88Qz1dXCS+5MsqM+vTvLqmei79vfZJKtnqr9JrPfdSbYl2TLX654pSZ6f5G/mu46dTZInJ7kxyfYkrxvSPqPH8ExL8sokX+k9vz/JE+ezpmGSvDfJH8z1epsN9yS3J/lxPXC/n+TaJKcl+flrLqWcVkp51wT7es5Y85RS7iyl7FVKeWAGan9nkksH+j+ulHLxdPueZB0HA2cAh5dS/t2Q9qOTbJzLmqa4/rOBc2a7nkehNwNfKqXsXUp533wWMhPHUj3/1s9UTTPofwJvS7LrXK602XCvXlhK2RtYRndyvwX40EyvJMnCme5zJ3EwcE8p5e7Z6HwutluSpwP7lFKum+Lyre5b6M6Lm+e7iMa3MaWUzcA3gd+e6xU3+QPcDjxnYNozgAeBI+rzi4B318eLgE8B3wfuBf6e7s3vw3WZHwP3013tLAcKcApwJ3BNb9rC2t/VwHuA64EfAJcD+9W2o4GNw+oFjgV+Cvysru+fe/29uj7eBTgLuAO4G7iELsDo1bG61rYNePsY22mfuvzW2t9Ztf/n1Nf8YK3jooHl9hxovx84sG7jr9btuBl4P7Brb7kCnA58C/hOnfbmOu93gVfXeQ6pbbvRXfncCdwFfBDYfbT1D3l9/x34i4FpBXgdsL5unz8DdqltrwT+ATgXuAd492g1jHXc1LYDgY/Xbfsd4HW9Gt4JrKnbfjtdyK7stS8FPlGXvQd4f6/tVcCtwPeAzwPLxti/v137/n49hg6r0/8OeAD4Sd12hw5Z9mrgXXV7bAe+ACzqtT8TuLb2/c/A0b22k2uN2+t2fk2v7WhgI93F1hbgYxPcl78EXEF3Pl1fa/vKwH4dOW6OB26p698EvLE33wuAG2vd1wK/0ms7E/h2Xe4W4MW9tkOALwP31ePmo722pwBX1mPgNuDEgdrfDvzlnGbgXK5sTl/YkHCv0+8E/qA+voiHwv09dCft4+rPbwIZ1hcPBegldCGzO8PDfRNwRJ3n48Cl/YN7tHrpTvxLh5xoI+H+KmAd8ERgL7oQ+PBAbRfUun4V+DfqST1ke1xC98azd132X4FTRqtzYNlhr+M/0J30C2t/twJv6LWXehLsV+s7lu4EfyqwB3ApDz9Jz6U7oferNf4t8J6J1Ffn+RjwpoFpBfhS7fPg+ppHtu0rgR3AH9XXsPs4NQw9bujeIG+ge3PZte6r9cDze/v4J3QhtKD2c11tW0AXlufSHTuPB55V21bVfX9Yre8s4NpRXvuhwA+B59ba3lyX3XXwmBpl+avpgu7Quh2uBs6pbUvo3nSOr6/1ufX54tp+AvCkui2eDfwIeFpvv+0A/pTujXP3Ce7Lj9C9Ie5Jd15tYvRw3wz8Zn28b2/dv0Z3QXRU3c6r6c693Wr7S+jelHcBXlq33wG17TK6kN5lYJ/sCWyge0NbWNexjW44c6S2/wR8fU4zcC5XNqcvbPRwv456JcvDw/1/0IXcIeP1xUMB+sQh0/rhfk6v/XC6K/IFww5kJhfuVwF/2Gt7Mt2V/kigFuCgXvv1wElDXteCWlP/IHwNcHV9POYJN157necNwCd7zwvwW73nF1KDsj4/pM5zCF0w/BB4Uq/913noin8i678SOG1gWgGO7T3/Q+Cq+viVwJ29tvFqGHrc0IXHnQPT3kq9eqv7+IsDx8ePe/1vHTmWBvr4LPXNtz7fhS44lw2Z94+BNQPzbqJeYTOxcD9rYDt9rj5+C/WCotf+eWD1KH39DfD63n77KfD4iR5L9Vj9GfCU3rQ/YfRwv5PuWP6FgX7OA941MO024NmjrPdGYFV9fAlwPr1zq05/KfD3A9P+L/CO3vPnAuvHOlZn+qf1MfdhltD96TToz+iuar6QZH2SMyfQ14ZJtN9Bd/W0aEJVju3A2l+/74XA/r1p/U+3/IjuCn/QolrTYF9LplpYkkOTfCrJliQ/oDsBB19zf7scOPC8/3gx3dX8DfWm+PeBz9XpE/U9uqvtQYP75sAp1jDacbMMOHBkmbrc2xh7Hz2+jj8vBe4opewYUvcy4H/1+ryX7g1o2D572HFSSnmwvrbJ7N/RjqNlwEsGXt+zgAMAkhyX5Lok99a243n4cbC1lPKTSdSxmO4YH9xvo/nPdZ13JPlykl/v1X3GQN1Lqfs/ySvqJ4hG2o7o1f1mum19fZKbk7yq1+dRA33+LtD/EMLedMNAc6bpGxmD6s21JcBXBttKKdvpPhlyRpIjgL9L8rVSylV0VwTDjDZ9xNLe44Pprjy20V0J7tGrawEPD6zx+v0u3QHV73sH3XjwQeMs27et1rSMbnxxpK9NE1x+WJ3nAf8EvKyUsj3JG4D/MsZym3l4zf1tto1uLPappZRhNY23nQC+QTesMGgpD91MPJhumw7rd8waRjtu6ELoO6WUFROocdAG4OAkC4cE/Abg7FLKX02gn+8C/37kSZLQve6J7t/xavxwKeX3BxuS7EY3DPkK4PJSys/qR1HTm21w3423L7fSHeNL6W5OQrffhiqlfA1YleRxwGvphnOW8tD2O3tI3cvohjOPAb5aSnkgyY0jdZdStgC/X+d9FvDFJNfUPr9cSnnuGPUfRjfUNmceE1fuSX4hyQvoxuwuLaX8y5B5XpDkkHoC3Ed3s+nB2nwX3ZjpZP1eksOT7EH35/v/K91HJf+V7irthHrwnUU39jjiLmB5/2ObAy4D/muSJyTZi+7q+KOjXOmNqtayBjg7yd714P5vdOPeE3EX8EtJ9ulN25vuhtf9SZ4CjPf53jXAyUkOq9vp5987qFeaFwDnJvllgCRLkjx/jPUP+gzdmO+gNyXZN8lS4PXAR4ctPF4NYxw31wPbk7wlye5JFiQ5ol5gjOd6uje9c5LsmeTxSX6jtn0QeGuSp9b175PkJaP0swY4Ickx9Tg7g+7+y7UTqGE8lwIvTPcdggW1xqOTHER3j2E3aiAnOQ543jj9jbkv67H6CeCdSfZIcjjdePkjJNk1ye8m2aeU8jO643HkXL4AOC3JUensWc/DvenGzkutmyQn0125j/T7kvr6oPuLsNR+PwUcmuTlSR5Xf56e5LBeWc+mG1KbM62H+98m2U73zvp24M/pbnoMswL4It2d+q8CHyilfKm2vQc4q/7J9cZJrP/DdOP6W+huwLwOoJRyH9345V/QXUX9kO7TAyM+Vn/fk+TrQ/q9sPZ9Dd2nMH5CdwNwKv6orn893V80f137H1cp5Zt0bzTr67Y5EHgj8Dt0nza4gFFCs9fHZ4H30d3gXEd3TwS6EIJubHcdcF26YZ4v0t1jGG39g/1/HbgvyVEDTZfT3fC8Efg0Y39EdtQaGOW4qWH0AuBIun20jW5/j/VGNFLzA8AL6e473El3bLy0tn2S7kbkR2otNwHHjdLPbcDvAf+7rv+FdB8P/ul4NUygxg10N3ffRheGG4A30X1SaDvdsb6GLgR/h+6G9Fj9jbsv6a7A96I7ny4C/nKMLl8O3F630Wl0wySUUtbSXX2/v9a2ju4+C6WUW4D30u3Hu+j+6vmHXp9PB/4xyf319by+lLK+vt7nASfR/bW0hYduFpPkALp7KnP6RbqRT4NIO4V6tXMT3acXJvWXyBh9Po/uBvSL6vMCrCilrJuJ/qWxJHkv8O1SygfmdL2Gu+ZbkhfTDZ/sAVwMPDgSxLO0PsNdzWt9WEaPDq+h++zxt+nGrOf8/3BIrfHKXZIa5JW7JDVop/ic+6JFi8ry5cvnuwxJelS54YYbtpVShn6pb6cI9+XLl7N27dr5LkOSHlWSjPotXYdlJKlBhrskNchwl6QGGe6S1CDDXZIaZLhLUoMMd0lqkOEuSQ0y3CWpQTvFN1SnY/mZn57vErQTu/2cE+a7BGleeOUuSQ0y3CWpQYa7JDXIcJekBhnuktQgw12SGmS4S1KDDHdJapDhLkkNMtwlqUGGuyQ1yHCXpAYZ7pLUIMNdkho0brgnWZrkS0luSXJzktfX6fsluTLJt+rvfev0JHlfknVJvpHkabP9IiRJDzeRK/cdwBmllMOBZwKnJzkcOBO4qpSyAriqPgc4DlhRf04FzpvxqiVJYxo33Espm0spX6+PtwO3AkuAVcDFdbaLgRfVx6uAS0rnOuAXkxww45VLkkY1qTH3JMuBXwP+Edi/lLK5Nm0B9q+PlwAbeottrNMkSXNkwuGeZC/g48AbSik/6LeVUgpQJrPiJKcmWZtk7datWyezqCRpHBMK9ySPowv2vyqlfKJOvmtkuKX+vrtO3wQs7S1+UJ32MKWU80spK0spKxcvXjzV+iVJQ0zk0zIBPgTcWkr5817TFcDq+ng1cHlv+ivqp2aeCdzXG76RJM2BhROY5zeAlwP/kuTGOu1twDnAmiSnAHcAJ9a2zwDHA+uAHwEnz2jFkqRxjRvupZSvABml+Zgh8xfg9GnWJUmaBr+hKkkNMtwlqUGGuyQ1yHCXpAYZ7pLUIMNdkhpkuEtSgwx3SWqQ4S5JDTLcJalBhrskNchwl6QGGe6S1CDDXZIaZLhLUoMMd0lqkOEuSQ0y3CWpQYa7JDXIcJekBhnuktQgw12SGmS4S1KDDHdJapDhLkkNMtwlqUGGuyQ1yHCXpAYZ7pLUIMNdkhpkuEtSgwx3SWqQ4S5JDTLcJalBhrskNchwl6QGGe6S1CDDXZIaNG64J7kwyd1JbupNe2eSTUlurD/H99remmRdktuSPH+2CpckjW4iV+4XAccOmX5uKeXI+vMZgCSHAycBT63LfCDJgpkqVpI0MeOGeynlGuDeCfa3CvhIKeXfSinfAdYBz5hGfZKkKZjOmPtrk3yjDtvsW6ctATb05tlYp0mS5tBUw/084EnAkcBm4L2T7SDJqUnWJlm7devWKZYhSRpmSuFeSrmrlPJAKeVB4AIeGnrZBCztzXpQnTasj/NLKStLKSsXL148lTIkSaOYUrgnOaD39MXAyCdprgBOSrJbkicAK4Drp1eiJGmyFo43Q5LLgKOBRUk2Au8Ajk5yJFCA24HXAJRSbk6yBrgF2AGcXkp5YHZKlySNZtxwL6W8bMjkD40x/9nA2dMpSpI0PX5DVZIaZLhLUoMMd0lqkOEuSQ0y3CWpQYa7JDXIcJekBhnuktQgw12SGmS4S1KDDHdJapDhLkkNMtwlqUGGuyQ1yHCXpAYZ7pLUIMNdkhpkuEtSgwx3SWqQ4S5JDTLcJalBhrskNchwl6QGGe6S1CDDXZIaZLhLUoMMd0lqkOEuSQ0y3CWpQYa7JDXIcJekBhnuktQgw12SGmS4S1KDDHdJapDhLkkNMtwlqUGGuyQ1yHCXpAaNG+5JLkxyd5KbetP2S3Jlkm/V3/vW6UnyviTrknwjydNms3hJ0nATuXK/CDh2YNqZwFWllBXAVfU5wHHAivpzKnDezJQpSZqMccO9lHINcO/A5FXAxfXxxcCLetMvKZ3rgF9McsBMFStJmpipjrnvX0rZXB9vAfavj5cAG3rzbazTHiHJqUnWJlm7devWKZYhSRpm2jdUSykFKFNY7vxSyspSysrFixdPtwxJUs9Uw/2ukeGW+vvuOn0TsLQ330F1miRpDk013K8AVtfHq4HLe9NfUT8180zgvt7wjSRpjiwcb4YklwFHA4uSbATeAZwDrElyCnAHcGKd/TPA8cA64EfAybNQsyRpHOOGeynlZaM0HTNk3gKcPt2iJEnT4zdUJalBhrskNchwl6QGGe6S1CDDXZIaZLhLUoMMd0lqkOEuSQ0y3CWpQYa7JDVo3H8/IGn6lp/56fkuQTup2885YVb69cpdkhpkuEtSgwx3SWqQ4S5JDTLcJalBhrskNchwl6QGGe6S1CDDXZIaZLhLUoMMd0lqkOEuSQ0y3CWpQYa7JDXIcJekBhnuktQgw12SGmS4S1KDDHdJapDhLkkNMtwlqUGGuyQ1yHCXpAYZ7pLUIMNdkhpkuEtSgwx3SWrQwuksnOR2YDvwALCjlLIyyX7AR4HlwO3AiaWU702vTEnSZMzElft/LKUcWUpZWZ+fCVxVSlkBXFWfS5Lm0GwMy6wCLq6PLwZeNAvrkCSNYbrhXoAvJLkhyal12v6llM318RZg/2ELJjk1ydoka7du3TrNMiRJfdMacweeVUrZlOSXgSuTfLPfWEopScqwBUsp5wPnA6xcuXLoPJKkqZnWlXspZVP9fTfwSeAZwF1JDgCov++ebpGSpMmZcrgn2TPJ3iOPgecBNwFXAKvrbKuBy6dbpCRpcqYzLLM/8MkkI/38dSnlc0m+BqxJcgpwB3Di9MuUJE3GlMO9lLIe+NUh0+8BjplOUZKk6fEbqpLUIMNdkhpkuEtSgwx3SWqQ4S5JDTLcJalBhrskNchwl6QGGe6S1CDDXZIaZLhLUoMMd0lqkOEuSQ0y3CWpQYa7JDXIcJekBhnuktQgw12SGmS4S1KDDHdJapDhLkkNMtwlqUGGuyQ1yHCXpAYZ7pLUIMNdkhpkuEtSgwx3SWqQ4S5JDTLcJalBhrskNchwl6QGGe6S1CDDXZIaZLhLUoMMd0lqkOEuSQ0y3CWpQbMW7kmOTXJbknVJzpyt9UiSHmlWwj3JAuD/AMcBhwMvS3L4bKxLkvRIs3Xl/gxgXSllfSnlp8BHgFWztC5J0oCFs9TvEmBD7/lG4Kj+DElOBU6tT+9Pctss1fJYswjYNt9F7Czyp/NdgYbwGO2Z5jG6bLSG2Qr3cZVSzgfOn6/1tyrJ2lLKyvmuQxqNx+jcmK1hmU3A0t7zg+o0SdIcmK1w/xqwIskTkuwKnARcMUvrkiQNmJVhmVLKjiSvBT4PLAAuLKXcPBvr0iM41KWdncfoHEgpZb5rkCTNML+hKkkNMtwlqUGGe0P8lw/amSW5MMndSW6a71oeCwz3RvgvH/QocBFw7HwX8VhhuLfDf/mgnVop5Rrg3vmu47HCcG/HsH/5sGSeapE0zwx3SWqQ4d4O/+WDpJ8z3Nvhv3yQ9HOGeyNKKTuAkX/5cCuwxn/5oJ1JksuArwJPTrIxySnzXVPL/PcDktQgr9wlqUGGuyQ1yHCXpAYZ7pLUIMNdkhpkuEtSgwx3SWrQ/wdnPvpXsryvsgAAAABJRU5ErkJggg==\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "X, y = load_hearts(return_X_y=True)\n", "\n", "y_counted = Counter([str(integer) for integer in y])\n", "\n", "plt.bar(y_counted.keys(), y_counted.values())\n", "plt.title('Distribution of target (presence of heart disease)');" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Load penguins\n", "\n", "Loads the penguins dataset, which is a lovely alternative for the iris dataset. We've\n", "added this dataset for educational use.\n", "\n", "Data were collected and made available by Dr. Kristen Gorman and the Palmer Station,\n", "Antarctica LTER, a member of the Long Term Ecological Research Network. The goal\n", "of the dataset is to predict which species of penguin a penguin belongs to." ] }, { "cell_type": "code", "execution_count": 12, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
speciesislandbill_length_mmbill_depth_mmflipper_length_mmbody_mass_gsex
0AdelieTorgersen39.118.7181.03750.0male
1AdelieTorgersen39.517.4186.03800.0female
2AdelieTorgersen40.318.0195.03250.0female
3AdelieTorgersenNaNNaNNaNNaNNaN
4AdelieTorgersen36.719.3193.03450.0female
\n", "
" ], "text/plain": [ " species island bill_length_mm bill_depth_mm flipper_length_mm \\\n", "0 Adelie Torgersen 39.1 18.7 181.0 \n", "1 Adelie Torgersen 39.5 17.4 186.0 \n", "2 Adelie Torgersen 40.3 18.0 195.0 \n", "3 Adelie Torgersen NaN NaN NaN \n", "4 Adelie Torgersen 36.7 19.3 193.0 \n", "\n", " body_mass_g sex \n", "0 3750.0 male \n", "1 3800.0 female \n", "2 3250.0 female \n", "3 NaN NaN \n", "4 3450.0 female " ] }, "execution_count": 12, "metadata": {}, "output_type": "execute_result" } ], "source": [ "from sklego.datasets import load_penguins\n", "\n", "df_penguins = load_penguins(as_frame=True)\n", "df_penguins.head()" ] }, { "cell_type": "code", "execution_count": 13, "metadata": {}, "outputs": [ { "data": { "image/png": "iVBORw0KGgoAAAANSUhEUgAAAXcAAAEICAYAAACktLTqAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADh0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uMy4yLjIsIGh0dHA6Ly9tYXRwbG90bGliLm9yZy+WH4yJAAAY00lEQVR4nO3de5xcZX3H8c+XhDtIiFkjJIENEtSUouIKWFBTsRa5mPRVRFLAYENTkYoKikERqC+pWKsUi7cISEDkUlRIhSopEkHLbYOEW0BjSEhiQha5CKiEwK9/nGfLYTKzuzNnNpt9+L5fr3ntuT7nN2ey33nmOWc2igjMzCwvmw11AWZm1n4OdzOzDDnczcwy5HA3M8uQw93MLEMOdzOzDDncMyXpm5I+26a2dpH0tKQRaX6BpOPa0XZq778lzWhXe00c9/OSHpW0ZmMfu04tR0m6fhDa7ZD0gKSt2912O0jaMtXXMdS15Ea+z334kbQMGAusB54H7gcuBuZExAsttHVcRPxPE/ssAL4bEec3c6y075nA7hFxdLP7tpOkXYAHgV0jYu1Q1jKYJH0Z6ImIs9P8Alp87dpUzwbHl3QKMDYiTh6KmnLlnvvwdVhEbA/sCpwNfAq4oN0HkTSy3W1uInYBfpd5sG8JzAC+28Y2B+Pfw/eAGalea5eI8GOYPYBlwLtqlu0DvADsmeYvAj6fpscAPwKeAB4DbqZ4Y78k7fNH4GngFKATCGAm8DBwU2nZyNTeAuALwO3A74FrgNFp3RRgZb16gYOAdcBz6XiLSu0dl6Y3A04DlgNrKT6R7JDW9dYxI9X2KPCZPs7TDmn/ntTeaan9d6Xn/EKq46I6+9Y9Z6XncyrFJ6bHge8AW5X2PRS4K+37v8BepXUTgB+kmn4HnJeWHwv8vLTd64D56dgPAkeU1h2cjv0UsAr4RIPn/3ZgSWn+LIpPen9Kz7v32OcCK9JruRB4W2mfM4GrKN4gfg8cB0xM/y6eAv4H+BpFb7x3n/3S834CWARM6ev4ad2vgXcM9e9WTo8hL8CPFl60OuGelj8MHJ+mL+LFcP8C8E1g8/R4Gy8Oyb2krVKAXgxsC2xN/XBfBeyZtvl+7y83fYR7mj6zHASl9nrD/e+BJcBuwHYpCC+pqe3bqa43AM8Cr29wni6meOPZPu37K2Bmozpr9u3vnN1LEdSjgV+UzvWbKN6U9gVGULwRLQO2TPOLgHPSedsKOCDtdywp3NO6FcAHgZGpzUeByWn9alIAAzsCezd4DicA1zY616VlRwOvTMc6GVhDerNKr9dzwDSKN8atgVuAfwO2AA6gCP3e138cxZvWwWn7v0rzHY2On5bPA04c6t+tnB4elsnLbynCptZzwE4U48vPRcTNkX6j+nBmRDwTEX9ssP6SiLg3Ip4BPgsc0XvBtaKjgK9ExNKIeJqih3xkzXDAP0fEHyNiEUVYvqG2kVTLkcCpEfFURCwDvgwcM8A6+jtn50XEioh4jKJHOj0tnwV8KyJui4jnI2IuxRvQfhSfrnYGPpnO7Z8i4ud1jn0osCwivhMR6yPilxRvoO8r1TZZ0isi4vGIuLPBcxhF0bvuU0R8NyJ+l471ZYo3oteWNrklIq6O4npOB/AW4PSIWJfqn1fa9mjguoi4LiJeiIj5QDdF2PflqVSvtYnDPS/jKD7G1/oSRW/4eklLJc0eQFsrmli/nKJ3O2ZAVfZt59Reue2RFBeQe5XvbvkDRQ+/1phUU21b4wZYR3/nrPb575ymdwVOlvRE74Oih79z+rk8Itb3c+xdgX1r2jgKeHVa/7cUYblc0s8kvbVBO49TfGrpk6RPSFos6cl0rB146WtZfq47A49FxB8arN8VeF9N7QdQvFH2ZXuKYRxrE4d7JiS9hSK4NugJpp7ryRGxG/Be4CRJB/aubtBkfz37CaXpXSh6k48CzwDblOoaQdHbG2i7v6UIiHLb64FH+tmv1qOpptq2Vg1k537OGWz4/H+bplcAZ0XEqNJjm4i4LK3bZQAXJVcAP6tpY7uIOD7VdkdETAVeBVwNXNmgnbuBPWqfWnlG0tsorrUcAewYEaOAJwE12Gc1MFrSNqVl5XOxguJTXbn2bSPdrVN7/JLXU3wKszZxuA9zkl4h6VDgcopxz3vqbHOopN0lieIX93mKi4lQhOZuLRz6aEmT0y/554CrIuJ5inHtrSQdImlziouY5bsgHgE6JTX6t3cZ8HFJEyVtB/wLcMUAersvkWq5EjhL0vaSdgVOYoB3jvRzzgBOkDRe0mjgM8AVafm3gQ9J2leFbdO52J7iAvRq4Oy0fCtJ+9c5/I+APSQdI2nz9HiLpNdL2iLdE79DRDxHMd7d6PbX24FRksqfVmpf7+0p3jx7gJGSTgde0ei8RMRyimGWM1MtbwUOK23yXeAwSX8taUR6jlMkjW9wfFJ9o4FbGx3XmudwH77+S9JTFD2lzwBfobgAV88kirsanqa4GPb1iLgxrfsCcFr6CP2JJo5/CcVF2zUUFwZPBIiIJ4EPA+dT9JKfAVaW9vvP9PN3kuqNFV+Y2r4JeIjizoqPNFFX2UfS8ZdSfKL5Xmp/IPo6Z6S2rk9t/wb4PEBEdAP/AJxHMSyyhOJiae8bzmHA7hQXv1cC7689cEQ8Bbyb4prBbynO8Rd58U3yGGCZpN8DH6IYstlARKyjeI3K3yk4Fzhc0uOSvgr8BPgxxZvycorz3d+Q3FHAWykulH6e4o3t2XTMFcBU4NMUbxgrgE/yYtbUHh/g74C5EfFsP8e1JvhLTGZNauWLX0MlffPzZuBNfVwcr3qMK4AHIuKMFvbdkmI45u2R8XcOhoJ77mYZi4ieiHhdO4M9DRG9RtJmkg6i6Klf3WJ9z6b6HOxtluu3D81s8Lya4vsHr6QYWjo+3a5pmxAPy5iZZcjDMmZmGdokhmXGjBkTnZ2dQ12GmdmwsnDhwkcjou6fS94kwr2zs5Pu7u6hLsPMbFiRtLzROg/LmJllyOFuZpYhh7uZWYYc7mZmGXK4m5llyOFuZpYhh7uZWYYc7mZmGXK4m5llaJP4hmoVnbOvHeoSsrXs7EOGugQza5F77mZmGXK4m5llyOFuZpYhh7uZWYYc7mZmGXK4m5llyOFuZpYhh7uZWYYc7mZmGXK4m5llyOFuZpYhh7uZWYb6DXdJF0paK+neOutOlhSSxqR5SfqqpCWS7pa092AUbWZmfRtIz/0i4KDahZImAO8GHi4tfg8wKT1mAd+oXqKZmTWr33CPiJuAx+qsOgc4BYjSsqnAxVG4FRglaae2VGpmZgPW0pi7pKnAqohYVLNqHLCiNL8yLavXxixJ3ZK6e3p6WinDzMwaaDrcJW0DfBo4vcqBI2JORHRFRFdHR0eVpszMrEYr/xPTa4CJwCJJAOOBOyXtA6wCJpS2HZ+WmZnZRtR0zz0i7omIV0VEZ0R0Ugy97B0Ra4B5wAfSXTP7AU9GxOr2lmxmZv0ZyK2QlwG3AK+VtFLSzD42vw5YCiwBvg18uC1VmplZU/odlomI6f2s7yxNB3BC9bLMzKwKf0PVzCxDDnczsww53M3MMuRwNzPLkMPdzCxDDnczsww53M3MMuRwNzPLkMPdzCxDDnczsww53M3MMuRwNzPLkMPdzCxDDnczsww53M3MMuRwNzPLUCv/h6pZJZ2zrx3qErK17OxDhroE20S4525mlqGB/B+qF0paK+ne0rIvSXpA0t2SfihpVGndqZKWSHpQ0l8PVuFmZtbYQHruFwEH1SybD+wZEXsBvwJOBZA0GTgS+LO0z9cljWhbtWZmNiD9hntE3AQ8VrPs+ohYn2ZvBcan6anA5RHxbEQ8BCwB9mljvWZmNgDtGHP/e+C/0/Q4YEVp3cq0bAOSZknqltTd09PThjLMzKxXpXCX9BlgPXBps/tGxJyI6IqIro6OjiplmJlZjZZvhZR0LHAocGBERFq8CphQ2mx8WmZmZhtRSz13SQcBpwDvjYg/lFbNA46UtKWkicAk4PbqZZqZWTP67blLugyYAoyRtBI4g+LumC2B+ZIAbo2ID0XEfZKuBO6nGK45ISKeH6zizcysvn7DPSKm11l8QR/bnwWcVaUoMzOrxt9QNTPLkMPdzCxDDnczsww53M3MMuRwNzPLkMPdzCxDDnczsww53M3MMuRwNzPLkMPdzCxDDnczsww53M3MMuRwNzPLkMPdzCxDDnczsww53M3MMuRwNzPLkMPdzCxD/Ya7pAslrZV0b2nZaEnzJf06/dwxLZekr0paIuluSXsPZvFmZlbfQHruFwEH1SybDdwQEZOAG9I8wHuASekxC/hGe8o0M7Nm9BvuEXET8FjN4qnA3DQ9F5hWWn5xFG4FRknaqV3FmpnZwLQ65j42Ilan6TXA2DQ9DlhR2m5lWmZmZhtR5QuqERFANLufpFmSuiV19/T0VC3DzMxKWg33R3qHW9LPtWn5KmBCabvxadkGImJORHRFRFdHR0eLZZiZWT2thvs8YEaangFcU1r+gXTXzH7Ak6XhGzMz20hG9reBpMuAKcAYSSuBM4CzgSslzQSWA0ekza8DDgaWAH8APjgINZuZWT/6DfeImN5g1YF1tg3ghKpFmZlZNf6GqplZhhzuZmYZcribmWXI4W5mliGHu5lZhhzuZmYZcribmWXI4W5mliGHu5lZhhzuZmYZcribmWXI4W5mliGHu5lZhhzuZmYZcribmWXI4W5mliGHu5lZhhzuZmYZcribmWWoUrhL+rik+yTdK+kySVtJmijpNklLJF0haYt2FWtmZgPTcrhLGgecCHRFxJ7ACOBI4IvAORGxO/A4MLMdhZqZ2cBVHZYZCWwtaSSwDbAaeCdwVVo/F5hW8RhmZtaklsM9IlYB/wY8TBHqTwILgSciYn3abCUwrt7+kmZJ6pbU3dPT02oZZmZWR5VhmR2BqcBEYGdgW+Cgge4fEXMioisiujo6Olotw8zM6qgyLPMu4KGI6ImI54AfAPsDo9IwDcB4YFXFGs3MrElVwv1hYD9J20gScCBwP3AjcHjaZgZwTbUSzcysWVXG3G+juHB6J3BPamsO8CngJElLgFcCF7ShTjMza8LI/jdpLCLOAM6oWbwU2KdKu2ZmVo2/oWpmliGHu5lZhhzuZmYZcribmWXI4W5mliGHu5lZhhzuZmYZcribmWXI4W5mliGHu5lZhir9+QEze3nonH3tUJeQrWVnHzIo7brnbmaWIYe7mVmGHO5mZhlyuJuZZcjhbmaWIYe7mVmGHO5mZhmqFO6SRkm6StIDkhZLequk0ZLmS/p1+rlju4o1M7OBqdpzPxf4cUS8DngDsBiYDdwQEZOAG9K8mZltRC2Hu6QdgLcDFwBExLqIeAKYCsxNm80FplUt0szMmlOl5z4R6AG+I+mXks6XtC0wNiJWp23WAGPr7SxplqRuSd09PT0VyjAzs1pVwn0ksDfwjYh4E/AMNUMwERFA1Ns5IuZERFdEdHV0dFQow8zMalUJ95XAyoi4Lc1fRRH2j0jaCSD9XFutRDMza1bL4R4Ra4AVkl6bFh0I3A/MA2akZTOAaypVaGZmTav6J38/AlwqaQtgKfBBijeMKyXNBJYDR1Q8hpmZNalSuEfEXUBXnVUHVmnXzMyq8TdUzcwy5HA3M8uQw93MLEMOdzOzDDnczcwy5HA3M8uQw93MLEMOdzOzDDnczcwy5HA3M8uQw93MLEMOdzOzDDnczcwy5HA3M8uQw93MLEMOdzOzDDnczcwy5HA3M8tQ5XCXNELSLyX9KM1PlHSbpCWSrkj/v6qZmW1E7ei5fxRYXJr/InBOROwOPA7MbMMxzMysCZXCXdJ44BDg/DQv4J3AVWmTucC0KscwM7PmVe25/ztwCvBCmn8l8ERErE/zK4Fx9XaUNEtSt6Tunp6eimWYmVlZy+Eu6VBgbUQsbGX/iJgTEV0R0dXR0dFqGWZmVsfICvvuD7xX0sHAVsArgHOBUZJGpt77eGBV9TLNzKwZLffcI+LUiBgfEZ3AkcBPI+Io4Ebg8LTZDOCaylWamVlTBuM+908BJ0laQjEGf8EgHMPMzPpQZVjm/0XEAmBBml4K7NOOds3MrDX+hqqZWYYc7mZmGXK4m5llyOFuZpYhh7uZWYYc7mZmGXK4m5llyOFuZpYhh7uZWYYc7mZmGXK4m5llyOFuZpYhh7uZWYYc7mZmGXK4m5llyOFuZpYhh7uZWYYc7mZmGWo53CVNkHSjpPsl3Sfpo2n5aEnzJf06/dyxfeWamdlAVOm5rwdOjojJwH7ACZImA7OBGyJiEnBDmjczs42o5XCPiNURcWeafgpYDIwDpgJz02ZzgWlVizQzs+a0ZcxdUifwJuA2YGxErE6r1gBj23EMMzMbuMrhLmk74PvAxyLi9+V1ERFANNhvlqRuSd09PT1VyzAzs5JK4S5pc4pgvzQifpAWPyJpp7R+J2BtvX0jYk5EdEVEV0dHR5UyzMysRpW7ZQRcACyOiK+UVs0DZqTpGcA1rZdnZmatGFlh3/2BY4B7JN2Vln0aOBu4UtJMYDlwRLUSzcysWS2He0T8HFCD1Qe22q6ZmVXnb6iamWXI4W5mliGHu5lZhhzuZmYZcribmWXI4W5mliGHu5lZhhzuZmYZcribmWXI4W5mliGHu5lZhhzuZmYZcribmWXI4W5mliGHu5lZhhzuZmYZcribmWXI4W5mliGHu5lZhgYt3CUdJOlBSUskzR6s45iZ2YYGJdwljQC+BrwHmAxMlzR5MI5lZmYbGqye+z7AkohYGhHrgMuBqYN0LDMzqzFykNodB6woza8E9i1vIGkWMCvNPi3pwUGqZVMzBnh0qIsYCH1xqCvYZPg1G16GzesFlV+zXRutGKxw71dEzAHmDNXxh4qk7ojoGuo6bOD8mg0vfr0KgzUsswqYUJofn5aZmdlGMFjhfgcwSdJESVsARwLzBulYZmZWY1CGZSJivaR/An4CjAAujIj7BuNYw9DLbigqA37Nhhe/XoAiYqhrMDOzNvM3VM3MMuRwNzPLkMO9RZKmSQpJr2uwfoGkPm/HKm8j6TpJowaj1pczSWMlfU/SUkkLJd0i6W9abOtjkrZpd40vV5JeLelySb9Jr811kmZJ+lGD7c9v5Zvukt4o6eDqFQ8vDvfWTQd+nn5WFhEHR8QT7WjLCpIEXA3cFBG7RcSbKe7cGt9ikx8DHO5tkF6bHwILIuI16bU5FRjbaJ+IOC4i7m/hcG8E6oa7pCH7rs9gc7i3QNJ2wAHATIqwQNLWqReyWNIPga1L27879RjvlPSfaf/aNpdJGpOmj5Z0u6S7JH0r/a0ea947gXUR8c3eBRGxPCL+Q9IISV+SdIekuyX9I4CkKekT1VWSHpB0qQonAjsDN0q6MW07XdI9ku6VXvyeYaPl9hJ/CTxX89osAm4Gtqs9/7DBJ92nJZ0laZGkWyWNTcvfl877Ikk3pVuxPwe8P/0+vV/SmZIukfQL4BJJnZJuTr+fd0r6i9TWlNTGtemPIH5T0vDJzIjwo8kHcBRwQZr+X+DNwEkUt3wC7AWsB7oovgp9E7BtWvcp4PQ0vQDoStPL0ravB/4L2Dwt/zrwgaF+zsPxAZwInNNg3SzgtDS9JdANTASmAE9S9O43A24BDii/Rml6Z+BhoIPiluKfAtMaLR/qc7GpPRq9Nv2c//LvSwCHpel/Lb2W9wDj0vSo9PNY4LzSMc4EFgJbp/ltgK3S9CSgu1TLn4DdKG7png8cPtTnbqCPbD+SDLLpwLlp+vI0vzvwVYCIuFvS3Wn9fhR/GfMXqQOyBcU/2EYOpHizuCNtvzWwts31vyxJ+hrFJ651wHJgL0mHp9U7UPxirwNuj4iVaZ+7gE6KIbiyt1AMKfSk7S4F3k4ROvWWXz14zyw7Azn/64DesfmFwF+l6V8AF0m6EvhBH8eYFxF/TNObA+dJeiPwPLBHTS1LUy2XUfz7uaqVJ7WxOdybJGk0xcf9P5cUFO/oAfyy0S7A/IgY6Ni8gLkRcWrlYu0+4G97ZyLihDT01U3Ru/5IRPykvIOkKcCzpUXP49+TwXAfcHiDdQM5/89F6l6Xt4mID0naFzgEWCjpzQ2O8Uxp+uPAI8AbKD4t/Km0rvaLQMPmi0HDZ/xo03E4cElE7BoRnRExAXiIovfwdwCS9qQYmgG4Fdhf0u5p3baS9qjTbq8bgMMlvSptP1pSw7/8Zn36KbCVpONLy3oviP4EOF7S5gCS9pC0bT/tPQVsn6ZvB94haUy6JjId+Fkfy+2lfgpsqeKvwwIgaS/gbVUalfSaiLgtIk4Heij+xlX5datnB2B1RLwAHEPRYeu1j4o/o7IZ8H42/ASxyXK4N286xVX+su9TjNduJ2kxxQWchQDp4/mxwGVpqOYWoO7tk2n7+4HTgOvT9vOBndr8HF4WUs9uGkXYPiTpdmAuxXWP84H7gTsl3Qt8i/576HOAH0u6MSJWA7OBG4FFwMKIuKbR8kF4esNaem3+BniXilsh7wO+AKyp2PSXei9mU1wPW0TxWkzuvaBaZ5+vAzMkLaL43Sz36u8AzgMWU3Tian/3N1n+8wNmZnWkIbpPRMShQ11LK9xzNzPLkHvuZmYZcs/dzCxDDnczsww53M3MMuRwNzPLkMPdzCxD/wfcfkJfXM+H0wAAAABJRU5ErkJggg==\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "X, y = load_penguins(return_X_y=True)\n", "\n", "plt.bar(Counter(y).keys(), Counter(y).values())\n", "plt.title('Distribution of species (target)');" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Fetch creditcard\n", "\n", "Load the creditcard dataset. Download it if necessary.\n", "\n", "Note that internally this is using `fetch_openml` from scikit-learn, which is experimental.\n", "\n", "```\n", "============== ==============\n", "Samples total 284807\n", "Dimensionality 29\n", "Features real\n", "Target int 0, 1\n", "============== ==============\n", "```\n", "The datasets contains transactions made by credit cards in September 2013 by european\n", "cardholders. This dataset present transactions that occurred in two days, where we have\n", "492 frauds out of 284,807 transactions. The dataset is highly unbalanced, the positive\n", "class (frauds) account for 0.172% of all transactions.\n", "\n", "Please cite:\n", " Andrea Dal Pozzolo, Olivier Caelen, Reid A. Johnson and Gianluca Bontempi.\n", " Calibrating Probability with Undersampling for Unbalanced Classification.\n", " In Symposium on Computational Intelligence and Data Mining (CIDM), IEEE, 2015" ] }, { "cell_type": "code", "execution_count": 14, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
V1V2V3V4V5V6V7V8V9V10...V21V22V23V24V25V26V27V28AmountClass
0-1.359807-0.0727812.5363471.378155-0.3383210.4623880.2395990.0986980.3637870.090794...-0.0183070.277838-0.1104740.0669280.128539-0.1891150.133558-0.021053149.620
11.1918570.2661510.1664800.4481540.060018-0.082361-0.0788030.085102-0.255425-0.166974...-0.225775-0.6386720.101288-0.3398460.1671700.125895-0.0089830.0147242.690
2-1.358354-1.3401631.7732090.379780-0.5031981.8004990.7914610.247676-1.5146540.207643...0.2479980.7716790.909412-0.689281-0.327642-0.139097-0.055353-0.059752378.660
3-0.966272-0.1852261.792993-0.863291-0.0103091.2472030.2376090.377436-1.387024-0.054952...-0.1083000.005274-0.190321-1.1755750.647376-0.2219290.0627230.061458123.500
4-1.1582330.8777371.5487180.403034-0.4071930.0959210.592941-0.2705330.8177390.753074...-0.0094310.798278-0.1374580.141267-0.2060100.5022920.2194220.21515369.990
\n", "

5 rows × 30 columns

\n", "
" ], "text/plain": [ " V1 V2 V3 V4 V5 V6 V7 \\\n", "0 -1.359807 -0.072781 2.536347 1.378155 -0.338321 0.462388 0.239599 \n", "1 1.191857 0.266151 0.166480 0.448154 0.060018 -0.082361 -0.078803 \n", "2 -1.358354 -1.340163 1.773209 0.379780 -0.503198 1.800499 0.791461 \n", "3 -0.966272 -0.185226 1.792993 -0.863291 -0.010309 1.247203 0.237609 \n", "4 -1.158233 0.877737 1.548718 0.403034 -0.407193 0.095921 0.592941 \n", "\n", " V8 V9 V10 ... V21 V22 V23 V24 \\\n", "0 0.098698 0.363787 0.090794 ... -0.018307 0.277838 -0.110474 0.066928 \n", "1 0.085102 -0.255425 -0.166974 ... -0.225775 -0.638672 0.101288 -0.339846 \n", "2 0.247676 -1.514654 0.207643 ... 0.247998 0.771679 0.909412 -0.689281 \n", "3 0.377436 -1.387024 -0.054952 ... -0.108300 0.005274 -0.190321 -1.175575 \n", "4 -0.270533 0.817739 0.753074 ... -0.009431 0.798278 -0.137458 0.141267 \n", "\n", " V25 V26 V27 V28 Amount Class \n", "0 0.128539 -0.189115 0.133558 -0.021053 149.62 0 \n", "1 0.167170 0.125895 -0.008983 0.014724 2.69 0 \n", "2 -0.327642 -0.139097 -0.055353 -0.059752 378.66 0 \n", "3 0.647376 -0.221929 0.062723 0.061458 123.50 0 \n", "4 -0.206010 0.502292 0.219422 0.215153 69.99 0 \n", "\n", "[5 rows x 30 columns]" ] }, "execution_count": 14, "metadata": {}, "output_type": "execute_result" } ], "source": [ "from sklego.datasets import fetch_creditcard\n", "\n", "dict_creditcard = fetch_creditcard(as_frame=True)\n", "\n", "df_creditcard = dict_creditcard['frame']\n", "df_creditcard.head()" ] }, { "cell_type": "code", "execution_count": 15, "metadata": {}, "outputs": [ { "data": { "image/png": "iVBORw0KGgoAAAANSUhEUgAAAYkAAAEICAYAAACqMQjAAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADh0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uMy4yLjIsIGh0dHA6Ly9tYXRwbG90bGliLm9yZy+WH4yJAAAWCElEQVR4nO3dfbSdZXnn8e+viaBT1IBkEAMaqmmX0VlFTSGu2qmVLghYB9pRi9NKdBjTVmjrVKfiS4VRaHFmqVNGpYuWLAKogGhLOkYxorPUNSIERV6lnIVgEhEi4c33Bq/5Y98Hd477PuckOTknku9nrb3O3tdzP/dz7ZPD/u3nZW9SVUiSNMovzHUDkqQ9lyEhSeoyJCRJXYaEJKnLkJAkdRkSkqQuQ0K7JMnfJfmrGZrr6Um+m2Ree/x/k/yXmZi7zffJJCtnar4d2O6ZSb6T5Nud5X+S5J723J8yC/1ckOTMSZYvTPL1JE/Y3b3sjCT7tv4WznUvewNDQl1J7kzygyQPJ3kgyf9L8sdJHv27qao/rqp3TXOu355sTFV9s6r2q6pHZqD3M5JcPGH+Y6tqza7OvYN9PB14I7C0qp46YvnjgPcCR7fnft9s9tdxGnBBVf0AZj6sd9TE7VfVj4DVDPrUbmZIaCovq6onAs8AzgbeDJw/0xtJMn+m59xDPB24r6ru7Sw/CHg8cPOohbP9e0myL7ASuHiqsTsw5+54Dh8GVrZ+tRsZEpqWqnqwqtYCv8/gP87nwvaHLpIcmOT/tL2OrUm+kOQXklzE4MXyn9shlb9MsjhJJTk5yTeBzw7Vhl9UnpnkmiQPJbkiyQFtWy9Osmm4x/G9lSQrgLcCv9+297W2/NF3pK2vtye5K8m9SS5M8uS2bLyPlUm+2Q4Vva33u0ny5Lb+ljbf29v8vw2sB57W+rhgwnq/DNzWHj6Q5LOtXklOSXI7cHur/W2Sje33cF2S3xiaZ7vDRxN/N0mel+QrbY/wUgah1HMk8EBVbWrrngX8BvD+9hzeP41+zkhyeZKLkzwEvCbJYUk+33r4TJIPDO/pJVne9lQfSPK1JC+ebPutv/uB5ZM8F80AQ0I7pKquATYx+A93oje2ZQsZvEN+62CVejXwTQZ7JftV1f8YWuc3gWcDx3Q2eRLwn4GDgW3AOdPo8VPAXwOXtu396ohhr2m33wJ+CdgPeP+EMS8CfgU4CnhHkmd3Nvm/gSe3eX6z9fzaqvoMcCzwrdbHayb0+S/Ac9rDBVX1kqHFJzB4wV7aHl8LHA4cwOBd9EeTTPZiD0CSfYB/Ai5q634U+I+TrPLv+GlwUVVvA74AnNqew6nT7Od44HJgAfChNuYa4CnAGcCrh3pcBHwCOLPN9ybgY0kWTrJ9gFuBUf+2mkGGhHbGtxj8xzzRvzJ4MX9GVf1rVX2hpv5ysDOq6nvjx79HuKiqbqqq7wF/Bbwy7cT2LvoD4L1VdUdVfRd4C3DihL2Y/15VP6iqrwFfY8QLUuvlROAtVfVwVd0JvIehF8Gd9DdVtXX891JVF1fVfVW1rareA+zLIMCmshx4HPC/2r/J5Qxe4HsWAA9PNek0+vlSVf1TVf2EwZuGXwPeUVU/rqovAmuHxv4hsK6q1lXVT6pqPbABOG6KNh5u/Wo3MiS0MxYBW0fU/ycwBnw6yR1JpnNiceMOLL+LwQvegdPqcnJPa/MNzz2fwR7QuOGrkb7PYG9jogNbTxPnWrSL/W33e0nypiS3JnkwyQMM9lym83t4GrB5Qljf1RvM4BDOE6eadBr9DPf/NGBrVX2/s/wZwCvaoaYH2nwvYvCGYzJPBB6YqlftGkNCOyTJrzF4AfzixGXtnfQbq+qXgP8A/EWSo8YXd6acak/j0KH7T2ewt/Id4HvAvxnqax6Dd6zTnfdbDF6chufeBtwzxXoTfaf1NHGuzTs4z0SP9t+O9/8l8Epg/6paADwIpA3Z7ncBDF9FdTewKEmGak+fZLs3AL/c62Wa/Uxc527ggCTDPQ7/u25ksMe4YOj2i1V19qjtD3k2gz087UaGhKYlyZOS/A5wCXBxVd04YszvJHlWe0F6EHgE+ElbfA+DY/Y76g+TLG0vMO8ELm+XyP4L8PgkL83gMtK3MzjkMe4eYHGGLted4CPAf20nVPfjp+cwtu1Ic62Xy4CzkjwxyTOAv2AGrw5i8I55G7AFmJ/kHcCThpZfDxyX5IAkTwXeMLTsS23dP0vyuCS/BxwxybauARa08wTjJv7bTdXPdqrqLgaHj85Isk+SFwIvGxpyMfCyJMckmZfk8e3k+yGd7Y+fxzgAuHqS56IZYEhoKv+c5GEG7/bexuCa/td2xi4BPgN8l8GL0wer6nNt2d8Ab2+HE960A9u/CLiAwaGfxwN/BoOrrYDXA//A4F379xicNB/30fbzviRfGTHv6jb354FvAD8E/nQH+hr2p237dzDYw/pwm3+mXAl8ikEw3sWg1+HDNRcxeEd9J/Bp4NLxBVX1Y+D3GJyk38rg6rSP9zbUxl/A4DzBuL8FXp7k/iTnTKOfUf4AeCFwH4MT1JcCP2rb3MjgRPdbGQTPRuC/8dPXp4nbB/hPwJr2mQntRvF/OiRpWAafZP4C8LxJLijY1W1cCny9qk7fiXX3ZRCK/36Sz59ohhgSkna7di5rK4O9tqMZXJb7wqr66pw2pik9Vj/lKmnP8lQGh7mewuCw4J8YED8f3JOQJHV54lqS1PWYO9x04IEH1uLFi+e6DUn6uXLdddd9p6p+5uvXH3MhsXjxYjZs2DDXbUjSz5UkIz+J7+EmSVKXISFJ6jIkJEldhoQkqcuQkCR1GRKSpC5DQpLUZUhIkroMCUlS12PuE9e7YvFpn5jrFrSHuvPsl851C9KccE9CktRlSEiSugwJSVKXISFJ6jIkJEldhoQkqcuQkCR1GRKSpC5DQpLUZUhIkroMCUlSlyEhSeoyJCRJXYaEJKnLkJAkdRkSkqQuQ0KS1GVISJK6DAlJUpchIUnqMiQkSV1ThkSSQ5N8LsktSW5O8uetfkaSzUmub7fjhtZ5S5KxJLclOWaovqLVxpKcNlQ/LMmXW/3SJPu0+r7t8Vhbvngmn7wkaXLT2ZPYBryxqpYCy4FTkixty95XVYe32zqAtuxE4DnACuCDSeYlmQd8ADgWWAq8amied7e5ngXcD5zc6icD97f6+9o4SdIsmTIkquruqvpKu/8wcCuwaJJVjgcuqaofVdU3gDHgiHYbq6o7qurHwCXA8UkCvAS4vK2/BjhhaK417f7lwFFtvCRpFuzQOYl2uOd5wJdb6dQkNyRZnWT/VlsEbBxabVOr9epPAR6oqm0T6tvN1ZY/2MZP7GtVkg1JNmzZsmVHnpIkaRLTDokk+wEfA95QVQ8B5wLPBA4H7gbes1s6nIaqOq+qllXVsoULF85VG5L0mDOtkEjyOAYB8aGq+jhAVd1TVY9U1U+Av2dwOAlgM3Do0OqHtFqvfh+wIMn8CfXt5mrLn9zGS5JmwXSubgpwPnBrVb13qH7w0LDfBW5q99cCJ7Yrkw4DlgDXANcCS9qVTPswOLm9tqoK+Bzw8rb+SuCKoblWtvsvBz7bxkuSZsH8qYfw68CrgRuTXN9qb2VwddLhQAF3An8EUFU3J7kMuIXBlVGnVNUjAElOBa4E5gGrq+rmNt+bgUuSnAl8lUEo0X5elGQM2MogWCRJs2TKkKiqLwKjrihaN8k6ZwFnjaivG7VeVd3BTw9XDdd/CLxiqh4lSbuHn7iWJHUZEpKkLkNCktRlSEiSugwJSVKXISFJ6jIkJEldhoQkqcuQkCR1GRKSpC5DQpLUZUhIkroMCUlSlyEhSeoyJCRJXYaEJKnLkJAkdRkSkqQuQ0KS1GVISJK6DAlJUpchIUnqMiQkSV2GhCSpy5CQJHUZEpKkLkNCktQ1ZUgkOTTJ55LckuTmJH/e6gckWZ/k9vZz/1ZPknOSjCW5Icnzh+Za2cbfnmTlUP0FSW5s65yTJJNtQ5I0O6azJ7ENeGNVLQWWA6ckWQqcBlxVVUuAq9pjgGOBJe22CjgXBi/4wOnAkcARwOlDL/rnAq8bWm9Fq/e2IUmaBVOGRFXdXVVfafcfBm4FFgHHA2vasDXACe3+8cCFNXA1sCDJwcAxwPqq2lpV9wPrgRVt2ZOq6uqqKuDCCXON2oYkaRbs0DmJJIuB5wFfBg6qqrvbom8DB7X7i4CNQ6ttarXJ6ptG1JlkGxP7WpVkQ5INW7Zs2ZGnJEmaxLRDIsl+wMeAN1TVQ8PL2h5AzXBv25lsG1V1XlUtq6plCxcu3J1tSNJeZVohkeRxDALiQ1X18Va+px0qov28t9U3A4cOrX5Iq01WP2REfbJtSJJmwXSubgpwPnBrVb13aNFaYPwKpZXAFUP1k9pVTsuBB9shoyuBo5Ps305YHw1c2ZY9lGR529ZJE+YatQ1J0iyYP40xvw68GrgxyfWt9lbgbOCyJCcDdwGvbMvWAccBY8D3gdcCVNXWJO8Crm3j3llVW9v91wMXAE8APtluTLINSdIsmDIkquqLQDqLjxoxvoBTOnOtBlaPqG8Anjuift+obUiSZoefuJYkdRkSkqQuQ0KS1GVISJK6DAlJUpchIUnqMiQkSV2GhCSpy5CQJHUZEpKkLkNCktRlSEiSugwJSVKXISFJ6jIkJEldhoQkqcuQkCR1GRKSpC5DQpLUZUhIkroMCUlSlyEhSeoyJCRJXYaEJKnLkJAkdRkSkqQuQ0KS1DVlSCRZneTeJDcN1c5IsjnJ9e123NCytyQZS3JbkmOG6itabSzJaUP1w5J8udUvTbJPq+/bHo+15Ytn6klLkqZnOnsSFwArRtTfV1WHt9s6gCRLgROB57R1PphkXpJ5wAeAY4GlwKvaWIB3t7meBdwPnNzqJwP3t/r72jhJ0iyaMiSq6vPA1mnOdzxwSVX9qKq+AYwBR7TbWFXdUVU/Bi4Bjk8S4CXA5W39NcAJQ3OtafcvB45q4yVJs2RXzkmcmuSGdjhq/1ZbBGwcGrOp1Xr1pwAPVNW2CfXt5mrLH2zjJUmzZGdD4lzgmcDhwN3Ae2aso52QZFWSDUk2bNmyZS5bkaTHlJ0Kiaq6p6oeqaqfAH/P4HASwGbg0KGhh7Rar34fsCDJ/An17eZqy5/cxo/q57yqWlZVyxYuXLgzT0mSNMJOhUSSg4ce/i4wfuXTWuDEdmXSYcAS4BrgWmBJu5JpHwYnt9dWVQGfA17e1l8JXDE018p2/+XAZ9t4SdIsmT/VgCQfAV4MHJhkE3A68OIkhwMF3An8EUBV3ZzkMuAWYBtwSlU90uY5FbgSmAesrqqb2ybeDFyS5Ezgq8D5rX4+cFGSMQYnzk/c5WcrSdohU4ZEVb1qRPn8EbXx8WcBZ42orwPWjajfwU8PVw3Xfwi8Yqr+JEm7j5+4liR1GRKSpC5DQpLUZUhIkroMCUlSlyEhSeoyJCRJXYaEJKnLkJAkdRkSkqQuQ0KS1GVISJK6DAlJUpchIUnqMiQkSV2GhCSpy5CQJHUZEpKkLkNCktRlSEiSugwJSVKXISFJ6jIkJEldhoQkqcuQkCR1GRKSpC5DQpLUZUhIkrqmDIkkq5Pcm+SmodoBSdYnub393L/Vk+ScJGNJbkjy/KF1VrbxtydZOVR/QZIb2zrnJMlk25AkzZ7p7ElcAKyYUDsNuKqqlgBXtccAxwJL2m0VcC4MXvCB04EjgSOA04de9M8FXje03ooptiFJmiVThkRVfR7YOqF8PLCm3V8DnDBUv7AGrgYWJDkYOAZYX1Vbq+p+YD2woi17UlVdXVUFXDhhrlHbkCTNkp09J3FQVd3d7n8bOKjdXwRsHBq3qdUmq28aUZ9sGz8jyaokG5Js2LJly048HUnSKLt84rrtAdQM9LLT26iq86pqWVUtW7hw4e5sRZL2KjsbEve0Q0W0n/e2+mbg0KFxh7TaZPVDRtQn24YkaZbsbEisBcavUFoJXDFUP6ld5bQceLAdMroSODrJ/u2E9dHAlW3ZQ0mWt6uaTpow16htSJJmyfypBiT5CPBi4MAkmxhcpXQ2cFmSk4G7gFe24euA44Ax4PvAawGqamuSdwHXtnHvrKrxk+GvZ3AF1ROAT7Ybk2xDkjRLpgyJqnpVZ9FRI8YWcEpnntXA6hH1DcBzR9TvG7UNSdLs8RPXkqQuQ0KS1GVISJK6DAlJUpchIUnqMiQkSV2GhCSpy5CQJHUZEpKkLkNCktRlSEiSugwJSVKXISFJ6jIkJEldhoQkqcuQkCR1GRKSpC5DQpLUZUhIkroMCUlSlyEhSeoyJCRJXYaEJKnLkJAkdRkSkqQuQ0KS1GVISJK6dikkktyZ5MYk1yfZ0GoHJFmf5Pb2c/9WT5JzkowluSHJ84fmWdnG355k5VD9BW3+sbZudqVfSdKOmYk9id+qqsOrall7fBpwVVUtAa5qjwGOBZa02yrgXBiECnA6cCRwBHD6eLC0Ma8bWm/FDPQrSZqm3XG46XhgTbu/BjhhqH5hDVwNLEhyMHAMsL6qtlbV/cB6YEVb9qSqurqqCrhwaC5J0izY1ZAo4NNJrkuyqtUOqqq72/1vAwe1+4uAjUPrbmq1yeqbRtR/RpJVSTYk2bBly5ZdeT6SpCHzd3H9F1XV5iT/Flif5OvDC6uqktQubmNKVXUecB7AsmXLdvv2JGlvsUt7ElW1uf28F/hHBucU7mmHimg/723DNwOHDq1+SKtNVj9kRF2SNEt2OiSS/GKSJ47fB44GbgLWAuNXKK0Ermj31wIntauclgMPtsNSVwJHJ9m/nbA+GriyLXsoyfJ2VdNJQ3NJkmbBrhxuOgj4x3ZV6nzgw1X1qSTXApclORm4C3hlG78OOA4YA74PvBagqrYmeRdwbRv3zqra2u6/HrgAeALwyXaTJM2SnQ6JqroD+NUR9fuAo0bUCzilM9dqYPWI+gbguTvboyRp1/iJa0lSlyEhSeoyJCRJXYaEJKnLkJAkdRkSkqQuQ0KS1GVISJK6DAlJUpchIUnqMiQkSV2GhCSpy5CQJHUZEpKkLkNCktRlSEiSugwJSVKXISFJ6jIkJEldhoQkqcuQkCR1GRKSpC5DQpLUZUhIkroMCUlSlyEhSeoyJCRJXYaEJKlrjw+JJCuS3JZkLMlpc92PJO1N9uiQSDIP+ABwLLAUeFWSpXPblSTtPebPdQNTOAIYq6o7AJJcAhwP3DKnXUlzZPFpn5jrFrQHu/Psl874nHt6SCwCNg493gQcOXFQklXAqvbwu0lum4Xe9gYHAt+Z6yb2BHn3XHegDv9Gh+zi3+kzRhX39JCYlqo6Dzhvrvt4rEmyoaqWzXUfUo9/o7vfHn1OAtgMHDr0+JBWkyTNgj09JK4FliQ5LMk+wInA2jnuSZL2Gnv04aaq2pbkVOBKYB6wuqpunuO29iYewtOezr/R3SxVNdc9SJL2UHv64SZJ0hwyJCRJXYaEfoZfhaI9XZLVSe5NctNc9/JYZ0hoO34Vin5OXACsmOsm9gaGhCZ69KtQqurHwPhXoUh7jKr6PLB1rvvYGxgSmmjUV6EsmqNeJM0xQ0KS1GVIaCK/CkXSowwJTeRXoUh6lCGh7VTVNmD8q1BuBS7zq1C0p0nyEeBLwK8k2ZTk5Lnu6bHKr+WQJHW5JyFJ6jIkJEldhoQkqcuQkCR1GRKSpC5DQpLUZUhIkrr+P8VB63itb6pqAAAAAElFTkSuQmCC\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "X, y = dict_creditcard['data'], dict_creditcard['target']\n", "\n", "plt.bar(Counter(y).keys(), Counter(y).values())\n", "plt.title('Distribution of fraud (target)');" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Simple time series\n", "\n", "Generate a very simple timeseries dataset to play with. The generator assumes to generate daily data with a season, trend and noise." ] }, { "cell_type": "code", "execution_count": 16, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
yt
0-1.066620
10.303379
2-0.412812
3-0.227176
40.105581
\n", "
" ], "text/plain": [ " yt\n", "0 -1.066620\n", "1 0.303379\n", "2 -0.412812\n", "3 -0.227176\n", "4 0.105581" ] }, "execution_count": 16, "metadata": {}, "output_type": "execute_result" } ], "source": [ "from sklego.datasets import make_simpleseries\n", "\n", "df_simpleseries = make_simpleseries(as_frame=True, n_samples=1500, trend=0.001)\n", "df_simpleseries.head()" ] }, { "cell_type": "code", "execution_count": 17, "metadata": {}, "outputs": [ { "data": { "image/png": "iVBORw0KGgoAAAANSUhEUgAAAXIAAAEICAYAAABCnX+uAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADh0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uMy4yLjIsIGh0dHA6Ly9tYXRwbG90bGliLm9yZy+WH4yJAAAgAElEQVR4nO2dd3wUdfrHP086ARIghF5CLyLNiCCiUhQPVMRT7N0fp2c/y4Hl9PRU1NNTz8qh2FBULNioCor03oshBAg1oRNSd7+/P3ZmMzs7uzu7O7Mzs/u8X6+8sjv1yTczzzzzfJ9CQggwDMMwziXJagEYhmGY6GBFzjAM43BYkTMMwzgcVuQMwzAOhxU5wzCMw2FFzjAM43BYkTO2gog2EtH5VssRDCJ6h4iesFoOhpEhjiNnYgkRnVR8zQRQCcAlff+LEGJK7KWyN0T0FICOQojrrZaFsScpVgvAJBZCiHryZyIqAnC7EGKudRKFBxElCyFcobdkmNjBrhXGVhBRERENkz4/RURfEtEnRHSCiNYTUWciGk9EB4loNxFdqNg3m4jeI6J9RLSHiP5FRMnSuo5E9CsRHSOiUiL6XLFfVyKaQ0SHiWgrEY1RrPuAiN4mop+IqAzAYGnZvxTbXExEa4joKBEtIqKeinV/l2Q5IR17qMbffCYRHZBllZZdTkRriegiAI8CuIqIThLRWsMGm4kbWJEzducSAB8DaAhgNYBZ8Fy3LQE8DeBdxbYfAKgB0BFAHwAXArhdWvcMgNnScVoB+C8AEFFdAHMAfAqgCYCrAbxFRN0Vx70WwLMA6gP4XSkcEfUB8D6AvwDIkeT5jojSiagLgLsBnCmEqA9gOIAi9R8ohFgO4JAkr8wNAD4SQswE8ByAz4UQ9YQQvUKMF5OAsCJn7M4CIcQsIUQNgC8B5AKYIISoBjAVQB4RNSCipgBGALhfCFEmhDgI4D/wKGYAqAbQFkALIUSFEEJWyBcDKBJCTBZC1AghVgP4CsCVChmmCyEWCiHcQogKlXxjAbwrhFgqhHAJIT6Ex+/fHx7ffzqA7kSUKoQoEkJsD/B3fgjgegAgokbwKP1PIxkwJvFgRc7YnQOKz+UAShU+6nLpdz14lHQqgH2Si+MoPNZxE2mbRwAQgGVSZMyt0vK2AM6S95H2uw5AM8V5dweRry2AB1X7t4bngVEA4H4ATwE4SERTiahFgON8AuAS6Q1hDDwPsH1BzsswXniyk4kXdsNjCTeWrHcfhBD7AfwfABDROQDmEtFv0n6/CiEuCHLsYKFduwE8K4R4VnNHIT4F8CkRZcHzYHkBHreJers9RLQYwOXS+rd1np9h2CJn4gPJep0N4GUiyiKiJCLqQETnAQARXUlEraTNj8CjHN0AfgDQmYhuIKJU6edMIuqm89T/A3AHEZ1FHuoS0Ugiqk9EXYhoCBGlA6iA5w3CHeRYH8Hz5nA6gK8Vyw/A40Li+5XRhC8MJp64EUAagE3wKOtpAJpL684EsFSKY/8OwH1CiEIhxAl4JhmvBrAXwH54rOZ0PScUQqyAx9J/QzpnAYCbpdXpACYAKJWO2wTA+CCH+wYeV803QohTiuVfSr8PEdEqPXIxiQUnBDGMjSCi7fAkRjkmtp6xHrbIGcYmENGf4XH5/GK1LIyz4MlOhrEBRDQfQHcANwghgvnRGcYPdq0wDMM4HHatMAzDOBxLXCuNGzcWeXl5VpyaYRjGsaxcubJUCJGrXm6JIs/Ly8OKFSusODXDMIxjIaKdWsvZtcIwDONwWJEzDMM4HFbkDMMwDocVOcMwjMNhRc4wDONwWJEzDMM4HFbkDMMwDocVOcMwCUnBwRNYUnjIajEMgYtmMQyTkAx75TcAQNGEkRZLEj1skTMMwzgcVuQMwzAOhxU5wzCMw2FFzjAM43BYkTMMwzgcVuQMwzAOh8MPGYZJKKpdbkxbWWy1GIbCipxhmIRi0oIdeGHmFqvFMBR2rTAMkzAIIbBy52GrxTAcVuQMwyQM01YWY+7mgz7L3G5hkTTGwYqcYZiEYc3uo37LXIIVOcMwjGNwaVjfWsucBityhmESBlbkQSCiB4hoIxFtIKLPiCjDiOMyDMMYxZLCQ/hSI+ywqsZtgTTGErUiJ6KWAO4FkC+E6AEgGcDV0R6XYRjGSKav2aO5vKLGFWNJjMco10oKgDpElAIgE8Beg47LMAxjCESkubyimi1yCCH2APg3gF0A9gE4JoSYrd6OiMYS0QoiWlFSUhLtaRmGYcJCW40DFdVskYOIGgIYBaAdgBYA6hLR9erthBAThRD5Qoj83NzcaE/LMAwTFkkBLXJW5AAwDMAOIUSJEKIawNcAzjbguAzDMIYRQI+jnBU5AI9LpT8RZZLHCTUUwGYDjsswDGMYgSzySvaRA0KIpQCmAVgFYL10zInRHpdhGMZIAlnkE38rjK0gJmBI1IoQ4kkhRFchRA8hxA1CiEojjsswDGMEj36zHpMXFmmuW1x4CKt3HYmtQAbDmZ0Mw8Q9ny7dFXT96LcWofSkc+1PVuQMw8Q1Q1+er2u76/631FxBTIQVOcMwcc32kjJd2209cMJkScyDFTnDMIzDYUXOMA5HCIG8cT/ildlbrRaFsQhW5AzjcOQyrG/MK7BYEmcw7Y4BVotgOKzIGcbhxEE57ZjSo2W21SIYDityhnE47jhoVWYWWv04A2V4OhlW5AzjcGTXSqAyrYmMVj/O5KT4GydW5AzjcDbsOWa1CLZFq41bHOpxVuQM43SumrjEahFsi9rtNKRrE583l+cvPz3WIpkCK3KGYeIWtUU+6cZ8n+/X9GuD0+Ng8pMVOcPECS63wD+mb0Dnx2bgWHm11eJYzuLth3D6U77NypI0/Co1cRD2w4qcYeKIjxbvRJXLjbW7j1otiuW8/et2n+/vXH+G5na3DMwDAGSmJZstkmmkWC0AwzCRozWZx3jwt71rx+rHe8/B4bIqAMCY/NbYsu8E3l+4I2ayGQ1b5AzjYKpdzu9uYxZqL4py3vO0FtkY1Km2d/D2kpMAgO/X7o2FaIbDipxhHMjhsirM3XQgYA3taSuLIRI8UUgdVx9sNLLrpAIAtu53ZgVEdq0wjAO57cPlWL0rsB/8u7V74XILvHld3xhKZS/UrpVgGbDjR3TFd2v34o15Bbh/WCekJDvLxnWWtAzDAAAKddTY/nH9vhhIYl/8LPIgJnnDzDTv53UOTLBiRc4wDkSv2ySRfehqH/l5XXK1NwSQnpKk+dkpGCIxETUgomlEtIWINhNR/NWJZBgbodf9/cYvXNoWAH645xxkZaQGXK+03tMc5lYBjLPIXwMwUwjRFUAvAJsNOi7DQAiBV+duw56j5VaLYhv0VjyctXG/yZLYk6LSMszedMD7PZzStR8t3mmGSKYStSInomwA5wJ4DwCEEFVCCM5GYAxjy/4TeHXuH7h/6mqrRbENeuNREtW18si0dWHv85dz2wMAPl6SgIocQDsAJQAmE9FqIppERHXVGxHRWCJaQUQrSkpKDDgtkyicqnIBAKpdiR1Op0SvRb69pCwhszyV43NmXkNd+5zVvpFZ4piOEYo8BUBfAG8LIfoAKAMwTr2REGKiECJfCJGfmxt40oFh1FRWexR5RqrzfJdmUVGt39Ie9eZCrC92XiRGNCgVed10fVHWyUnOvb6MkLwYQLEQYqn0fRo8ip1hDKGyxqO00lKcWwvDakpOVlgtQkxRVi546pLTdO2T6uBC5VErciHEfgC7iaiLtGgogE3RHpdhZGTXChOaAe1zNItDZddJ09g6flGGZ+Y19vP0aqLuHLS08BCqapwxx2BUZuc9AKYQURqAQgC3GHRcJsEpOVGJuz5dBUC7/yLj4Zp+bTDuoq7IzkzFou2lfusTLV3/RGVN2PukJNcq8kUFpbh20lLcNKAt/jmqh5GimYIhilwIsQZAfsgNGSYEQghUuwTSpKSMvYqQQ67050ErDLNhZiqyMz1x0loJLfFQc1svlTUuXZmvapQ+8oelqJeVu44YJpeZONe7z8Qlny/fjc6Pz0DxkVMAAGWWtVYj3UTkE43wOOXY9G3jH6Wxed9xU2WyE5EocQBIUbhW5Ifl8fLwLXsrYEXO2Irv13nKiO4o9dyMpCh9xK4VD6kamYfKZ5y6xggA/PP7TQmTUBVp7HxGqv9k+vEKZ3RaYkXO2BICweUWWKjh7010tGIr+rZpEHK//ccSQ5FH+rzvkOs/KXqigi1yhomKt+YVYMKMLd7vbI970DC4cVGP5iH3S5SEqkjnUogI53X2zXFxyrwMK3LGtshdW2ScclOZTfPsDJ/vH9/WT9d+iZKur0wGOj2MGisA0Lt16DcbO8KKnLElWp1v1uw+mjDKKBib9/l2sVG2LAvGDe8tS4iHoXIuRRlSqId7h3bCuZ2dl3nOipyxFbIxdf/na7DtwEm/9Z0em4HyBE8Q+mBRkffzvUM6hrXvmjivu1JyohJHTlV5v6eGmXafnESOtMpZkTO2Qhl9UVDir8gBoNs/ZuLg8cRKOZf5/Q/V5K+WwzwIf357EcoiSJZxCmc+Oxd3fLLK+z1cixwA6qU7rxQEK3LGtgRLj56ydFcMJbEPszf51hePpDqIU9LOjUArVDMUeots2QlW5IwjSXNgOy4zSArTIgcSZ9ITAFIjsMjrprEiZ5iYEMkNGg+o/+oeLbPCPkZlAlnkKRGUpnWiRe48iZm4RuiMFo/kldnpTFtZjA8VbcgWPDIYrRtlhn2cRFLkfXQkSqlx4qXlQJEZBth56BQe/GJtQoTTyTz05Vqf75EocSBxfOSvjOmF/xvUPuz91CV9Oj76k0ESmQcrcgspPnIKK3cetloMR/LBoiJ8taoYBQe1I1vikRSDGh9U1iRG+Gb3FllIimDMcuun+3yvcQvblwFm14qFnPPCPABA0YSRFkviXBJFKQGeGGcjytEmikUeyUQwAPRs5e+OcblFRKGMsYItcsZWhGv43D91jTmC2JBwLPKXr+yFF6/oqbkuUcoBR6rItbD7mLEiZ2xFuLdLYWkZdh8+ZYosdiMcN8Gfz2iFMfmtNdclyryCunVbNLht/hLDitwGHDyRmFmKSp77aTPyxv2IZTt85wz0GFVvziswSSp7EYlReHnflji/i2/tkEgbL9idk6qMVSN7KdfYXJOzIrcB/Z792WoRLGfib4Way42a4IsHIlEmr4zpjQ9u6Ydrz2rjXfbkdxsd0zAhHLaouiAZ6VqxuR5nRW5Hjp6qQkV14kziBaNFgzoAgPYaRf8TjWiUSVZGqs/3ymqba6YIULuMjHStJIxFTkTJRLSaiH4w6pjxxMqdh/Hz5gMB1yur0vV+eg6um7Q0FmLZnvy2jQAAd5zXAVkZCR5kpdBLDTJTA2+nQeem9QwWxn6oJySNjDJJpMnO+wBsNvB4ccWf316M2z5cEXD9ZW8uxJb9ta+GK3c6o3u32SjfjrV6USYq4boNRvdpidev6eP97ra5YooEtUVeR6MHZ6TY3CA3RpETUSsAIwFMMuJ4icrhk1WhN0pUBPDQhZ2tlsI2hKvIicinW44R8eh2Q63ItZopR0qiuFZeBfAIAHv/tTagqsaNN375A6eq4rcmtJEo1dUNA/KwePwQv22OlcffxJ0WyrGIxP2brFD+rjjs36l+yzCyHs8fNs8gjvovJaKLARwUQqwMsd1YIlpBRCtKSkqiPa1j+WTJTvx79jaMeG2B/0r2HHjp2KQeVj4+zPtdLqbVPLuO37YzNuz3WxbvRBKRkazwGdvdwowEM6vz3jJ5ua3j7414ZA0EcCkRFQGYCmAIEX2i3kgIMVEIkS+EyM/NdV5PPKN4efZWAMC+Y/6x4yUn/PtUJipNs9KRUy8dI073dIfv26Zh0O0TJe1cJhKLXBnKaWelFClFpcbHx7dvXBstZedIsqgVuRBivBCilRAiD8DVAH4RQlwftWRxSpnUb1KrlOh9U9dg495jsRbJlsh1pAd3bYKiCSPRqWn9oNtrNWuOFw4er0DeuB99rplIJn6VVvxP6+PvLebZnzyxFl/debamCy4Szmhba0CUx7MiZ0ITjpIZ+frvJkriHML1b8ZzH8pCDUszgn4JPhb5f+Zui0YkW5NdJ1XTBRcJyoennZt+G6rIhRDzhRAXG3nMeCCerUWzuH1Qu7C2V6dnxxNa/vBofeSAvV0F0WBkItDoPi29n+1caZMt8hhAPIsZFhf3bI7+7XPC2ueUja2laNHSS5Eo8qyMVNwzpKP3ezyN2SdLajsnJRuQb9AiOwMA0K9dI++y8ir7zsOwIo8BU5cnZsf3SKkfQQbnobL4jcHX8odHanQOUDwga+KoCfNbisJpkbid1Ey/+xx8eccAH+t+0z77zl+xIo8BkxcWWS2Cbdm09zjyxv3os+zREd107fvxbf1wxRmtAAD3frbacNnsgpbSPjOvkf9CPcdSHKzfc/FTrM3oIJzc+uk4M6+RjyL/+1frbeuOSvDiFYzVLCwo9X7Oy8lEv3aNUD9DXx2RQZ1ykZdTF9NWFpslni1Q+3xTkwn/HHVaRMeK12qSyqbdRoZWqt00ldVuQzNGjYItcsZSlPfJ6D6t8OIVvcLaPz2l9hI2I47YDqj94aP7tER6SmTKxMiJQDuhTOo0UtGqm3lU2zSRihW5jbnq3cVWi2A6szbWxjNHUq1OqdDO//d8LJIs/LEfrcDHi4uiFc+WPD2qR8T7pqgcyPHiJ5f1eN20ZDTNyjDtPNU2HS9W5DZmqapbTjyyvKi2ymMkkRjpqb6X8BJpzGZvOoAnpm+MTjiboK4hEo3FqbbIT9nU5xsucpf7vm2DZwBHi13ruLMiZ2yD3tKqyqiWNFXiUGWcKCYlRk7kqd96KqQQxB2lZdh5yLmuqVhV5a2yqUXOk50WctfgDnC5gXd+3W61KLZA72vrgkcGexOA1D5Mu0YVRIORtcPVFvm+YxVokpWBwf+eDwAomjDSsHPFEnmMzFbodq3pwxa5hQw/rRnG/akr3rqur9Wi2IIanaVVG2SmoVXDTO/3167u7f384eKdWrs4GmGgdlJHrYx6c6Fhx7YS4f1tria3a70VVuQWUTRhJHq2agDA3z2g5HBZVcLULo/U8gw2fvGA0rWSlhLd35oIUStmcuU79gxAiO87wCEM6drE+/m/inZcAND3mTnatcvjkEjvRaV7RU6tjifcCk0+76HzozqWOmolXojH1nXhEJ//VZvw9vztOP3JWSG3UyqiS3q18FtfdOiUoXLZFXeEs3rKpI3qOKuz7XYLvKFIP2/ZILqqfloWeaTjbidkt5xe91w4OMH1yYrcRF6YuQUn4rgqn9EYYVWVnKjEnqPlBkhjD5buOIwFf5SG3lAnWoq8wsZV/fTSppFnzuRPPZoZfuxOTeoZfkyj4agVkzh6KrwiTp+P7Z8QceMy5VUuvPqzb03sSPW4erfRcTKBBxg70RmICpvGRodD4/ppAOrjprPzDD+2umiZECKixh5mwha5STzzw+awtj+rfQ7uHdrJJGnsx6QFhXj310KfZS6DlNbBOGqZlxrl5KaajFT/4207cMLQc1hBdY1Aw7qppihY9VvMy7Pt15SDFblJxGM8s5FoxYxHbJHH6URXWWUNrpm4xNBjZqalYOG4IVjwyGDvsqsNPocVVLvdYXeV0ovaG/XVKvsVaWNFbhIdcuv6fL+ge1OLJLEpGpZTpAo5PtU48HtBKWpMmIhs2aAOWjfKxLs3nOG3br9GU3C7s+doOVbvOmqiIvdPorIbrMhNQpmwAniiUZY9NhQ3n52HXq2yLZLKPmi9APdpE1mdjDg1yE0vOVtHo2ZL/+edV6P8r5+sBOCZ6DYDdfYwYL+3QJ7sNAm1vzclidCkfgaeujSyOtLxhtogXzRuCFpEHFpnr5vKKLQUiJHYsa52JFRLIYcnKqpNOb5W67jj5TXIztRXNz8WsEVuEupX4ngt6G8UkStx47vD2AWzr5lGde2jiKJBroBZZlIPUq1JeLul6ketyImoNRHNI6JNRLSRiO4zQjCno06yiNfU6EgxsiG1zd5yDSOSsr7h0LFJfc3lTksQWr3rKABtd50RNK6X5rfMborcCNdKDYAHhRCriKg+gJVENEcIscmAYzsWtUUe7b1x9FQV0lOSUSctPl6HjdRRwQolud3CdBeFWShblt01uAPuGRKb8FSXEEgyTS0ai3KMzHruaXVjKjfJ+o+UqC1yIcQ+IcQq6fMJAJsBtIz2uE5HbdW4omwR1fvpORj9Vvwkuhh5zwWzyPccLXdsne0axTVz04C8mPm0jex5aTafLdtlyXntlg1rqI+ciPIA9AGwVGPdWCJaQUQrSkpKjDytLflsue8FVh1GDYgHhnXGbee0wx3ndfBZvmX/CawvPmaIfFZjpPU0oEMOMtOS8e1dA/3WDXpxHs57ab5xJ4shymsm2qqH4WBGyKNZhJtBbRQV8WaRyxBRPQBfAbhfCHFcvV4IMVEIkS+EyM/NzTXqtLalsMTXCgzHyrlvWCc8cXF3TZfBJW/8jpITldi8z2+IHYWRGXiN66Vj09MXoXfrBoYd0w4oC0CZFSM9uIv/vegyofCUWSgfcGbOlVyoygPZf9xeseSGXB1ElAqPEp8ihPjaiGM6Ga2GtpE0bQ10YQ59eT7+lCClbRMZpWvFLIv82rPaBj2v3THrAafm5TG9fL7/7Yu1MTmvXoyIWiEA7wHYLIR4JXqRnM20lcXo+NgMv+X5eY3CPlagpIPjFc6vqBjrmkN2S+DQQ1ll7eu7WaGIWked+FshKqpdjhizJvVr68+bKa1W1Nlxk+LWI8GIx9lAADcAGEJEa6SfEQYc15H8uG6v37Iv7xiAdo3ramwdHAe5KsOiotqFF2duNeXY153VJsA5nWNlypSerM1UjGW1vXd/K0TXJ2Zi8sKimJ0zUmL19qClyHs+NTsm59aDEVErvwshSAjRUwjRW/r5yQjhnIjWK3CkUQDxWp+l+Ih5jTL+dVkP7Hje346odpC7QOaQpMh7mej7D1YD/ts1e0w7r1Eo5xGeH326aefRyu60E5zZaTBpGjGnkVZC7N8+J1pxbEl5lXlKlYg0rddqm3Y/D0bpySq0b1wX0zWicYwimImxrvgYZm3cb9q5jUBpJA0z0fAJlND35PQNWF5kfR8BVuQGMG1lMeZtOQgASNewyLmkrS9WNJN2UkidzKGySuRoZBUaiZYfXKmzXpv7h6nnj5ZY/V8DubY+XLwTN763LCYyBIMVuQE89OVa3PLBcgDarpUGmebcjE6YjNLCitDJKgda5OXVbmSmmVvXTusS6tIsy/vZ7r2ao020M4K66dbXHrT5v8l5KC3ysee2x6f/d5ZpLhIHGpkArKnnHCj8c/+xCizablxPTCOprjGvWYKM1jWkHCs7+4bXFR/FE9M3Wi0GujTz7+n527aSmKbxsyI3GKVF/vDwLji7Q2PTzvXDur2oqHbhx3X7TDuHGRwui302XqBX8BGvL8C1//NLRPbjP3O2YePe2GbVVrvcSEsxV5FqJZ0VHDzp/Wy33pRKpq/xjxCzAnXW9vaSk7jx/WV49Jv1MZOBFbmBzNq436cPpZHW1OV9/MvX3Dd1DSbM2IK7Pl1lW6tSiyOnfONvT29pfKONx0d28/l+QJGJN2P9Puw7Vg5A30OlxuXGaz//gdFvLjJWyBBUu9xIMdm3IbtWAsWp27lqp11EU4pR43Jj7qYDAGLbC5UVuYH85eOVph37X6N7aC4vLPWUAjh6yj7JCYFwuQXGfbUOq3Yd8S5bPH6IKVEZtwxs5/P9hveWoayyBm63wJ1TVuHKdxbrPpZcjzrWGY/VLhED14rnb0tJDqDIbWyRm13mNxyOnapG32fm4KbJy/D8jC0AYlt8jBW5Q8hMS8HnY/v7Lf9tm6cAmROiMgpLTmLq8t0+VnByEplSZlbLkjxZWeNVXMVHynUfS74hzR7hgoMnvRFO5VUuHK+oNt21IpMawPK3ka7045DiOlrx+DDL5Fi64zB6PT0bh8uqsLDgkHd5LO9JVuQ2J6dubcRL7zaBE0Oc0AxAq5Z6LK2qs5772Wtdh3PaWFhW5VUuDHvlVzzw+RoAQLd/zMSJihps3mfu63koizzQcjswbWVtN/vG9dItlEQbl1tg0fZSvDRrC/LG/YgNe8ybY7E+boYJyq+PDPYms6SnJGN0n5b4ZrV/xp0TakhrhbqZqcjTkpNQpYpWkWUI56xei9zEIZYt8UXbD/ksX7P7qHknhcJHHsCFs7DgELbuP4EuzbS7CSUiTeqn46CORs8ut/CZSP9h3T70MGE+CGCL3PbUS09BQ4VVfuf5HTS3c4Ii10oHN3PCSstbII+TWwDztx7UdRzl2B4uq8KmvcbHwctniLUrQ/7TUoP8I4a/+luMpHEGSx8dqmu7XYd9S1GYmRjIitxhBIoicIKPXOthY2TvTjXT7jgb53X2rbf91ara1/GHvtRXilQp98jXF2DE68aXEJaTu5KI0P+5n73L37n+DMPPpWRo1yZo37gu7g6jjdzCglKsVkxYK5m9cT9OVjq/OqcWM+8fhCXjh0Ycknm8vBp7jpabotBZkTuMQGFi7/62HTtKy2xR9yEQWhY5mXgF9miZjScu9g1D/GjxTu/nSkVFRGWW7Pu/78AzP9S2nFU+JM1KZpJPkUS+TQuyMsz1fjasm4ZfHjofXZvrc53M2rgf101aitFv+YdiFhw8ibEfr8T4r9dj+po9WLkzNtdirN5iujbLQrPsjNAbBuDr1XswcMIvWFx4KPTGYcI+chN4etRpGNyliSnHDuTL3HnoFAb/ez4AoPC5EbZsOKyVXGl2lQF1+J7Pw4SUy4EalwtCAE9LSvyJi7sDiI3bqlYu3/9baoxavOkJM6yodmmG2C4pPIR5Ww56q3XuOXIK9031TNoWTRhprKAa2DlEUgutekzR4nhFXlHtwhcrduP6s9raRnm1zamL1o0yTTm2ngYDVS43MpJi06g3HJQKMSM1CRXVbtPrxagVeZnitV85km4h0OXxmZrHiIUil61+9b/XrIYSavQk/pzSSDmvdrlx9cQlAIDzpLZxseraI2OX+14v6RoVUqPF8a6V/8zZhn9M34ifNliTppEcmZkAAB6USURBVK6liMy8rPTccOpIDbugrHo4/a5z8OiIrqYVFJNRh88dOF4bbaBMrd6oMYEpK3BXDIqTyeGj6miIWCvFYJSpfN+e8LpaN4E8nrFsFA1Yk+H58W39It7XDIvcPldJhJRIF36lRgeYeVsPetebRaznGAMlbiiZsmSXtynB8YpqjP1ohU+3GZm5mw7gp/WxeQD+uG4frpCyKU9rkYXOTeth7LnaEThGkhZEEZYrJp0ue3Oh3/ovVuwGoG2RG/0mEWiyOlYFLuW2Ze1ztTtZ/bLlAMpU5Yffnl/g49aolMZzwR+xLRfx1CWnxfR8ADCoU+QN5BNekde43Ni63zdBotqtndBQ43LjlsnLcd2kJebKpJG2bWZsdLKOBI0XZnrqrwDA1GW7MHvTAbwzf7vfdrd/tAJ/nbLKcBm1mKcI9XtoeJeYFWOKxjo8ISm3Go2u8kY/wAO5b2LV2ahbsyykJSfh2cu0u+xMmLHFr7jYgj9KfUI8Y/kmqHyQDu3mrE5aCe9aeWn2Vgx/9TdsL/FUZxNCYO9RT6q1uriQ/DpcWFJmqkzqymeAubPoen2mehIWYsXqXUd8svBiOTkVzU3zzeq9KDlRqRluaLTfPNDxtB4iZtCwbhq2PfsnDOigXXI5u06qX4GxpTsO+yh3LR+6WVz+dm3UjNUu8jH5rcLaPj01wS3y1bs8WW6yu+Tz5buxcqcnnlW2yDfuPYbiI6di9koa64YFkU5+qXWnGUktgVCHqsXSGxVN9b7N+47jW40sWiB2irxHyyzN5WZy12B/l5eev/eRaevMEEcTWRcA1hXPev/mfHx5x4CwywNkZaQaLoshipyILiKirURUQETjjDimFvI9KddLXqiYaCF4su5Gvv47znlhnvfCM/N/PGlBIfo+M8dveSwmO8ee2x4jT28ecnv5gXaqyoU7P1mJjXuP4W+fr/GxMg+drMR9U1fHLJHDSf0zA10/Rk+AqhXlVfmtUTRhpOkdgrR4eHhX7+fsOh6ls2qXuaUCosGqqJUhXZvizLxGaBjmhL1WzaFoifoqIaJkAG8CuABAMYDlRPSdEGJT8D3DR1Zij3+7Adf3b+tTKOrpHzb5VLSLhWtBmVzig4nXFRF5Y3OPV1TjR52TlVOW7gIAzNjg30z3rfnbMX3NXpzeMhu3D2pvnLABsGtUjRaBrL135m/HQ8O7GHYe9YPBCZm6dsFq18rlfVvi2Z82WyqDERZ5PwAFQohCIUQVgKkARhlwXD/UN5VyolFdlvSm9z0NUc1MAQ80kWbmOZUEc7MUlpShxuXW5caQZ9Fj1bnHSf0zA1nkb8wrMPQ86t6TR0/FvouSkqlj++O9m/Jt2RdWPTZWN7/IsUHlRSMUeUsAuxXfi6VlPhDRWCJaQUQrSkpKIjqRUpG73QKzNh4IuK23YI2J/+NAYUSxctmF8g12fGwGJkhF7oMhPxDe0ohsMQL1fRZrRZ7ftiH+cm5kbxr//N7wF0tN1C8pO0rNnaQPRf/2OYZEg5hRuvXVuX/4fLdTgwmriNlkpxBiohAiXwiRn5sbWQymUiEs3WF9TZHAFnlsMCrr7/VfjLUu1ajLCsS6JOq0O8/G+BHdQm8YJkbWgFeHsZZoxP07kY17jxn+4Fa/ObIeN0aR7wHQWvG9lbTMcJRP3oUF1veo1Eo2uaRXC/Rp0zAm5zf6lbJ5FAWBgqEukdqrdeAGGU7CyBhv9aGeG60dz+00/v7VenR+fIahHeW/W+vbdFlPkpwV9GplTu1xLYwYgeUAOhFROyJKA3A1gO8MOG5Q9Pooq2rcfmUjCw6exMeLi6KW4ZCGT/m/1/SJWYqy0Uk14c6+60VpkQfKHHQiWjkEkaK2yC/p1cKwY0fD/27Mx9Cu0ReAO2ySz//lK3vZotZK//aNNJZpx+SbQdQaRwhRA+BuALMAbAbwhRBiY7TH1eJEhOFxcriizMX/XYAnpkcv4r6j+vs+OgGzprWULqCv7jjbpLOE5qs7Bxh6PDmMcuaGfbjszYVRTQxqlfi1A2e1z8G7NwSviX7/sNC1zM0q/hXrui6B0KqJk1XH+HjxQBgyCkKIn4QQnYUQHYQQzxpxTC0qI/S1qdP3K6S6LNH6OBvVM7fgU6wxK0JBOf56SgyYxRlt/a0mLfTE5wPA3M2eyfY7p6zCmt1Ho0oSilUGZySEcuG1z60X8hjKhh5GYpeiYlpvx/83qD3+dVmPmJzfHqOgk/M6NY5ov0DWQLRJHTEqgxGUl6/sZdixth04gZdmbcH64mP4Yd1efLFit7dwVDQoyyc4oXb0fTosTAD4RIrNly+jaOwCu1rkQGgXXrA2cTIvztxqlDi+57ZJc+hnL+uBUb1b4NWrenuXpaUk4fr+bX22M6tRiLPqkUeoBALdIy63QGoUSVZ2uPn+fEYrDO7aBOuKj+LmycujOpZbAG/O245JC3b4vP2MyW8dZK/QKC1VJ4SK6ZVx7e6jPqV5o7kenJgAdEH3pujTpkHAZidmsFvVB7PaJsllrRtl4rWr+wAA7v98jeY2q564wLQHj6MUeaSuEHlSavbG/Sg6VBufG229DOX+r4zphbIYFg1S0qhuGgZ2jOxtRQu1C8vlFlFFyCiVndUBBvcO6Rgy3DLY33rv0E54/efaOOZxX633fo5Uka8rPoq7P10d0b5W8sCwzujeIsunsqXZqLOCKzTKV1vN2icv1NRVjeqa54p1lGvFLUREkyaywh378Uo891Ntgky4VtDuw6d8qvgpb9y+bRriBtVrVCwx01d4vLw67H1enLkF+f+aC8A33NBq18rfLuyC7+4e6Lf8xSt6YmDHHHRpWj9gyveTl3THaS18i1gpQ+EmLdiBmggsxDmbAie22Rn5oZweQ4tc/aysrLHGeApGdp1UNFQo7UwTaquocZQidwmBpCRCvzx9k1YyWjXDgfAt8tFvLcRDX671Tgoq989x4MTnn/vqK78ZSTGtt+Zv9zazUPpYrU6nBoCerRpg+Gm+WYtpyUmYcnt/zHrgXJ+bUMktA9sFTfZ6Zc62iCb1YtFKzkz6tWuEWwbmxeRc6rE6r7M5vXGNZNG4IVgyfqip53CUIne7BZKJcGWY9X9r3ELzVSfcG6j0pCcWtvhIOY6dqkaNW+Dms/NQNGEk6ptQmjJcdjw/QrdyDgd1HH44LCwoxW/baksyxKqhRCjeuf4MPDu6BwZLfSaVkTVZGalY+fgwzf1CXTGRZDE60T8O1FrHKclJeFJHlx4jHljKY6x64oKoutrHigaZaabL6SxFLjwWXbixo0WlZfjbF/4TEC63QEW1C/+Zsy2sV7RBL87D0Ffmex4sNrAwZYgIGWEUrRc6I8fLNRT5qaoadHtiJuaGcAtcN2lp0PVWQUS47qy2yJfe7tqommVH6s+MJK7ZzqGHwQg0JXB6S+2MRiMmJv84WNshzGo3nZ1wlCJ3uQWIAhermv/Q+ZrLH562Dt+u2eu33CUE3v21EK/9/Ac+WbIrLFlKT1bBJeylyAHg5rPzkF0nFY9cFLrEqt75hm0HTvotKyo9hfJqF/49W39Y2cJxQ3RvGyvuPK8DZt4/CD1b+ZYNCPTmEGo+MxJFrq58aGe++MsA1Ev3xEgEmtx9eYx2SKy6TWMk3De11iAjR2kvc3HUUAhJcapTyQd1aoyiCSOR1zi89G+XS3gjKiJ5JXa77RdO16lpfax98kK0bFAHADAgSJrwuD/pKyT10Jdr8fxPm3EkijK3P9xzjlcmO5GUROjaTLsLz+Sbzwz7eGnJ4U9sVTvItZJTLw1tczI118mGQaB7YpRGg+toYIu8FkeFH7qEQBIR+rWrnex88ILOuKpfbZxzekqS7gzQGndtve5IDGuPRR7+frFATsIJ9sag131QPz0F7/5WiIMnKvEfRcJDOPQI8LptZwZ3bYK0lCRU1bgx4XJ9RawimU9wqVwrDTOtn28JREoSedM51Ab59/ecg1+2HIxZowe7GVFWYlM1pM3dgzth6tj+Pq+99wzthCb1aycSFo0bolmVUAu3qJ0EjeSacEmTr3ZEHoLUZMKXdwyIKoZVfjDaMdTLbOTQusHewlHBrecHv1wbdqW/ghJf19W0O62rRxOKplkZXgWqdq10a56FuwZ3RHo0WXZhYHVOgp1w1FA0y85A56bBa1nn1EtHq4b6XuGnLN2lsMgjU8jJNr2aZLkEgDPzGqFBFAV85CQMpStX70Sp00mVfN7hXB0nKsOLu5cbiANAs6wMW7qgZDJSk9GkvqcjTqDchZYN6uClK3ripgHm5lWwRV6Lo1wretGrYiYvLNJdICkQ9nWteC5y2WhSl/qU+36GQ6IobyVySnU4IYLRNFJY8qi58cZG8PKVvfHThn3o3kJ7bgEArsxvjX3HKkyVw65vw1YQl4o8HOTmxZE+3e1QC1kLrx9T+m6EmMo36Vj1JbWarIxUHDhe6Q0R1JOFH03cvV35x8Xd0VJ6083OTMU1/dpYLBF3BlJiU3syOuTMy18ePE/3PkpFN2vjfvz+h74ORE6xCox4DXVQcIVhvH/zmbh3aCe0buRRYgM7NUbbnExMv2sgpt2hXd/8VBg+cmXp4PomVcYzglvPaYfhpzULax+tK+6dXz19YTs99hNemhW6n6zMxr3HsGX/cd/jO+TeiwX2vXJCMOO+QQHjWOWl4fyj5W1/3VaCv3y8EoA+90Msi8eHQ4Y04VQhKRVjLvrE0+StG2Xibxd09n7PykjFrw8PBuDfO1ImnMnOokO11fwSwec7YcYWzN10ANUugTfnbcfDw7vq2m/k67/7fP/6r/adELYCx1rk3Zpn4bQW2iFtsn4neKzyDjraiz353UZ8u3oPbnp/md+6vHE/YvzX6zX2QsjJV6toIIWwHZFabAXy5d87VF/tbUCfWyGRCBQJFE6de+VEp029dBGTGiA5aoXib46Em8/OQ98Y9cV1Co5V5Hog8nQv+fnB83Vt/8i0dQHXfbZMO/Mzt156JKKZTvNsjytgUCdPLZEbB+Rpbqe0NkOxYe+xqOVKBO78ZJXuUM0cxcMg3lwFNw3Iw60D2+Ebg63njBiFNzqJuFTkr1/TB8O6NQ07jEtd63jnoTLc+cnKoPs0rGtP10p2nVQse2woHh3heXUdk986okgVJQeOV+Kf3/v3Ot1ztBwfLNzh/X7slG/43WtXR5ZE5FSOlVdj8fZDurYtLK2tjx9vFnmdtGT845Lu6NOmIR4boS+LWA/h1BNKFOJyRHq3boBJN+VH3bnkmR82Y8aG/UG3yUyz7zRDk/oZmmMwqnfkHdonLywC4BuKeMvkZXjq+004eMITbvb3r2rfbDo1qRd1iKcT0dNkYvqaPXjmh00APNmcr0sdZuKRJlnGvbnapU+nnYhqRIjoJSLaQkTriOgbImoQei9reXh46GJSMnr669mtaFYoNj09HK+M8bWQ5zxwrk+zhVsHtgt5HKWekotqyQlDJVIdcgB4bGS3mLYCizWTbszXnGeo1lHRcPWuo97Pn43tj7MN7PJkN/RmW+vBLn067US0ozsHQA8hRE8A2wCMj14kc5ErJ+pxR9ZNt6+1HSmZaSl+D59OTeujZ6sGyK2fjivPaOWtnHhu51zNY/yy5QDGfrQCAHCiwr9npTKkLt78vmqGdW+Kv57fwW95tcuNb1YXI2/cj36uJgA4/6V5+GBRkfd7JJ2vnITabRkNds2mtpKoNJUQYrbi6xIAV0Qnjnl8PrY/lhQe9n7v3y4HiwuD+zGdZm1Hy/LHapsp/HjvOcjLqYvTnpzlt92tH6zwft5ztNz7eUdpGYqPlGOVwtJMhCHUuk6qatx473fPvMGuw6dweqZvhJUy7BCI/9BDPW8oemGL3B8jH223ApgRaCURjSWiFUS0oqSkJNBmpnFW+xzcN6z2Fbh7iyxMviX8MqWJwmktslE3PSUsV9R1k5ZizLuLfZbFu4ICtJPCql3u2jBYHUOQEudWZu/W2l7XI2VVeHv+9rCyYeN9rCIh5IgQ0Vwi2qDxM0qxzWMAagBMCXQcIcREIUS+ECI/N1f7lT3WZKQED2NKBCUUimizDRNhBLXKNIRbbyXedVPHJvU0m530f/5nvDBzC576zj8aKhDx7oaKhJB3qRBCu3mhBBHdDOBiAEOFsH/KSK5Uua1pVjrSQ4Qxva8IqVPTLCsD0+7UTtGOJ6K9ZTLjcJ5BDxXVbmza55tSLoQIOGeQCFamlgKWSySHk6OQwq4VP6K6y4joIgCPADhPCHEq1PZ24JKeLUBEOL9LLooPl4feIQANMlPRqqF2p5S4Isq3kkCv1PHOy3NqW+DJE8JDXv4V9dJT8P095/htnwB6POgbbjhzofEcBRUp0ZpLbwBIBzBHsjSWCCHuiFoqE0lKIlzayxNH3TIKHaMnTjgeYNsnMiqqazXTNf9bglev6o0diuQfNYlgkQe7ZcLpW8quFX+ijVrpaJQgVpCdmYrfHh6Mc1+aF/6+Ni2WZTQ8TWAMCwuCV9N0ShXNaAhm/ASq915V48ZmlYvqvABhsYlM/JsBIWgToJFsKN68tq/Bktibq89sHXojJiCh/LrJCeD3DVYG2SWtPFFRjUXbax96z/64yadp82W9W8Rlfke0JLwiB6C7NZzMHed1QJOsjNAbxgGyX1OI2oqKennj2vhNOQ8XpX/4eIV/glAiWeRamcNy4467P12Na/+31FsieE2x7yQoF8zShhU5gN//PgSz7j9XdwJQuIrfycgjIiDwWpi1QC7uGXlNF6dx9+COeHrUaQHXK/V0z6dm+61PC1DyNZ6QxyA1xf8+k5W87EapqnFj2Y7DWLv7qM926QkwTpHAoyLRpVl9ZOi8SBKpFrK3ZZwIr17GsG5NQm8URzw0vEtUD65EyCKWrx+tGHvZXVLbFAZ4+gf/2PJ0tsg1YWeTgrrpKSgL0t1lQPscfHL7WQlx08nIvTkFgP7tG+naZ84D56KTTRtumIkrEXvhhYFsTWsp8mHdmvp8r3ELTXcTW+Ta8KgoGNa9adD1RIlhOfmgsMiJCM1CzA10bFIvIZU44IlkCpQJW2NgrRGnkhZEkdcWXPN8X7f7KIqP+Od5sI9cG1bkCv55aa2P84y2/u6TBAkd90HpIwc8NUSCkchWaVpKEtY/NVxz3dTluwPud1GYTY2dityasV87/ze72uvG8/vOKatwSKMnKlvk2vCoKFAWrNdKOhAJ2HzYm1Iu/emhaoiEUvSMP+/ccIbVIsSEHi2zseyxobjijFZ+6/QaAOwj14YVeQCUl9WA9jkAgEt7tbRGGAtRP84qQyhqdiHUMvlmrq6ppkn9DM16M8ra7MFgi1wbnuwMwBV9W2HZDk/98s/G9ke1y52QqcHya/CfJStKtsgn33Im0pKTcN2kpT7b14SRah2vjOrdAr9uK0F2mHH3iU61y43Sk/7uFCWsyLVhRR6AMWe2xmV9WtbGviZooZ7WjTI1mzYPaJ+jOfFkZAMBpyLH28uGAKMPZa/XQPBkpzaJqZ10kpaSlLAKPBB10zw3ktoyeu+mfDTLysCrV/XW2i0hyamXZrUIjuLrVXtCbtOrVWJW0wwFW+RMWHx/zzlYufOIn59zaLemGNotePhmotEhtx4Wjx+CAc//EnCbGfcNiqFE9uHNa/uisOQkXp6zLaz9mmUnRmmMcGFzkwmL9rn1cGU+F9DSS70QBZ5aJlC5ByUjezb3zrsw0cMWORMVk27MR/MGbCUFIlQCWThlD+INvZ1+iBIzhyMcEvcqYgxhWPem3kQPxh+trjgf3doPdaRJu0SMhJJJ5IeY0bBFzjAmotTjr4zphfJqFwZ1aoxv7xqIeVsPJnTbMr1/OwEJmIoXHqzIGcZElIWfsjJScXlfj1+4S7P66NIsMWvSyKTqdq2wbyUUrMhVnN4yG20j7BrEMGqUrpWEK7gWglSdfUoTpT9uNLAiV6HV4ZxhIkXpWkmAJkBhkaR4sGWkJvk0rFZyQbemmL3pQKzEciSGOOiI6EEiEkTU2IjjMUy8oIy316oxwgRnYMccvH5NH1zfvw0+vq2f1eLYlqgtciJqDeBCALuiF4dh4hdW4+HTIrsOMlKT8a/LTrdaFFtjhEX+HwCPgCeWGUaTszt4qmdqhSIyHu4f1llzOQ+ZPqKyyIloFIA9Qoi1oV4biWgsgLEA0KZNm2hOyzCOQp6rY6Xkz+Sbz0Re47po17guzu+Si4teXeCznh9++gipyIloLgCtFiaPAXgUHrdKSIQQEwFMBID8/Hy23pmEIbd+OgCgThpX7lMzuGttk+4UjSgWnlfQR0hFLoQYprWciE4H0A6AbI23ArCKiPoJIfYbKiXDOJhnR/fA2R1y0Kc1V+4LhlZ0Jkds6iNi14oQYj0A7+OUiIoA5AshSg2Qi2HihvoZqbi6H7sTQ6EVZ88GuT4SNz+YYRhbofSHN8vK8FvGBMYwRS6EyGNrnGGYSJEt8g65dfHiFT0BAN2bZ1kpkmPgzE6GYWxB8+wMPHRhZ1zaqyXa5GRixn2D0DXB69HohRU5wzC2gIhw95BO3u/d2BrXDfvIGYZhHA4rcoZhGIfDipxhGMbhsCJnGIZxOKzIGYZhHA4rcoZhGIfDipxhGMbhsCJnGIZxOCQsaGxKRCUAdka4e2MAdi8FYHcZ7S4fwDIagd3lA+wvo93kayuEyFUvtESRRwMRrRBC5FstRzDsLqPd5QNYRiOwu3yA/WW0u3wy7FphGIZxOKzIGYZhHI4TFflEqwXQgd1ltLt8AMtoBHaXD7C/jHaXD4ADfeQMwzCML060yBmGYRgFrMgZhmEcjqMUORFdRERbiaiAiMZZJENrIppHRJuIaCMR3Sctb0REc4joD+l3Q2k5EdHrkszriKhvjORMJqLVRPSD9L0dES2V5PiciNKk5enS9wJpfV6M5GtARNOIaAsRbSaiATYcwwek//EGIvqMiDKsHkciep+IDhLRBsWysMeNiG6Stv+DiG4yWb6XpP/zOiL6hogaKNaNl+TbSkTDFctNu9e1ZFSse5CIBBE1lr7HfAwjQgjhiB8AyQC2A2gPIA3AWgDdLZCjOYC+0uf6ALYB6A7gRQDjpOXjALwgfR4BYAYAAtAfwNIYyfk3AJ8C+EH6/gWAq6XP7wC4U/r8VwDvSJ+vBvB5jOT7EMDt0uc0AA3sNIYAWgLYAaCOYvxutnocAZwLoC+ADYplYY0bgEYACqXfDaXPDU2U70IAKdLnFxTydZfu43QA7aT7O9nse11LRml5awCz4ElWbGzVGEb0N1l14ggGfwCAWYrv4wGMt4Fc0wFcAGArgObSsuYAtkqf3wVwjWJ773YmytQKwM8AhgD4QboISxU3k3cspQt3gPQ5RdqOTJYvW1KSpFpupzFsCWC3dKOmSOM43A7jCCBPpSjDGjcA1wB4V7HcZzuj5VOtGw1givTZ5x6WxzAW97qWjACmAegFoAi1itySMQz3x0muFfnGkimWllmG9PrcB8BSAE2FEPukVfsBNJU+WyH3qwAeAeCWvucAOCqEqNGQwSuftP6YtL2ZtANQAmCy5P6ZRER1YaMxFELsAfBvALsA7INnXFbCXuMoE+64WXkv3QqPhYsgcsRcPiIaBWCPEGKtapVtZAyGkxS5rSCiegC+AnC/EOK4cp3wPKItieskoosBHBRCrLTi/DpJgefV9m0hRB8AZfC4BLxYOYYAIPmZR8Hz0GkBoC6Ai6ySRy9Wj1swiOgxADUAplgtixIiygTwKIB/WC1LpDhJke+Bx4cl00paFnOIKBUeJT5FCPG1tPgAETWX1jcHcFBaHmu5BwK4lIiKAEyFx73yGoAGRJSiIYNXPml9NoBDJsoHeKyXYiHEUun7NHgUu13GEACGAdghhCgRQlQD+BqesbXTOMqEO24xH08iuhnAxQCukx42dpKvAzwP7LXSfdMKwCoiamYjGYPiJEW+HEAnKWogDZ4Jpe9iLQQREYD3AGwWQryiWPUdAHnm+iZ4fOfy8hul2e/+AI4pXoMNRwgxXgjRSgiRB88Y/SKEuA7APABXBJBPlvsKaXtTLTohxH4Au4moi7RoKIBNsMkYSuwC0J+IMqX/uSyjbcZRQbjjNgvAhUTUUHrzuFBaZgpEdBE8rr5LhRCnVHJfLUX8tAPQCcAyxPheF0KsF0I0EULkSfdNMTwBDfthkzEMiVXO+QgnKEbAEyWyHcBjFslwDjyvrusArJF+RsDjD/0ZwB8A5gJoJG1PAN6UZF4PID+Gsp6P2qiV9vDcJAUAvgSQLi3PkL4XSOvbx0i23gBWSOP4LTwz/7YaQwD/BLAFwAYAH8MTXWHpOAL4DB6ffTU8Cue2SMYNHl91gfRzi8nyFcDjT5bvl3cU2z8mybcVwJ8Uy02717VkVK0vQu1kZ8zHMJIfTtFnGIZxOE5yrTAMwzAasCJnGIZxOKzIGYZhHA4rcoZhGIfDipxhGMbhsCJnGIZxOKzIGYZhHM7/A8GjOFC0qWUfAAAAAElFTkSuQmCC\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "plt.plot(df_simpleseries['yt'])\n", "plt.title('Timeseries yt');" ] } ], "metadata": { "kernelspec": { "display_name": "Python 3", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.7.7" }, "toc": { "base_numbering": 1, "nav_menu": {}, "number_sections": true, "sideBar": true, "skip_h1_title": false, "title_cell": "Table of Contents", "title_sidebar": "Contents", "toc_cell": false, "toc_position": {}, "toc_section_display": true, "toc_window_display": true } }, "nbformat": 4, "nbformat_minor": 4 }