Kaggle: Introduction to Manual Feature Engineering Part 1

https://www.kaggle.com/willkoehrsen/introduction-to-manual-feature-engineering

I thought I should make a note of what I was doing. Qiita says that memos are OK, so let's make a note here.

It's complicated to think that this notebook combines various tables, but

Just group by with client_id and count, and add that line to application_train to increase the feature amount, It's easy to think of it.

However, I feel that this KDE plot is useful in many places. Of course I've put out this kind of graph, but it's nice to have a name. For a common understanding.

Flattening the multi-level index to one level seems to be useful somewhere other than machine learning.

The continuation Function for Numeric Aggregations https://www.kaggle.com/willkoehrsen/introduction-to-manual-feature-engineering#Function-for-Numeric-Aggregations From.

There is such a thing in python in the first place.

If you pass the list, it will sort. sorted([1,2,4,4,1,1,2,5,555,230]) [1, 1, 1, 2, 2, 4, 4, 5, 230, 555]

Functions can also be used for sorting, so new_corrs = sorted(new_corrs, key = lambda x: abs(x[1]), reverse = True) Sort by absolute value of x 1 (in reverse order).

Learned English words

--institution institution --handle with A A ―― by pure chance Just by chance, just by chance

I just remembered it or forgot it.

Recommended Posts

Kaggle: Introduction to Manual Feature Engineering Part 1
Introduction to PyQt4 Part 1
Kaggle House Prices ① ~ Feature Engineering ~
Introduction to Ansible Part ③'Inventory'
Introduction to Ansible Part ④'Variable'
Introduction to Ansible Part 2'Basic Grammar'
Introduction to Python Hands On Part 1
Web-WF Python Tornado Part 3 (Introduction to Openpyexcel)
Introduction to MQTT (Introduction)
Introduction to Scrapy (1)
Introduction to Scrapy (3)
Introduction to Supervisor
[Introduction to cx_Oracle] (Part 3) Basics of Table Reference
Introduction to Scrapy (2)
[Linux] Introduction to Linux
[Introduction to cx_Oracle] (Part 7) Handling of bind variables
Introduction to discord.py
Day 66 [Introduction to Kaggle] The easiest Titanic forecast
Kaggle House Prices ① ~ Feature Engineering ~
Note
Feature Engineering Traveling with Pokemon-Category Variables-
Note
Note
Feature Engineering Traveling with Pokemon-Numerical Edition-
Predicting Credit Card Defaults Feature Engineering
5th Feature Engineering for Machine Learning-Feature Selection
Kaggle: Introduction to Manual Feature Engineering Part 1
[Introduction to cx_Oracle] (Part 11) Basics of PL / SQL Execution
Introduction to Financial Engineering [Investment science] Chapter 2 Exercises Self-answer
[Introduction to cx_Oracle] (Part 13) Connection using connection pool (client side)
Introduction to Python numpy pandas matplotlib (~ towards B3 ~ part2)
Kaggle Summary: Redhat (Part 1)
Kaggle ~ Housing Analysis ③ ~ Part1
Introduction to Lightning pytorch
Introduction to Web Scraping
Introduction to EV3 / MicroPython
Introduction to Python language
Introduction to TensorFlow-Image Recognition
Introduction to OpenCV (python)-(2)
Kaggle Summary: Redhat (Part 2)
Introduction to Dependency Injection
Introduction to Private Chainer
Introduction to machine learning
[Introduction to cx_Oracle] (Part 6) DB and Python data type mapping
Day 67 [Introduction to Kaggle] Have you tried using Random Forest?
[Introduction to cx_Oracle] (Part 4) Fetch and scroll of result set
An introduction to Python that even monkeys can understand (Part 3)
Introduction to Python scikit-learn, matplotlib, single-layer algorithm (~ towards B3 ~ part3)
An introduction to Python that even monkeys can understand (Part 1)