Kaggle: Introduction to Manual Feature Engineering Part 1

https://www.kaggle.com/willkoehrsen/introduction-to-manual-feature-engineering

I thought I should make a note of what I was doing. Qiita says that memos are OK, so let's make a note here.

It's complicated to think that this notebook combines various tables, but

Just group by with client_id and count, and add that line to application_train to increase the feature amount, It's easy to think of it.

However, I feel that this KDE plot is useful in many places. Of course I've put out this kind of graph, but it's nice to have a name. For a common understanding.

Flattening the multi-level index to one level seems to be useful somewhere other than machine learning.

The continuation Function for Numeric Aggregations https://www.kaggle.com/willkoehrsen/introduction-to-manual-feature-engineering#Function-for-Numeric-Aggregations From.

There is such a thing in python in the first place.

sorted()

If you pass the list, it will sort. sorted([1,2,4,4,1,1,2,5,555,230]) [1, 1, 1, 2, 2, 4, 4, 5, 230, 555]

Functions can also be used for sorting, so new_corrs = sorted(new_corrs, key = lambda x: abs(x[1]), reverse = True) Sort by absolute value of x 1 (in reverse order).

Learned English words

--institution institution --handle with A A ―― by pure chance Just by chance, just by chance

I just remembered it or forgot it.

Recommended Posts

Kaggle: Introduction to Manual Feature Engineering Part 1

Introduction to PyQt4 Part 1

Kaggle House Prices ① ～ Feature Engineering ～

Introduction to Ansible Part ③'Inventory'

Introduction to Ansible Part ④'Variable'

Introduction to Ansible Part 2'Basic Grammar'

Introduction to Python Hands On Part 1

Web-WF Python Tornado Part 3 (Introduction to Openpyexcel)

Introduction to MQTT (Introduction)

Introduction to Scrapy (1)

Introduction to Scrapy (3)

Introduction to Supervisor

[Introduction to cx_Oracle] (Part 3) Basics of Table Reference

Introduction to Scrapy (2)

[Linux] Introduction to Linux

[Introduction to cx_Oracle] (Part 7) Handling of bind variables

Introduction to discord.py

Day 66 [Introduction to Kaggle] The easiest Titanic forecast

Kaggle House Prices ① ～ Feature Engineering ～

Note

Feature Engineering Traveling with Pokemon-Category Variables-

Note

Note

Feature Engineering Traveling with Pokemon-Numerical Edition-

Predicting Credit Card Defaults Feature Engineering

5th Feature Engineering for Machine Learning-Feature Selection

Kaggle: Introduction to Manual Feature Engineering Part 1

[Introduction to cx_Oracle] (Part 11) Basics of PL / SQL Execution

Introduction to Financial Engineering [Investment science] Chapter 2 Exercises Self-answer

[Introduction to cx_Oracle] (Part 13) Connection using connection pool (client side)

Introduction to Python numpy pandas matplotlib (~ towards B3 ~ part2)

Kaggle Summary: Redhat (Part 1)

Kaggle ~ Housing Analysis ③ ~ Part1

Introduction to Lightning pytorch

Introduction to Web Scraping

Introduction to EV3 / MicroPython

Introduction to Python language

Introduction to TensorFlow-Image Recognition

Introduction to OpenCV (python)-(2)

Kaggle Summary: Redhat (Part 2)

Introduction to Dependency Injection

Introduction to Private Chainer

Introduction to machine learning

[Introduction to cx_Oracle] (Part 6) DB and Python data type mapping

Day 67 [Introduction to Kaggle] Have you tried using Random Forest?

[Introduction to cx_Oracle] (Part 4) Fetch and scroll of result set

An introduction to Python that even monkeys can understand (Part 3)

Introduction to Python scikit-learn, matplotlib, single-layer algorithm (~ towards B3 ~ part3)

An introduction to Python that even monkeys can understand (Part 1)