SalePrice
variable in the House Prices competition through a histogram or KDE plot signals the need for logarithmic transformation, directly impacting the log-based evaluation metric.histplot
to visualize FIFA rankings.SalePrice
) across categories (e.g., Neighborhood
).Age
or Fare
across survival classes.Height
vs. Weight
and distinguishing between Male
and Female
subjects using color to discover conditional relationships.Age
and Cabin
columns, guiding decisions to impute or drop.missingno
create missing data heatmaps.Pclass
or Embarked
) and comparing survival rates across groups. Provides immediate, actionable insights into categorical influence on the target variable.Pclass
in the Titanic challenge.Library | Level of Control | Ease of Use | Interactivity | Typical Use Case | Key Trade-off |
---|---|---|---|---|---|
Matplotlib | Low-level / High | Low | No | Extensive customization; foundational plotting | Steep learning curve for high degree of control |
Seaborn | High-level / Medium | High | No | Rapid statistical EDA; clean, attractive plots | Less granular control than Matplotlib |
Plotly | High-level / Medium | Medium | Yes | Interactive public notebooks; dynamic analysis | Slower with large datasets; more code for basic plots |
Age
variable: Reveals an approximately normal distribution.Fare
variable: Shows a heavy right skew, suggesting a log transformation might improve linear model performance.Pclass
or Sex
.Age
and Cabin
columns, guiding decisions to impute or drop (e.g., Cabin
often dropped due to extensive missingness).Age
, Fare
, SibSp
, Parch
).SalePrice
histogram/KDE plot.GrLivArea
(above-grade living area) vs. SalePrice
: Shows a clear, positive, linear correlation.SalePrice
distribution across Neighborhood
or OverallQual
categories.