What is DataFrame concatenation in Pandas?

Concatenation means stacking DataFrames together. Use pd.concat() to combine multiple DataFrames into one larger DataFrame.

How do you concatenate along rows vs columns?

Use axis=0 (default) to stack rows vertically like pancakes. Use axis=1 to place columns side by side like books on a shelf.

What does the join parameter do in pd.concat()?

Join controls how mismatched columns are handled. Outer keeps all columns with NaN for gaps. Inner keeps only matching columns.

Concatenating DataFrames in Pandas | Guide

🧩 Concatenating DataFrames in Pandas

The LEGO Brick Story

Imagine you have two boxes of LEGO bricks. One box has red bricks, the other has blue bricks. You want to play with ALL of them together!

Concatenating is just like dumping both boxes into one big pile so you can build something amazing.

That’s exactly what pd.concat() does with DataFrames!

🎯 What is Concatenation?

Concatenation = Stacking DataFrames together

Think of it like:

Rows: Stacking pancakes on top of each other 🥞
Columns: Putting books side by side on a shelf 📚

import pandas as pd

# Two small DataFrames
df1 = pd.DataFrame({'Name': ['Alice', 'Bob']})
df2 = pd.DataFrame({'Name': ['Charlie', 'Diana']})

# Concatenate them!
result = pd.concat([df1, df2])

Result: One bigger DataFrame with all four names!

📚 Concatenating Along ROWS (axis=0)

This is the default way. Like stacking plates!

The Setup

# Morning orders
morning = pd.DataFrame({
    'Item': ['Coffee', 'Toast'],
    'Price': [3, 2]
})

# Evening orders
evening = pd.DataFrame({
    'Item': ['Soup', 'Salad'],
    'Price': [5, 4]
})

Stack Them!

all_orders = pd.concat([morning, evening])
print(all_orders)

Output:

     Item  Price
0  Coffee      3
1   Toast      2
0    Soup      5
1   Salad      4

🤔 Wait… Why are there two "0"s and two "1"s?

Each DataFrame kept its original index!

Fix: Reset the Index

all_orders = pd.concat(
    [morning, evening],
    ignore_index=True
)

Now the output is clean:

     Item  Price
0  Coffee      3
1   Toast      2
2    Soup      5
3   Salad      4

🎨 Visual Flow

graph TD
    A["Morning DataFrame&lt;br/&gt;2 rows"] --> C["pd.concat"]
    B["Evening DataFrame&lt;br/&gt;2 rows"] --> C
    C --> D["Combined DataFrame&lt;br/&gt;4 rows stacked vertically"]

📖 Concatenating Along COLUMNS (axis=1)

Now imagine putting two posters side by side on your wall.

The Setup

# Student names
names = pd.DataFrame({
    'Name': ['Alice', 'Bob', 'Charlie']
})

# Their scores
scores = pd.DataFrame({
    'Math': [90, 85, 78],
    'Science': [88, 92, 80]
})

Put Them Side by Side!

full_report = pd.concat(
    [names, scores],
    axis=1
)
print(full_report)

Output:

      Name  Math  Science
0    Alice    90       88
1      Bob    85       92
2  Charlie    78       80

🎨 Visual Flow

graph TD
    A["Names DataFrame&lt;br/&gt;1 column"] --> C["pd.concat axis=1"]
    B["Scores DataFrame&lt;br/&gt;2 columns"] --> C
    C --> D["Full Report&lt;br/&gt;3 columns side by side"]

⚠️ Important Rule!

When combining columns, row counts must match.

If they don’t? You’ll get NaN (missing values) filling the gaps!

🔗 The Join Method

Sometimes your DataFrames don’t have the same columns (for rows) or same rows (for columns).

Join decides what to do!

Two Options:

Join Type	What It Does	Like This…
`outer`	Keep EVERYTHING	All guests come to the party 🎉
`inner`	Keep only MATCHING	Only VIP guests allowed 🎫

Example: Different Columns

df1 = pd.DataFrame({
    'A': [1, 2],
    'B': [3, 4]
})

df2 = pd.DataFrame({
    'B': [5, 6],
    'C': [7, 8]
})

Outer Join (Default)

result = pd.concat(
    [df1, df2],
    join='outer'
)
print(result)

Output:

     A  B    C
0  1.0  3  NaN
1  2.0  4  NaN
0  NaN  5  7.0
1  NaN  6  8.0

Explanation: Column A and C don’t exist in both. So empty spots become NaN.

Inner Join

result = pd.concat(
    [df1, df2],
    join='inner'
)
print(result)

Output:

Explanation: Only column B exists in BOTH. So only B survives!

🎨 Visual Comparison

graph TD
    subgraph "Outer Join"
        O1["A, B"] --> O3["A, B, C"]
        O2["B, C"] --> O3
    end

    subgraph "Inner Join"
        I1["A, B"] --> I3["B only"]
        I2["B, C"] --> I3
    end

🏆 Quick Summary

Task	Code	Result
Stack rows	`pd.concat([df1, df2])`	Taller DataFrame
Stack columns	`pd.concat([df1, df2], axis=1)`	Wider DataFrame
Clean index	`ignore_index=True`	Fresh 0,1,2,3…
Keep all data	`join='outer'`	NaN fills gaps
Only common	`join='inner'`	Matching only

🎯 Real-World Example

You’re a teacher with two class sections:

class_a = pd.DataFrame({
    'Student': ['Emma', 'Liam'],
    'Grade': ['A', 'B']
})

class_b = pd.DataFrame({
    'Student': ['Noah', 'Olivia'],
    'Grade': ['B', 'A']
})

# Combine all students
all_students = pd.concat(
    [class_a, class_b],
    ignore_index=True
)

print(all_students)

Output:

  Student Grade
0    Emma     A
1    Liam     B
2    Noah     B
3  Olivia     A

Now you have ONE list of all your students! 🎓

💡 Pro Tips

Always use ignore_index=True when stacking rows unless you need original indices.
Check column names first! Mismatched spelling = separate columns.
Use axis=1 carefully. Make sure row counts align.
outer is safe, inner is strict. Choose based on your needs.

🚀 You Did It!

You now know how to:

✅ Stack DataFrames vertically (rows)
✅ Stack DataFrames horizontally (columns)
✅ Control what happens with join
✅ Keep your index clean

Concatenating is like being a master puzzle builder. You take separate pieces and combine them into one beautiful picture!

Now go stack some DataFrames! 🎉

Concatenating DataFrames

Unable to load concept

Coming Soon...

🧩 Concatenating DataFrames in Pandas

The LEGO Brick Story

🎯 What is Concatenation?

📚 Concatenating Along ROWS (axis=0)

The Setup

Stack Them!

🤔 Wait… Why are there two "0"s and two "1"s?

Fix: Reset the Index

🎨 Visual Flow

📖 Concatenating Along COLUMNS (axis=1)

The Setup

Put Them Side by Side!

🎨 Visual Flow

⚠️ Important Rule!

🔗 The Join Method

Two Options:

Example: Different Columns

Outer Join (Default)

Inner Join

🎨 Visual Comparison

🏆 Quick Summary

🎯 Real-World Example

💡 Pro Tips

🚀 You Did It!

Story - Premium Content

Stay Tuned!

Story - Premium Content

Interactive - Premium Content

Interactive - Premium Content

Stay Tuned!

Cheatsheet - Premium Content

Cheatsheet - Premium Content

Stay Tuned!

Quiz - Premium Content

Quiz - Premium Content

Stay Tuned!

Flashcard - Premium Content

Flashcard - Premium Content

Stay Tuned!

Sign in Required

Report an Issue