Intro to Data Visualization with R & ggplot2

The R programming language is experiencing rapid increases in popularity and wide adoption across industries. This popularity is due, in part, to R’s rich and powerful data visualization capabilities. While tools like Excel, Power BI, and Tableau are often the go-to solutions for data visualizations, none of these tools can compete with R in terms of the sheer breadth of, and control over, crafted data visualizations.

As an example, R’s ggplot2 package provides the R programmer with dozens of print-quality visualizations – where any visualization can be heavily customized with a minimal amount of code.

In this webinar Dave Langer will provide an introduction to data visualization with the ggplot2 package. The focus of the webinar will be using ggplot2 to analyze your data visually with a specific focus on discovering the underlying signals/patterns of your business.

Attendees will learn how to:

• Craft ggplot visualizations, including customization of rendered output.

• Choose optimal visualizations for the type of data and the nature of the analysis at hand.

• Leverage ggplot2’s powerful segmentation capabilities to achieve “visual drill-in of data”.

• Export ggplot2 visualizations from RStudio for use in documents and presentations.


Learn more about Data Science Dojo here:

See what our past attendees are saying here:

Like Us:
Follow Us:
Connect with Us:

Also find us on:
Google +:



This is a great tur]torial, good job

heroinhero69 says:

dumb question, but how do you successfully indent a line of code with the + sign, so that you don’t have something that looks like:

ggplot(data = filename) + geom_point(mapping = aes(x = xvar, y = yvar))

and instead have:

ggplot(data = filename) +
geom_point(mapping = aes(x = xvar, y = yvar))

P.B M says:

Nice video to get you hook with ggplot2

Mohammed Belo says:

Wonderful, this is was so useful and one hour full of knowledge and hand on practice.
Thanks alot guys !


Tuan Long says:

thanks a lot !!!

Liberty Mgbanyi says:

Please how do I display equation of the line and r^2 on my plots in R? In excel it is very easy to do this. I am buying into R because of R markdown. Please help out as I need my equation displayed just the way I use to in excel

Hasibul Islam says:

in 42:30min, you have a color on you bars. but with same code, my bars are having the same color. Why? Please give me a solution. Thanks in advance

Martin Wilson says:

Very helpful and appreciated, thanks for uploading

Kyle Larson says:

What would be fantastic is if you could please create 10-15min or less summary videos of your lessons just to provide a snap shot of the different codes. That way it would make it extremely easy to revise your information without needing to sit through the repetition of the more indepth explanations we have already heard.

berik says:

Yeah, proximity, I’m sure that accounts for the variance. Ugh. But thanks for the video. Or webinar. Or hookup or hangout or get together or whatever it’s called. Cheers.

Tashi Dem says:

What is the use of factorise here? I thought factorising some variable was going to be used later in exercise.

Jack William Biggs says:

great introductory guide – thanks for uploading this


worth watching 1 hr..Really helpful. Thanks a lot

Robin Redhu says:

I think in last two graphs both density and histogram are wrongly labeled

Ntim Domfeh says:

Thank you very much. You are far too kind

Adam Curtis says:

Great intro to ggplot2. Made the basics very clear.

Dan Reznik says:

you should do geom_boxplot(notch=T) so folks understand the concept of visually comparing medians; also read_csv preferred over read.csv

Kushagra Mishra says:

I would suggest everyone beginning with ggplot2 to go through this 1hr vedio, it will save you a lot of time understanding the basics.

Charwak Apte says:

Infinite SNR – Thanks!

asif khan says:

very nice explanation with the dataset. Thank You.

Shubhangi Suralkar says:

really helpfull

Ali Awadh says:

The seventh question, I believe the labs should be as:
labs(x = “Age”, y = “Density”), and
labs(x = “Age”, y = “Survived Count”)

ravinder ram says:

ggplot2 best package in data science for visulaization

Andy Monks says:

excellent video! Thank you very much Dave

Kristian Schmidt says:

Very helpful. I think at the end, the density plot vs histograms issue is, the layered density plots show two different distributions of age and the histograms show one distribution of age and bi-color that distribution by survival. Two different things.

Michael Rockinger says:

Show really starts after 25min. You should have discussed passengerid and name when you discussed the variables. Is ggplot smart to use factors for visualization? In a few days i will be desperate to remember that i need to factor to get certain visu. It should be the programmer to have contol not the program. No? Is it really sooo complicated to put % in the plot? Not good publicity for such a great package as ggplot.

sahal naz says:

very useful video… thank you

WahranRai says:

46:08 May be instead of using copy and paste, we could use, for example:
ggplt = ggplot2(titanic,aes…) and add layers to that
ggplt +

 Write a comment


Do you like our videos?
Do you want to see more like that?

Please click below to support us on Facebook!