Crowdsourcing Data on Humor Via YouTube: Want to Help?

Per this video, I’m preparing for a presentation at the International Society of Humor Studies (yes there’s such a thing). I present on Tuesday (July 5) in Boston at the international meeting. If you’re near Boston University, please enroll and attend! This is a scholarly and professional organization dedicated to the advancement of humor research. Many of the Society’s members are university and college professors in the Arts and Humanities, Biological and Social Sciences and Education. Then there’s me.

Given my “prolific” experience as a YouTube “comedian” (220 million views, and about 200K per day) and my publication of “Beyond Viral,” I’m tackling humor from the perspective of comedy videos on YouTube and their “rankings.” My background as a psychology student (Georgetown) and MBA in marketing (statistics) also helps, and so does my decades of analyzing market research for my job as a marketer (now at Johnson & Johnson). But you, dear reader, offer perhaps other valuable perspectives.

Here’s the fundamental question this presentation (including a white paper) will address:

What can we learn about what this planet finds funny, based on the data available on YouTube?

Do you want to help? Here’s some information if you have time/interest…

  • YouTube, as the world’s largest video site and 2nd most-popular search engine after Google, is a good basis to explore humor. The videos can be sorted in many ways, and the large data sample is a rich source of insights. There are, of course, three “confounding” variables to extrapolating YouTube data to the planet’s humor preference: a) Selection bias: YouTube viewers are not necessarily representative, b) Popularity bias: videos by “popular” webstars generally get more views and higher ratings regardless of their humor quotient, c) Algorithm bias: YouTube videos for many years were ranked by “most viewed” or “favorite” videos, which created a “rich get rich” effect… once a video achieved critical mass, it received new views based on its ranking and effectively “locked” some weak videos in a place of perpetual viewing. That’s changed, and now videos are “spotlighted” based such criteria as percentage of comments, promoted videos, and other concealed factors that change.
  • I’ve spent countless hours reviewing the top 100 most-viewed comedy videos on YouTube (see preliminary findings by clicking “MORE” at the bottom of this post), and categorizing them by a dozen plus criteria. Your contribution, if you wish to help, need not be as exhaustive. I had to view, classify, expand classifications and review them multiple times. I found only about 12 of them funny by my subjective standards, but that’s not the goal. After viewing them each 5-10 times, I can say none are funny anymore to me.
  • You can help any way you have time, assuming a) you find this research interesting and b) you have time free between now and Monday (July 4). You could spend 1 minute providing a comment about how you might suggest analyzing YouTube. Or maybe you’re keen on spending a hours actually reviewing videos based on criteria/methodology you prefer (do it, don’t ask for my feedback). If you can find some interesting published method for classifying humor (edgy/cute or intellectual/emotional) than use it. Or create your own based on a hypothesis (are Asians more likely to be top-rated comedians? Are women?).
  • What’s in it for you? You’ll be part of something that, to my knowledge, has never been done (although if I’m wrong and you find otherwise let me know). We’re combining two disciplines (the art of comedy and the science of analysis/psychology) that rarely meet. I’ll be grateful for your comments and volunteer assignments, and I’ll credit you in the report and in a YouTube video if you provide ANY meaningful contribution (like a 2-page summary of quantifiably substantiated findings).
  • What do you do next? If you have an idea, run with it. You could sort comedy videos by date (time period) and look at objective patterns.
    • You could review most-viewed or most-subscribed comedians and observe similarities and differences in some quantifiable way. Just try to avoid your subjective opinion (what YOU like/don’t), and instead focus on quantifiable patterns based on what crowds like (as measured by rankings/ratings/comments/likes/dislikes)… as you’ll see by my “preliminary findings” this does require some subjective calls but be consistant and note criteria.
    • You’ll also have to rely on your YouTube knowledge to isolate “confounding variables” (Shaytards love Shaycarl and tend to view/rate his videos as high, which could lead to a faulty conclusion that it’s representative of the planet’s preference about humor. The goal isn’t to find out what hard-core YouTubers like (or specific “tribes” of people) but something bigger.
    • You could research other academic research on humor that provide clues. Or use an already researched classification model for comedy/humor.
    • Instead of focusing on comedy videos, you could explore the most-subscribed channels on YouTube that classify themselves as comedy. What are the patterns?
  • Do you wait for my okay to start? Nope– just have a go. Even if your efforts don’t produce anything meaningful, you’ll be credited for your effort (just describe your approach and findings in a simple summary). I doubt we’ll see two people tackle it the same way, so there’s little risk of redundancy.
  • Timing: Again this is being presented on Tuesday so I need to wrap it by Monday, July 4. I hope you’ll join the effort! I’ll be checking comments between now and Monday regularly.

To read about my approach and findings so far, click MORE…

My approach so far: I’ve reviewed the 100 most-viewed videos that are classified as “comedy.” I could have used “most liked” comedy videos of all time (see link), but they’re fairly similar. About 10 of them are misclassified, and the frequency of views is somewhat deceptive. (YouTube years ago used to list videos prominently by how many times they were viewed, which created a self fulfilling prophecy. Videos would be viewed simply because they HAD been viewed, and get “locked” at the top. Thus these 100 aren’t necessarily representative.)

Preliminary Findings:

  • About 75% of the most-viewed comedy videos are amateur, with only about 25% being SNL clips or television clips that are ripped.
  • I’ve tracked patterns such as a) a cute child, b) music parody, c) dancing, d) animal, e) sexual references. Despite my thinking that dance/music would triumph, we’re seeing “cute” and “sexual” as the leading two patterns.
  • Only about 32% are or include “real” moments (a baby laughing, a cat doing something funny, a prank, or a sports fail). The rest are scripted or produced. That was surprising considering most people don’t think of top YouTube videos as “produced.”
  • Blood, featuring a worried child offset against giggling parents, deploys something subtle and important. The off-camera laughter (OCL) is something that advances what otherwise might be simply “cute” and creates a hysterical video. I’ve only counted about 5 instances of OCL, and if it’s genuine it has the intended effect of a television show’s laugh track: a laughter cue. Blood, and other prank or satire/FAIL videos, could arguably employ “schadenfreude” (taking pleasure from someone else’s misfortune).

Patrice Oppliger, Ph.D, is the IHSH conference chair, and I asked him if he had any classification recommendations. His response: There are some standards of classifying humor according to the various theories such as: incongruity (juxtaposing two or more unrelated elements) or surprise, superiority (laughing at other because the audience feels superior to the joke target), release theory (a Freudian approach were our deep seeded ID’s come out “safely” in a humor format – for example, sexual, bigoted jokes).

  • Using those theories, I’ve found most videos to be based on “superiority” (a satire of a celebrity, group of people or victim), and many are based on the “release” theory, like the sexually nuanced SNL skits. The minority of videos use incongruity (juxtaposing cats with captions ala LOLCats, or Poddy Training (75 million views), which combines a popular song with a baby cartoon.
  • There are, of course, other theories on humor “types” and feel free to chose whatever one you like!

Thanks for reading this, and even more if you’re willing to pitch in!

22 thoughts on “Crowdsourcing Data on Humor Via YouTube: Want to Help?”

  1. I just want to throw one bit of information your way. I think the question you are trying to answer will always be determined by age/era groups. What I find funny on YouTube and what my son of 9 thinks is funny is two different points. Good luck with your findings and what you consider results. I look forward to seeing the outcome of your presentation.
    -Ace

  2. Ace- good point. Is there a way to look at preferences by demographic (age, location)? If so I’m not aware, but agree it’s a VITAL issue. And YouTube itself is skewed to represent 13-20 year olds, so that produces a “sample bias” of even using YouTube. Wanna help!?

  3. I left a comment. Where the fuck is it? This is the 3d time ive commented on twitter/here that it hasnt shown up.

  4. Kevin, there is the Show Video Statistics button that often lists the top age/gender groups the video is most popular with.

  5. My approach would be to look at most favorited comedy videos, filtering out any top subscribed YouTubers who are adept at manipulating their viewers to rate/comment/subscribe/favorite. I think this would be less skewed than most viewed considering how little views correspond to quality much of the time (think Rebecca Black).

    I can’t today, but maybe I’ll do a small survey and analysis this weekend.

  6. hey it sounds like a very cool thing you are doing I was doing that on myspace music bands I wa giving them different stars to show which group was good, excellent,bad. sooo this is somewhat you are doing kind of I would like to try to help you if I can, you know like with Vloggers, to home scenerios, there is all different kinds of funny, check out Canadian studmuffin, to BiggyDman, and Paul Telner, these are really funny guys as well as the Shaytards.
    well I hope I can be of some kind of help and check out a few of my videos kittykittypaw.

  7. Hi Nalts. Your pursuit of knowledge interest me. Therefore, I will offer to suggest that many comedians (yourself included) pull from a common place to connect with their audiences. Since I don’t know if I’m funny or scary to you, I’ll conclude by saying the common human experience should yield a connection that transends basic language barriers. I’ll be backing off awhile now, I believe I’m becoming a bit too chummy and you tend to enjoy a good laugh at other people’s expense. 😀

  8. It’d be a lot easier to do this if Youtube allowed you to read the hot spots chart for videos other than yours. If there’s a way to do that I’m unaware of, this study would be much easier to conduct.

  9. On the topic of parody and different audiences, Weird Al posted his Lady Gaga parody video ten days ago and it now has 6.6 million views. I wonder if Gaga fans find it as funny as those of us who aren’t Gaga fans. Or, in other words, do little monsters have a sense of humor? My guess is that they do, but they might laugh at different parts of the video and for different reasons.

  10. I am very impressed with your scholasticism and the research you’ve already done. Very interesting and creative.
    Too lazy to help you with research but I of course have my opinion.
    I’m afraid crowd humor is based on pain and fear and it freightens me. I think when its constructive pain and fear, like learning something new about yourself or others, and still reaching for the “good” action or thought is really the funniest and is contageous. Have fun with this.

  11. Kev, I’ve got some data for you in a Google Doc. Check your goddamn email.

  12. By ………………Frankly the PSAs I was exposed to as a kid about the dangers of drug abuse and narcotics really didnt really have much of an effect on me.

Comments are closed.