<?xml version="1.0" encoding="UTF-8"?>
<doi_batch xmlns="http://www.crossref.org/schema/4.3.6" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:fr="http://www.crossref.org/fundref.xsd" xmlns:jats="http://www.ncbi.nlm.nih.gov/JATS1" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:ai="http://www.crossref.org/AccessIndicators.xsd" xmlns:str="http://exslt.org/strings" version="4.3.6" xsi:schemaLocation="http://www.crossref.org/schema/4.3.6 http://www.crossref.org/schema/deposit/crossref4.3.6.xsd">
  <head>
    <doi_batch_id>10.7717/peerj-cs.38</doi_batch_id>
    <timestamp>20170722033524</timestamp>
    <depositor>
      <depositor_name>PeerJ</depositor_name>
      <email_address>doi@peerj.com</email_address>
    </depositor>
    <registrant>PeerJ</registrant>
  </head>
  <body>
    <journal>
      <journal_metadata language="en" reference_distribution_opts="any" metadata_distribution_opts="any">
        <full_title>PeerJ Computer Science</full_title>
        <issn media_type="electronic">2376-5992</issn>
        <archive_locations>
          <archive name="CLOCKSS"/>
          <archive name="LOCKSS"/>
          <archive name="Portico"/>
        </archive_locations>
      </journal_metadata>
      <journal_issue>
        <publication_date media_type="online">
          <year>2015</year>
        </publication_date>
        <journal_volume>
          <volume>1</volume>
        </journal_volume>
      </journal_issue>
      <journal_article publication_type="full_text">
        <titles>
          <title>Measuring online social bubbles</title>
        </titles>
        <contributors>
          <person_name contributor_role="author" sequence="first">
            <given_name>Dimitar</given_name>
            <surname>Nikolov</surname>
            <affiliation>Center for Complex Networks and Systems Research, Indiana University, Bloomington, IN, United States</affiliation>
          </person_name>
          <person_name contributor_role="author" sequence="additional">
            <given_name>Diego F.M.</given_name>
            <surname>Oliveira</surname>
            <affiliation>Center for Complex Networks and Systems Research, Indiana University, Bloomington, IN, United States</affiliation>
          </person_name>
          <person_name contributor_role="author" sequence="additional">
            <given_name>Alessandro</given_name>
            <surname>Flammini</surname>
            <affiliation>Center for Complex Networks and Systems Research, Indiana University, Bloomington, IN, United States</affiliation>
          </person_name>
          <person_name contributor_role="author" sequence="additional">
            <given_name>Filippo</given_name>
            <surname>Menczer</surname>
            <affiliation>Center for Complex Networks and Systems Research, Indiana University, Bloomington, IN, United States</affiliation>
          </person_name>
        </contributors>
        <jats:abstract>
        <jats:p>Social media have become a prevalent channel to access information, spread ideas, and influence opinions. However, it has been suggested that social and algorithmic filtering may cause exposure to less diverse points of view. Here we quantitatively measure this kind of social bias at the collective level by mining a massive datasets of web clicks. Our analysis shows that collectively, people access information from a significantly narrower spectrum of sources through social media and email, compared to a search baseline. The significance of this finding for individual exposure is revealed by investigating the relationship between the diversity of information sources experienced by users at both the collective and individual levels in two datasets where individual users can be analyzed—Twitter posts and search logs. There is a strong correlation between collective and individual diversity, supporting the notion that when we use social media we find ourselves inside “social bubbles.” Our results could lead to a deeper understanding of how technology biases our exposure to new information.</jats:p>
      </jats:abstract>
        <publication_date media_type="online">
          <month>12</month>
          <day>02</day>
          <year>2015</year>
        </publication_date>
        <pages>
          <first_page>e38</first_page>
        </pages>
        <publisher_item>
          <item_number item_number_type="article-number">e38</item_number>
          <identifier id_type="doi">10.7717/peerj-cs.38</identifier>
        </publisher_item>
        <fr:program>
          <fr:assertion name="fundgroup">
            <fr:assertion name="funder_name">James S. McDonnell Foundation</fr:assertion>
          </fr:assertion>
          <fr:assertion name="fundgroup">
            <fr:assertion name="funder_name">National Science Foundation</fr:assertion>
            <fr:assertion name="award_number">CCF-1101743</fr:assertion>
          </fr:assertion>
        </fr:program>
        <ai:program name="AccessIndicators">
          <ai:license_ref>http://creativecommons.org/licenses/by/4.0/</ai:license_ref>
        </ai:program>
        <doi_data>
          <doi>10.7717/peerj-cs.38</doi>
          <resource>https://peerj.com/articles/cs-38</resource>
          <collection property="text-mining">
            <item>
              <resource content_version="vor" mime_type="application/pdf">https://peerj.com/articles/cs-38.pdf</resource>
            </item>
            <item>
              <resource content_version="vor" mime_type="application/xml">https://peerj.com/articles/cs-38.xml</resource>
            </item>
            <item>
              <resource content_version="vor" mime_type="text/html">https://peerj.com/articles/cs-38.html</resource>
            </item>
          </collection>
          <collection property="crawler-based">
            <item crawler="iParadigms">
              <resource>https://peerj.com/articles/cs-38.pdf</resource>
            </item>
          </collection>
        </doi_data>
        <citation_list>
          <citation key="10.7717/peerj-cs.38/ref-1">
            <volume_title>LinkKDD’05: proceedings of the 3rd international workshop on Link discovery</volume_title>
            <author>Adamic</author>
            <first_page>36</first_page>
            <cYear>2005</cYear>
            <article_title>The political blogosphere and the 2004 US election: divided they blog</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-2">
            <journal_title>EPJ Data Science</journal_title>
            <author>An</author>
            <volume>3</volume>
            <cYear>2014</cYear>
            <doi>10.1140/epjds/s13688-014-0012-2</doi>
            <article_title>Sharing political news: the balancing act of intimacy and socialization in selective exposure</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-3">
            <journal_title>Science</journal_title>
            <author>Bakshy</author>
            <cYear>2015</cYear>
            <article_title>Exposure to ideologically diverse news and opinion on facebook</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-4">
            <volume_title>Proceedings of the 21st international conference on world wide web, WWW’12</volume_title>
            <author>Bakshy</author>
            <first_page>519</first_page>
            <cYear>2012</cYear>
            <article_title>The role of social networks in information diffusion</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-5">
            <volume_title>Thinking and deciding</volume_title>
            <author>Baron</author>
            <edition_number>3</edition_number>
            <cYear>2000</cYear>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-6">
            <author>Conover</author>
            <cYear>2011</cYear>
            <article_title>Political polarization on twitter</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-7">
            <journal_title>EPJ Data Science</journal_title>
            <author>Conover</author>
            <volume>1</volume>
            <cYear>2012</cYear>
            <doi>10.1140/epjds6</doi>
            <article_title>Partisan asymmetries in online political activity</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-8">
            <author>Doris-Down</author>
            <first_page>120</first_page>
            <cYear>2013</cYear>
            <article_title>Political blend: an application designed to bring people together based on political differences</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-9">
            <volume_title>SSRN Scholarly Paper ID 2363701</volume_title>
            <author>Flaxman</author>
            <cYear>2013</cYear>
            <article_title>Ideological segregation and the effects of social media on news consumption</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-10">
            <journal_title>Proceedings of the National Academy of Sciences of the United States of America</journal_title>
            <author>Fortunato</author>
            <volume>103</volume>
            <issue>34</issue>
            <first_page>12684</first_page>
            <cYear>2006</cYear>
            <doi>10.1073/pnas.0605525103</doi>
            <article_title>Topical interests and the mitigation of search engine bias</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-11">
            <author>Gilbert</author>
            <first_page>1</first_page>
            <cYear>2009</cYear>
            <article_title>Blogs are echo chambers: blogs are echo chambers</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-12">
            <cYear>2009</cYear>
            <article_title>Introducing google social search: I finally found my friend’s New York blog!</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-13">
            <cYear>2009</cYear>
            <article_title>Personalized search for everyone</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-14">
            <author>Graells-Garrido</author>
            <cYear>2013</cYear>
            <article_title>Data portraits: connecting people of opposing views. Technical Report</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-15">
            <author>Grevet</author>
            <first_page>1400</first_page>
            <cYear>2014</cYear>
            <article_title>Managing political differences in social media</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-16">
            <journal_title>Psychological Bulletin</journal_title>
            <author>Hart</author>
            <volume>135</volume>
            <issue>4</issue>
            <first_page>555</first_page>
            <cYear>2009</cYear>
            <doi>10.1037/a0015701</doi>
            <article_title>Feeling validated versus being correct: a meta-analysis of selective exposure to information</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-17">
            <journal_title>Management Science</journal_title>
            <author>Hosanagar</author>
            <volume>60</volume>
            <issue>4</issue>
            <first_page>805</first_page>
            <cYear>2013</cYear>
            <doi>10.1287/mnsc.2013.1808</doi>
            <article_title>Will the global village fracture into tribes? Recommender systems and their effects on consumer fragmentation</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-18">
            <journal_title>British Journal of Social Psychology</journal_title>
            <author>Kastenmüller</author>
            <volume>49</volume>
            <issue>4</issue>
            <first_page>745</first_page>
            <cYear>2010</cYear>
            <doi>10.1348/014466609X478988</doi>
            <article_title>Selective exposure: the impact of collectivism and individualism</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-19">
            <author>Koutra</author>
            <cYear>2014</cYear>
            <article_title>Events and controversies: influences of a shocking news event on information seeking. Technical Report</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-20">
            <cYear>2014</cYear>
            <article_title>8 key takeaways about social media and news</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-21">
            <volume_title>Handbook of judgment and decision making</volume_title>
            <author>McKenzie</author>
            <first_page>200</first_page>
            <cYear>2004</cYear>
            <article_title>Hypothesis testing and evaluation</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-22">
            <volume_title>Proceedings of the 2008 international conference on web search and data mining</volume_title>
            <author>Meiss</author>
            <first_page>65</first_page>
            <cYear>2008</cYear>
            <article_title>Ranking web sites with real user traffic</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-23">
            <journal_title>Science</journal_title>
            <author>Mervis</author>
            <volume>346</volume>
            <issue>6210</issue>
            <first_page>686</first_page>
            <cYear>2014</cYear>
            <doi>10.1126/science.346.6210.686</doi>
            <article_title>An internet research project draws conservative ire</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-24">
            <journal_title>Communications of the ACM</journal_title>
            <author>Mobasher</author>
            <volume>43</volume>
            <issue>8</issue>
            <first_page>142</first_page>
            <cYear>2000</cYear>
            <doi>10.1145/345124.345169</doi>
            <article_title>Automatic personalization based on web usage mining</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-25">
            <volume_title>Proceedings of the SIGCHI conference on human factors in computing systems</volume_title>
            <author>Munson</author>
            <first_page>1457</first_page>
            <cYear>2010</cYear>
            <article_title>Presenting diverse political opinions: How and how much</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-26">
            <journal_title>Review of General Psychology</journal_title>
            <author>Nickerson</author>
            <volume>2</volume>
            <issue>2</issue>
            <first_page>175</first_page>
            <cYear>1998</cYear>
            <doi>10.1037/1089-2680.2.2.175</doi>
            <article_title>Confirmation bias: a ubiquitous phenomenon in many guises</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-27">
            <journal_title>Political Behavior</journal_title>
            <author>Nyhan</author>
            <volume>32</volume>
            <issue>2</issue>
            <first_page>303</first_page>
            <cYear>2010</cYear>
            <doi>10.1007/s11109-010-9112-2</doi>
            <article_title>When corrections fail: the persistence of political misperceptions</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-28">
            <volume_title>The filter bubble: how the new personalized Web is changing what we read and how we think</volume_title>
            <author>Pariser</author>
            <cYear>2011</cYear>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-29">
            <volume_title>Recommender systems handbook</volume_title>
            <author>Ricci</author>
            <cYear>2011</cYear>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-30">
            <volume_title>On economic inequality</volume_title>
            <author>Sen</author>
            <cYear>1973</cYear>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-31">
            <journal_title>The Bell System Technical Journal</journal_title>
            <author>Shannon</author>
            <volume>27</volume>
            <issue>3</issue>
            <first_page>379</first_page>
            <cYear>1948</cYear>
            <doi>10.1002/j.1538-7305.1948.tb01338.x</doi>
            <article_title>A mathematical theory of communication</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-32">
            <cYear>2011</cYear>
            <article_title>The Backfire Effect</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-33">
            <journal_title>Current Directions in Psychological Science</journal_title>
            <author>Stanovich</author>
            <volume>22</volume>
            <issue>4</issue>
            <first_page>259</first_page>
            <cYear>2013</cYear>
            <doi>10.1177/0963721413480174</doi>
            <article_title>Myside bias, rational thinking, and intelligence</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-34">
            <journal_title>Journal of Political Philosophy</journal_title>
            <author>Sunstein</author>
            <volume>10</volume>
            <issue>2</issue>
            <first_page>175</first_page>
            <cYear>2002</cYear>
            <doi>10.1111/1467-9760.00148</doi>
            <article_title>The law of group polarization</article_title>
          </citation>
          <citation key="10.7717/peerj-cs.38/ref-35">
            <volume_title>Republic.com 2.0</volume_title>
            <author>Sunstein</author>
            <cYear>2009</cYear>
          </citation>
        </citation_list>
        <component_list>
          <component parent_relation="isPartOf">
            <titles>
              <title>Figure 1: Diversity of information sources accessed through different online channels.</title>
            </titles>
            <format mime_type="image/png"/>
            <doi_data>
              <doi>10.7717/peerj-cs.38/fig-1</doi>
              <resource>https://peerj.com/articles/cs-38/fig-1</resource>
            </doi_data>
          </component>
          <component parent_relation="isPartOf">
            <titles>
              <title>Figure 2: Dependence of entropy on traffic volume.</title>
            </titles>
            <format mime_type="image/png"/>
            <doi_data>
              <doi>10.7717/peerj-cs.38/fig-2</doi>
              <resource>https://peerj.com/articles/cs-38/fig-2</resource>
            </doi_data>
          </component>
          <component parent_relation="isPartOf">
            <titles>
              <title>Figure 3: Diversity of sources accessed by different online activities.</title>
            </titles>
            <format mime_type="image/png"/>
            <doi_data>
              <doi>10.7717/peerj-cs.38/fig-3</doi>
              <resource>https://peerj.com/articles/cs-38/fig-3</resource>
            </doi_data>
          </component>
          <component parent_relation="isPartOf">
            <titles>
              <title>Figure 4: Top websites that are targets of 40% of clicks for search (A) and social media (B).</title>
            </titles>
            <format mime_type="image/png"/>
            <doi_data>
              <doi>10.7717/peerj-cs.38/fig-4</doi>
              <resource>https://peerj.com/articles/cs-38/fig-4</resource>
            </doi_data>
          </component>
          <component parent_relation="isPartOf">
            <titles>
              <title>Figure 5: Correlation between collective and average individual entropy.</title>
            </titles>
            <format mime_type="image/png"/>
            <doi_data>
              <doi>10.7717/peerj-cs.38/fig-5</doi>
              <resource>https://peerj.com/articles/cs-38/fig-5</resource>
            </doi_data>
          </component>
          <component parent_relation="isPartOf">
            <titles>
              <title>Table 1: Most frequent sources for each category of traffic and their corresponding numbers of clicks.</title>
            </titles>
            <format mime_type="text/html"/>
            <doi_data>
              <doi>10.7717/peerj-cs.38/table-1</doi>
              <resource>https://peerj.com/articles/cs-38/table-1</resource>
            </doi_data>
          </component>
          <component parent_relation="isPartOf">
            <titles>
              <title>Table 2: Seed DMOZ categories for the crawler used to extract a list of close to 3,500 news sites.</title>
            </titles>
            <format mime_type="text/html"/>
            <doi_data>
              <doi>10.7717/peerj-cs.38/table-2</doi>
              <resource>https://peerj.com/articles/cs-38/table-2</resource>
            </doi_data>
          </component>
          <component parent_relation="isPartOf">
            <titles>
              <title>Table 3: Blogging platforms, Wiki platforms and news aggregators filtered out of the list of news sites.</title>
            </titles>
            <format mime_type="text/html"/>
            <doi_data>
              <doi>10.7717/peerj-cs.38/table-3</doi>
              <resource>https://peerj.com/articles/cs-38/table-3</resource>
            </doi_data>
          </component>
        </component_list>
      </journal_article>
    </journal>
  </body>
</doi_batch>
