<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>R-bloggers</title>
	
	<link>https://www.r-bloggers.com</link>
	<description>R news and tutorials contributed by (750) R bloggers</description>
	<lastBuildDate>Tue, 18 Jul 2017 05:32:59 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=4.6.1</generator>

<image>
	<url>https://i2.wp.com/www.r-bloggers.com/wp-content/uploads/2016/08/cropped-R_single_01-200.png?fit=32%2C32&amp;ssl=1</url>
	<title>R-bloggers</title>
	<link>https://www.r-bloggers.com</link>
	<width>32</width>
	<height>32</height>
</image> 
<site xmlns="com-wordpress:feed-additions:1">11524731</site>	<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/RBloggers" /><feedburner:info uri="rbloggers" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><feedburner:emailServiceId>RBloggers</feedburner:emailServiceId><feedburner:feedburnerHostname>https://feedburner.google.com</feedburner:feedburnerHostname><item>
		<title>Automatically Fitting the Support Vector Machine Cost Parameter</title>
		<link>http://feedproxy.google.com/~r/RBloggers/~3/w4YrCjqcZ6M/</link>
		<pubDate>Tue, 18 Jul 2017 05:32:59 +0000</pubDate>
		<dc:creator><![CDATA[Jake Hoare]]></dc:creator>
				<category><![CDATA[R bloggers]]></category>

		<guid isPermaLink="false">https://www.displayr.com/?p=1536</guid>
		<description><![CDATA[
In an earlier post I discussed how to avoid overfitting when using Support Vector Machines. This was achieved using cross validation. In cross validation, prediction accuracy is maximized by varying the cost parameter. Importantly, prediction accuracy is...]]></description>
				<content:encoded><![CDATA[<p class="syndicated-attribution"><div class="social4i" style="height:29px;"><div class="social4in" style="height:29px;float: left;"><div class="socialicons s4fblike" style="float:left;margin-right: 10px;"><div class="fb-like" data-href="https://www.r-bloggers.com/automatically-fitting-the-support-vector-machine-cost-parameter/" data-send="true"  data-layout="button_count" data-width="100" data-height="21"  data-show-faces="false"></div></div><div class="socialicons s4linkedin" style="float:left;margin-right: 10px;"><script type="in/share" data-url="https://www.r-bloggers.com/automatically-fitting-the-support-vector-machine-cost-parameter/" data-counter="right"></script></div></div><div style="clear:both"></div></div>

<div style="border: 1px solid; background: none repeat scroll 0 0 #EDEDED; margin: 1px; font-size: 12px;">
(This article was first published on  <strong><a href="https://www.displayr.com/automatically-fitting-the-support-vector-machine-cost-parameter/"> R – Displayr</a></strong>, and kindly contributed to <a href="https://www.r-bloggers.com/" rel="nofollow">R-bloggers)</a>      
</div></p>
<p><img src="https://i0.wp.com/www.displayr.com/wp-content/uploads/2017/04/glass-300x185.png?resize=300%2C185&#038;ssl=1" class="webfeedsFeaturedVisual wp-post-image" alt="preview Show information about the snippet editorYou can click on each element in the preview to jump to the Snippet Editor. SEO title preview: Displayr | Automatically Fitting the Support Vector Machine Cost Parameter" style="float: left; margin-right: 20px;" srcset_temp="https://i0.wp.com/www.displayr.com/wp-content/uploads/2017/04/glass-300x185.png?resize=300%2C185&#038;ssl=1 300w, https://www.displayr.com/wp-content/uploads/2017/04/glass-768x473.png 768w, https://www.displayr.com/wp-content/uploads/2017/04/glass-150x92.png 150w, https://www.displayr.com/wp-content/uploads/2017/04/glass-360x222.png 360w, https://www.displayr.com/wp-content/uploads/2017/04/glass.png 780w" sizes="(max-width: 300px) 100vw, 300px" data-recalc-dims="1" />
<p><img class="size-full wp-image-1539" src="https://i2.wp.com/www.displayr.com/wp-content/uploads/2017/04/glass.png?w=450" alt="Glass insulators" srcset_temp="https://www.displayr.com/wp-content/uploads/2017/04/glass.png 780w, https://www.displayr.com/wp-content/uploads/2017/04/glass-300x185.png 300w, https://www.displayr.com/wp-content/uploads/2017/04/glass-768x473.png 768w, https://www.displayr.com/wp-content/uploads/2017/04/glass-150x92.png 150w, https://www.displayr.com/wp-content/uploads/2017/04/glass-360x222.png 360w" sizes="(max-width: 780px) 100vw, 780px" data-recalc-dims="1" /></p>
<p>In <a href="https://www.displayr.com/using-support-vector-machines-in-displayr/" rel="nofollow" target="_blank">an earlier post</a> I discussed how to avoid overfitting when using Support Vector Machines. This was achieved using cross validation. In cross validation, prediction accuracy is maximized by varying the cost parameter. Importantly, prediction accuracy is calculated on a different subset of the data from that used for training.</p>
<p>In this blog post I take that concept a step further, by automating the manual search for the optimal cost.</p>
<p>The data set I’ll be using describes different types of glass based upon physical attributes and chemical composition.  You can read more about the data <a href="https://archive.ics.uci.edu/ml/datasets/Glass+Identification" rel="nofollow" target="_blank">here</a>, but for the purposes of my analysis all you need to know is that the outcome variable is categorical (7 types of glass) and the 4 predictor variables are numeric.</p>
<hr />
<h1><span style="color: #2b2a2f;">Creating the base support vector machine model</span></h1>
<p>I start, as in my <a href="https://www.displayr.com/using-support-vector-machines-in-displayr/" rel="nofollow" target="_blank">earlier analysis</a>, by splitting the data into a larger 70% training sample and a smaller 30% testing sample. Then I train a support vector machine on the training sample with the following code:</p>
<pre class="brush: r; title: ; notranslate">
library(flipMultivariates)
svm = SupportVectorMachine(Type ~ RefractiveIndex + Ca + Ba + Fe,
                           subset = training,
                           cost = 1)
</pre>
<p>This produces output as shown below. There are 2 reasons why we can largely disregard the 64.67% accuracy:</p>
<ol>
<li>We used the training data (and not the independent testing data) to calculate accuracy.</li>
<li>We have used a default value for the cost of 1 and not attempted to optimize.</li>
</ol>
<p><iframe class="displayr-embed" src="https://embedded.azureedge.net/75668/207841/72c1985d-091e-4170-bcd7-f9addd264962.html?v=28297-90b5b92269934767632" width="446" height="335" frameborder="0" scrolling="no"></iframe></p>
<hr />
<h1><span style="color: #2b2a2f;">Amending the R code</span></h1>
<p>I am going to amend the code above in order to loop over a range of values of cost. For each value, I will calculate the accuracy on the test sample. The updated code is as follows:</p>
<pre class="brush: r; title: ; notranslate">
library(flipMultivariates)
library(flipRegression)
costs = c(0.1, 1, 10, 100, 1000, 10000)
i = 1
accuracies = rep(0, length(costs))

for (cost in costs)
{
    svm = SupportVectorMachine(Type ~ RefractiveIndex + Ca + Ba + Fe,
                               subset = training,
                               cost = cost)
    accuracies[i] = attr(ConfusionMatrix(svm, subset = (testing == 1)), "accuracy")
    i = i + 1
}
plot(costs, accuracies, type = "l", log = "x")
</pre>
<p>The first 5 lines set things up. I load libraries required to run the Support Vector Machine and calculate the accuracy. Next I choose a range of costs, initialize a loop counter <span style="font-family: 'Courier New';">i</span> and an empty vector <span style="font-family: 'Courier New';">accuracies, </span>where I store the results.</p>
<p>Then I add a loop around the code that created the base model to iterate over <span style="font-family: 'Courier New';">costs</span><span style="font-family: 'Courier New';">. </span>The next line calculates and stores the accuracy on the testing sample. Finally I plot the results which tells me that the greatest accuracy appears around 100. This allows us to go back and update <span style="font-family: 'Courier New';">costs</span> to a more granular range around this value.</p>
<p>Re-running the code again using the new costs (10, 20, 50, 75, 100, 150, 200, 300, 500, 1000) I get the final chart shown below. This indicates that a cost of 50 gives best performance.</p>
<p><iframe class="displayr-embed" src="https://embedded.azureedge.net/75668/207841/797faa03-278b-4af6-98f5-383913acbaab.html?v=28297-90b5b92269-1357995370" width="450" frameborder="0" scrolling="no"></iframe></p>
<hr />
<p><span style="color: #3e7dcc; font-size: 20px;">TRY IT OUT</span><br />
The analysis in this post used R in Displayr. The <span style="font-family: 'Courier New';">flipMultivariates</span> package (available on <a href="https://github.com/Displayr/flipMultivariates" rel="nofollow" target="_blank">GitHub</a>), which uses the <span style="font-family: 'Courier New';">e1071 <span style="font-family: circular-book;">package, performed the calculations. You can try automatically fitting the <a href="https://app.displayr.com/Try/Automatically%20Fitting%20SVM%20Cost%20Parameter" rel="nofollow" target="_blank">Support Vector Machine Cost Parameter</a> yourself using the data in this example. </span></span></p>
<hr />

		<script type='text/javascript'>
		  var vglnk = { key: '949efb41171ac6ec1bf7f206d57e90b8' };

		  (function(d, t) {
			var s = d.createElement(t); s.type = 'text/javascript'; s.async = true;
			s.src = '//cdn.viglink.com/api/vglnk.js';
			var r = d.getElementsByTagName(t)[0]; r.parentNode.insertBefore(s, r);
		  }(document, 'script'));
		</script>		
		
<p class="syndicated-attribution">
<div style="border: 1px solid; background: none repeat scroll 0 0 #EDEDED; margin: 1px; font-size: 13px;">
<div style="text-align: center;">To <strong>leave a comment</strong> for the author, please follow the link and comment on their blog: <strong><a href="https://www.displayr.com/automatically-fitting-the-support-vector-machine-cost-parameter/"> R – Displayr</a></strong>.</div>
<hr />
<a href="https://www.r-bloggers.com/" rel="nofollow">R-bloggers.com</a> offers <strong><a href="https://feedburner.google.com/fb/a/mailverify?uri=RBloggers" rel="nofollow">daily e-mail updates</a></strong> about <a title="The R Project for Statistical Computing" href="https://www.r-project.org/" rel="nofollow">R</a> news and <a title="R tutorials" href="https://www.r-bloggers.com/search/tutorial" rel="nofollow">tutorials</a> on topics such as: <a title="Data science" href="https://www.r-bloggers.com/search/data%20science" rel="nofollow">Data science</a>, <a title="Big Data" href="https://www.r-bloggers.com/search/Big%20Data" rel="nofollow">Big Data, <a title="R jobs" href="https://www.r-users.com/" rel="nofollow">R jobs</a>, visualization (<a title="ggplot and ggplot2 tutorials" href="https://www.r-bloggers.com/search/ggplot2" rel="nofollow">ggplot2</a>, <a title="Boxplots using lattice and ggplot2 tutorials" href="https://www.r-bloggers.com/search/boxplot" rel="nofollow">Boxplots</a>, <a title="Maps and gis" href="https://www.r-bloggers.com/search/map" rel="nofollow">maps</a>, <a title="Animation in R" href="https://www.r-bloggers.com/search/animation" rel="nofollow">animation</a>), programming (<a title="RStudio IDE for R" href="https://www.r-bloggers.com/search/RStudio" rel="nofollow">RStudio</a>, <a title="Sweave and literate programming" href="https://www.r-bloggers.com/search/sweave" rel="nofollow">Sweave</a>, <a title="LaTeX in R" href="https://www.r-bloggers.com/search/LaTeX" rel="nofollow">LaTeX</a>, <a title="SQL and databases" href="https://www.r-bloggers.com/search/SQL" rel="nofollow">SQL</a>, <a title="Eclipse IDE for R" href="https://www.r-bloggers.com/search/eclipse" rel="nofollow">Eclipse</a>, <a title="git and github, Version Control System" href="https://www.r-bloggers.com/search/git" rel="nofollow">git</a>, <a title="Large data in R using Hadoop" href="https://www.r-bloggers.com/search/hadoop" rel="nofollow">hadoop</a>, <a title="Web Scraping of google, facebook, yahoo, twitter and more using R" href="https://www.r-bloggers.com/search/Web+Scraping" rel="nofollow">Web Scraping</a>) statistics (<a title="Regressions and ANOVA analysis tutorials" href="https://www.r-bloggers.com/search/regression" rel="nofollow">regression</a>, <a title="principal component analysis tutorial" href="https://www.r-bloggers.com/search/PCA" rel="nofollow">PCA</a>, <a title="Time series" href="https://www.r-bloggers.com/search/time+series" rel="nofollow">time series</a>, <a title="finance trading" href="https://www.r-bloggers.com/search/trading" rel="nofollow">trading</a>) and more...
</div></p><img src="http://feeds.feedburner.com/~r/RBloggers/~4/w4YrCjqcZ6M" height="1" width="1" alt=""/>]]></content:encoded>
	<enclosure url="" length="" type="" />
	<post-id xmlns="com-wordpress:feed-additions:1">153066</post-id>	<feedburner:origLink>https://www.r-bloggers.com/automatically-fitting-the-support-vector-machine-cost-parameter/</feedburner:origLink></item>
		<item>
		<title>Investigating Cryptocurrencies (Part II)</title>
		<link>http://feedproxy.google.com/~r/RBloggers/~3/2NQB5iWs554/</link>
		<pubDate>Tue, 18 Jul 2017 00:36:00 +0000</pubDate>
		<dc:creator><![CDATA[C]]></dc:creator>
				<category><![CDATA[R bloggers]]></category>

		<guid isPermaLink="false">https://www.r-bloggers.com/?guid=950cd5be49808743e3ad06e8750c7f02</guid>
		<description><![CDATA[
This is the second in a series of posts designed to show how the R programming language can be used with cryptocurrency related data sets.  A number of R packages are great for analyzing stocks and bonds and similar financial instruments.  These can also be applied to working with cryptocurrencies.  In this post we will focus on Bitcoin.Bitcoin has garnered enough attention that it is available through Yahoo's finance data under the symbol BTCUSD=X.  The quantmod package is comprised of a set of packages and utilities geared towards time series analysis traditionally associated with stocks.  You can load Bitcoin along with other Stock symbols using the loadSymbols function.  In this example we will also load AMD, which makes graphics cards used by cryptocurrency miners.library(quantmod)loadSymbols(c('BTCUSD=X','AMD'))If you have any issue downloading the data, make sure you update to the latest version of quantmod.  If all goes well, you will have two objects in your global environment named AMD and BTCUSD=X.ls()[1] "AMD"      "BTCUSD=X"You can plot AMD by simply passing it to the plot function.plot(AMD)
Bitcoin is slightly different simply because the symbol in use includes an equal sign.  To ensure that R evaluates the code properly, the symbol must be surrounded in back ticks.plot(`BTCUSD=X`)There is data missing for certain days.  There are other sources for cryptocurrency data which can be substituted if needed.  We will ignore this anomaly for the remainder of this post.  There is data for the last 4 weeks. We can construct a candle chart that focuses on this subset of data.chartSeries(`BTCUSD=X`, subset='last 4 weeks')
This chart can then be modified to include technical analysis - for instance Bollinger Bands.addBBands()
The capabilities of the quantmod package in an earlier post (see http://www.r-chart.com/2010/06/stock-analysis-using-r.html) where a listing of other functions that can be applied is included.Inasmuch as cryptocurrencies behave like traditional equities, they lend themselves to similar types of analysis.  The quantmod package is a great place to start when analyzing cryptocurrencies.   
]]></description>
				<content:encoded><![CDATA[<p class="syndicated-attribution"><div class="social4i" style="height:29px;"><div class="social4in" style="height:29px;float: left;"><div class="socialicons s4fblike" style="float:left;margin-right: 10px;"><div class="fb-like" data-href="https://www.r-bloggers.com/investigating-cryptocurrencies-part-ii/" data-send="true"  data-layout="button_count" data-width="100" data-height="21"  data-show-faces="false"></div></div><div class="socialicons s4linkedin" style="float:left;margin-right: 10px;"><script type="in/share" data-url="https://www.r-bloggers.com/investigating-cryptocurrencies-part-ii/" data-counter="right"></script></div></div><div style="clear:both"></div></div>

<div style="border: 1px solid; background: none repeat scroll 0 0 #EDEDED; margin: 1px; font-size: 12px;">
(This article was first published on  <strong><a href="http://www.r-chart.com/2017/07/investigating-cryptocurrencies-part-ii.html"> R-Chart</a></strong>, and kindly contributed to <a href="https://www.r-bloggers.com/" rel="nofollow">R-bloggers)</a>      
</div></p>
<div class="separator" style="clear: both; text-align: center;"><a href="https://i0.wp.com/2.bp.blogspot.com/-0XS1P2dD0-0/WWqX6R2OK_I/AAAAAAAAA4Q/Bmjjx9DztA8ktZZbryCB_zF8L3ekD5tJgCPcBGAYYCw/s1600/crypto_currencies.png?ssl=1" style="margin-left: 1em; margin-right: 1em;" rel="nofollow" target="_blank"><img border="0" data-original-height="88" data-original-width="258" src="https://i0.wp.com/2.bp.blogspot.com/-0XS1P2dD0-0/WWqX6R2OK_I/AAAAAAAAA4Q/Bmjjx9DztA8ktZZbryCB_zF8L3ekD5tJgCPcBGAYYCw/s1600/crypto_currencies.png?resize=258%2C88&#038;ssl=1" data-recalc-dims="1" /></a></div>
<p>This is the second in a series of posts designed to show how the R programming language can be used with cryptocurrency related data sets.  A number of R packages are great for analyzing stocks and bonds and similar financial instruments.  These can also be applied to working with cryptocurrencies.  In this post we will focus on Bitcoin.</p>
<p>Bitcoin has garnered enough attention that it is available through <a href="https://finance.yahoo.com/chart/BTCUSD%3DX" rel="nofollow" target="_blank">Yahoo’s finance data</a> under the symbol <b>BTCUSD=X</b>.  The quantmod package is comprised of a set of packages and utilities geared towards time series analysis traditionally associated with stocks.  You can load Bitcoin along with other Stock symbols using the loadSymbols function.  In this example we will also load AMD, which makes graphics cards used by cryptocurrency miners.</p>
<p><span style="font-family: Courier New, Courier, monospace;"><b>library(quantmod)</b></span><br /><span style="font-family: Courier New, Courier, monospace;"><b>loadSymbols(c(‘BTCUSD=X’,’AMD’))</b></span><br /><span style="font-family: Courier New, Courier, monospace;"><b><br /></b></span><span style="font-family: inherit;">If you have any issue downloading the data, make sure you update to the latest version of quantmod.  If all goes well, you will have two objects in your global environment named <b>AMD</b> and <b>BTCUSD=X</b>.</span><br /><span style="font-family: Courier New, Courier, monospace;"><b><br /></b></span><span style="font-family: Courier New, Courier, monospace;"><b>ls()</b></span><br /><span style="color: #666666; font-family: Courier New, Courier, monospace;"><b>[1] “AMD”      “BTCUSD=X”</b></span><br /><span style="font-family: Courier New, Courier, monospace;"><b><br /></b></span><span style="font-family: inherit;">You can plot AMD by simply passing it to the plot function.</span><br /><span style="font-family: inherit;"><br /></span><span style="font-family: Courier New, Courier, monospace;"><b>plot(AMD)</b></span><br /><span style="font-family: inherit;"><b><br /></b></span>
<div class="separator" style="clear: both; text-align: center;"><a href="https://i0.wp.com/2.bp.blogspot.com/-CI1SRjeUy9M/WW1OMBghU2I/AAAAAAAAA4s/Z-Rijsn7ckAYxB6b_X5IVQcwgQrVtEWDwCLcBGAs/s1600/AMD.png?ssl=1" style="margin-left: 1em; margin-right: 1em;" rel="nofollow" target="_blank"><img border="0" data-original-height="341" data-original-width="411" src="https://i0.wp.com/2.bp.blogspot.com/-CI1SRjeUy9M/WW1OMBghU2I/AAAAAAAAA4s/Z-Rijsn7ckAYxB6b_X5IVQcwgQrVtEWDwCLcBGAs/s400/AMD.png?resize=411%2C341&#038;ssl=1" data-recalc-dims="1" /></a></div>
<p><span style="font-family: inherit;">Bitcoin is slightly different simply because the symbol in use includes an equal sign.  To ensure that R evaluates the code properly, the symbol must be surrounded in back ticks.</span><br /><span style="font-family: inherit;"><br /></span><span style="font-family: Courier New, Courier, monospace;"><b>plot(`BTCUSD=X`)</b></span></p>
<div class="separator" style="clear: both; text-align: center;"><a href="https://i0.wp.com/1.bp.blogspot.com/-SQqQGE6As5I/WW1PCodBezI/AAAAAAAAA4w/K7_7oLi4TVoS-DlycJMAfEDI8hcBJyifQCLcBGAs/s1600/BTC.png?ssl=1" style="margin-left: 1em; margin-right: 1em;" rel="nofollow" target="_blank"><img border="0" data-original-height="341" data-original-width="411" src="https://i0.wp.com/1.bp.blogspot.com/-SQqQGE6As5I/WW1PCodBezI/AAAAAAAAA4w/K7_7oLi4TVoS-DlycJMAfEDI8hcBJyifQCLcBGAs/s400/BTC.png?resize=411%2C341&#038;ssl=1" data-recalc-dims="1" /></a></div>
<p>There is data missing for certain days.  There are other sources for cryptocurrency data which can be substituted if needed.  We will ignore this anomaly for the remainder of this post.  There is data for the last 4 weeks. We can construct a candle chart that focuses on this subset of data.</p>
<p><span style="font-family: Courier New, Courier, monospace;"><b>chartSeries(`BTCUSD=X`, subset=’last 4 weeks’)</b></span></p>
<div class="separator" style="clear: both; text-align: center;"><a href="https://i1.wp.com/2.bp.blogspot.com/-WTkxR3ws_XM/WW1WGWXQlpI/AAAAAAAAA5U/0WyOyifW9TQgrakaQeMBAfRtunVPzIdHwCLcBGAs/s1600/BTCUSD_Candle.png?ssl=1" style="margin-left: 1em; margin-right: 1em;" rel="nofollow" target="_blank"><img border="0" data-original-height="400" data-original-width="450" src="https://i1.wp.com/2.bp.blogspot.com/-WTkxR3ws_XM/WW1WGWXQlpI/AAAAAAAAA5U/0WyOyifW9TQgrakaQeMBAfRtunVPzIdHwCLcBGAs/s320/BTCUSD_Candle.png?resize=450%2C400&#038;ssl=1" data-recalc-dims="1" /></a></div>
<p>This chart can then be modified to include technical analysis – for instance Bollinger Bands.</p>
<p><span style="font-family: Courier New, Courier, monospace;"><b>addBBands()</b></span><br /><span style="font-family: Courier New, Courier, monospace;"><b><br /></b></span>
<div class="separator" style="clear: both; text-align: center;"><a href="https://i1.wp.com/3.bp.blogspot.com/-brJuv_EUEAc/WW1Wss5MYhI/AAAAAAAAA5Y/JGF-yNUVjVgRINa0DOMn5OewnfB29AVdwCLcBGAs/s1600/BTCUSD_CandleBB.png?ssl=1" style="margin-left: 1em; margin-right: 1em;" rel="nofollow" target="_blank"><img border="0" data-original-height="400" data-original-width="450" src="https://i1.wp.com/3.bp.blogspot.com/-brJuv_EUEAc/WW1Wss5MYhI/AAAAAAAAA5Y/JGF-yNUVjVgRINa0DOMn5OewnfB29AVdwCLcBGAs/s320/BTCUSD_CandleBB.png?resize=450%2C400&#038;ssl=1" data-recalc-dims="1" /></a></div>
<p><span style="font-family: Courier New, Courier, monospace;"><b><br /></b></span><span style="font-family: inherit;">The </span>capabilities<span style="font-family: inherit;"> of the quantmod package in an earlier post (see </span><a href="http://www.r-chart.com/2010/06/stock-analysis-using-r.html" rel="nofollow" target="_blank">http://www.r-chart.com/2010/06/stock-analysis-using-r.html</a><span style="font-family: inherit;">) where a listing of other functions that can be applied is included.</span><br /><span style="font-family: inherit;"><br /></span><span style="font-family: inherit;">Inasmuch as cryptocurrencies behave like traditional equities, they lend themselves to similar types of analysis.  The quantmod package is a great place to start when analyzing cryptocurrencies.  </span><b style="font-family: inherit;"> </b><br /><span style="font-family: Courier New, Courier, monospace;"><b><br /></b></span><span style="font-family: Courier New, Courier, monospace;"><b><br /></b></span></p>

		<script type='text/javascript'>
		  var vglnk = { key: '949efb41171ac6ec1bf7f206d57e90b8' };

		  (function(d, t) {
			var s = d.createElement(t); s.type = 'text/javascript'; s.async = true;
			s.src = '//cdn.viglink.com/api/vglnk.js';
			var r = d.getElementsByTagName(t)[0]; r.parentNode.insertBefore(s, r);
		  }(document, 'script'));
		</script>		
		
<p class="syndicated-attribution">
<div style="border: 1px solid; background: none repeat scroll 0 0 #EDEDED; margin: 1px; font-size: 13px;">
<div style="text-align: center;">To <strong>leave a comment</strong> for the author, please follow the link and comment on their blog: <strong><a href="http://www.r-chart.com/2017/07/investigating-cryptocurrencies-part-ii.html"> R-Chart</a></strong>.</div>
<hr />
<a href="https://www.r-bloggers.com/" rel="nofollow">R-bloggers.com</a> offers <strong><a href="https://feedburner.google.com/fb/a/mailverify?uri=RBloggers" rel="nofollow">daily e-mail updates</a></strong> about <a title="The R Project for Statistical Computing" href="https://www.r-project.org/" rel="nofollow">R</a> news and <a title="R tutorials" href="https://www.r-bloggers.com/search/tutorial" rel="nofollow">tutorials</a> on topics such as: <a title="Data science" href="https://www.r-bloggers.com/search/data%20science" rel="nofollow">Data science</a>, <a title="Big Data" href="https://www.r-bloggers.com/search/Big%20Data" rel="nofollow">Big Data, <a title="R jobs" href="https://www.r-users.com/" rel="nofollow">R jobs</a>, visualization (<a title="ggplot and ggplot2 tutorials" href="https://www.r-bloggers.com/search/ggplot2" rel="nofollow">ggplot2</a>, <a title="Boxplots using lattice and ggplot2 tutorials" href="https://www.r-bloggers.com/search/boxplot" rel="nofollow">Boxplots</a>, <a title="Maps and gis" href="https://www.r-bloggers.com/search/map" rel="nofollow">maps</a>, <a title="Animation in R" href="https://www.r-bloggers.com/search/animation" rel="nofollow">animation</a>), programming (<a title="RStudio IDE for R" href="https://www.r-bloggers.com/search/RStudio" rel="nofollow">RStudio</a>, <a title="Sweave and literate programming" href="https://www.r-bloggers.com/search/sweave" rel="nofollow">Sweave</a>, <a title="LaTeX in R" href="https://www.r-bloggers.com/search/LaTeX" rel="nofollow">LaTeX</a>, <a title="SQL and databases" href="https://www.r-bloggers.com/search/SQL" rel="nofollow">SQL</a>, <a title="Eclipse IDE for R" href="https://www.r-bloggers.com/search/eclipse" rel="nofollow">Eclipse</a>, <a title="git and github, Version Control System" href="https://www.r-bloggers.com/search/git" rel="nofollow">git</a>, <a title="Large data in R using Hadoop" href="https://www.r-bloggers.com/search/hadoop" rel="nofollow">hadoop</a>, <a title="Web Scraping of google, facebook, yahoo, twitter and more using R" href="https://www.r-bloggers.com/search/Web+Scraping" rel="nofollow">Web Scraping</a>) statistics (<a title="Regressions and ANOVA analysis tutorials" href="https://www.r-bloggers.com/search/regression" rel="nofollow">regression</a>, <a title="principal component analysis tutorial" href="https://www.r-bloggers.com/search/PCA" rel="nofollow">PCA</a>, <a title="Time series" href="https://www.r-bloggers.com/search/time+series" rel="nofollow">time series</a>, <a title="finance trading" href="https://www.r-bloggers.com/search/trading" rel="nofollow">trading</a>) and more...
</div></p><img src="http://feeds.feedburner.com/~r/RBloggers/~4/2NQB5iWs554" height="1" width="1" alt=""/>]]></content:encoded>
	<enclosure url="" length="" type="" />
	<post-id xmlns="com-wordpress:feed-additions:1">153053</post-id>	<feedburner:origLink>https://www.r-bloggers.com/investigating-cryptocurrencies-part-ii/</feedburner:origLink></item>
		<item>
		<title>Ecosystems chapter added to “Empirical software engineering using R”</title>
		<link>http://feedproxy.google.com/~r/RBloggers/~3/-0fc9tc7M6w/</link>
		<pubDate>Mon, 17 Jul 2017 23:05:30 +0000</pubDate>
		<dc:creator><![CDATA[Derek Jones]]></dc:creator>
				<category><![CDATA[R bloggers]]></category>

		<guid isPermaLink="false">http://shape-of-code.coding-guidelines.com/?p=3181</guid>
		<description><![CDATA[The Ecosystems chapter of my Empirical software engineering book has been added to the draft pdf (download here). I don’t seem to be able to get away from rewriting everything, despite working on the software engineering material for many years. Fortunately the sparsity of the data keeps me in check, but I keep finding new […]]]></description>
				<content:encoded><![CDATA[<p class="syndicated-attribution"><div class="social4i" style="height:29px;"><div class="social4in" style="height:29px;float: left;"><div class="socialicons s4fblike" style="float:left;margin-right: 10px;"><div class="fb-like" data-href="https://www.r-bloggers.com/ecosystems-chapter-added-to-empirical-software-engineering-using-r/" data-send="true"  data-layout="button_count" data-width="100" data-height="21"  data-show-faces="false"></div></div><div class="socialicons s4linkedin" style="float:left;margin-right: 10px;"><script type="in/share" data-url="https://www.r-bloggers.com/ecosystems-chapter-added-to-empirical-software-engineering-using-r/" data-counter="right"></script></div></div><div style="clear:both"></div></div>

<div style="border: 1px solid; background: none repeat scroll 0 0 #EDEDED; margin: 1px; font-size: 12px;">
(This article was first published on  <strong><a href="http://shape-of-code.coding-guidelines.com/2017/07/17/ecosystems-chapter-added-to-empirical-software-engineering-using-r/"> The Shape of Code » R</a></strong>, and kindly contributed to <a href="https://www.r-bloggers.com/" rel="nofollow">R-bloggers)</a>      
</div></p>
<p>The Ecosystems chapter of my Empirical software engineering book has been added to the draft pdf (<a href="http://www.knosof.co.uk/ESEUR" rel="nofollow" target="_blank">download here</a>).</p>
<p>I don’t seem to be able to get away from rewriting everything, despite working on the software engineering material for many years.  Fortunately the  sparsity of the data keeps me in check, but I keep finding new and interesting data (not a lot, but enough to slow me down).</p>
<p>There is still a lot of work to be done on the ecosystems chapter, not least integrating all the data I have been promised.  The basic threads are there, they just need filling out (assuming the promised data sets arrive).</p>
<p>I did not get any time to integrate in the developer and economics data received since those draft chapters were released; there has been some minor reorganization.</p>
<p>As always, if you know of any interesting software engineering data, please tell me.</p>
<p>I’m looking to rerun the <a href="http://shape-of-code.coding-guidelines.com/2015/09/25/workshop-on-analyzing-software-engineering-data/" rel="nofollow" target="_blank">workshop on analyzing software engineering data</a>.  If anybody has a venue in central London, that holds 30 or so people+projector, and is willing to make it available at no charge for a series of free workshops over several Saturdays, please get in touch.</p>
<p>Projects chapter next.</p>

		<script type='text/javascript'>
		  var vglnk = { key: '949efb41171ac6ec1bf7f206d57e90b8' };

		  (function(d, t) {
			var s = d.createElement(t); s.type = 'text/javascript'; s.async = true;
			s.src = '//cdn.viglink.com/api/vglnk.js';
			var r = d.getElementsByTagName(t)[0]; r.parentNode.insertBefore(s, r);
		  }(document, 'script'));
		</script>		
		
<p class="syndicated-attribution">
<div style="border: 1px solid; background: none repeat scroll 0 0 #EDEDED; margin: 1px; font-size: 13px;">
<div style="text-align: center;">To <strong>leave a comment</strong> for the author, please follow the link and comment on their blog: <strong><a href="http://shape-of-code.coding-guidelines.com/2017/07/17/ecosystems-chapter-added-to-empirical-software-engineering-using-r/"> The Shape of Code » R</a></strong>.</div>
<hr />
<a href="https://www.r-bloggers.com/" rel="nofollow">R-bloggers.com</a> offers <strong><a href="https://feedburner.google.com/fb/a/mailverify?uri=RBloggers" rel="nofollow">daily e-mail updates</a></strong> about <a title="The R Project for Statistical Computing" href="https://www.r-project.org/" rel="nofollow">R</a> news and <a title="R tutorials" href="https://www.r-bloggers.com/search/tutorial" rel="nofollow">tutorials</a> on topics such as: <a title="Data science" href="https://www.r-bloggers.com/search/data%20science" rel="nofollow">Data science</a>, <a title="Big Data" href="https://www.r-bloggers.com/search/Big%20Data" rel="nofollow">Big Data, <a title="R jobs" href="https://www.r-users.com/" rel="nofollow">R jobs</a>, visualization (<a title="ggplot and ggplot2 tutorials" href="https://www.r-bloggers.com/search/ggplot2" rel="nofollow">ggplot2</a>, <a title="Boxplots using lattice and ggplot2 tutorials" href="https://www.r-bloggers.com/search/boxplot" rel="nofollow">Boxplots</a>, <a title="Maps and gis" href="https://www.r-bloggers.com/search/map" rel="nofollow">maps</a>, <a title="Animation in R" href="https://www.r-bloggers.com/search/animation" rel="nofollow">animation</a>), programming (<a title="RStudio IDE for R" href="https://www.r-bloggers.com/search/RStudio" rel="nofollow">RStudio</a>, <a title="Sweave and literate programming" href="https://www.r-bloggers.com/search/sweave" rel="nofollow">Sweave</a>, <a title="LaTeX in R" href="https://www.r-bloggers.com/search/LaTeX" rel="nofollow">LaTeX</a>, <a title="SQL and databases" href="https://www.r-bloggers.com/search/SQL" rel="nofollow">SQL</a>, <a title="Eclipse IDE for R" href="https://www.r-bloggers.com/search/eclipse" rel="nofollow">Eclipse</a>, <a title="git and github, Version Control System" href="https://www.r-bloggers.com/search/git" rel="nofollow">git</a>, <a title="Large data in R using Hadoop" href="https://www.r-bloggers.com/search/hadoop" rel="nofollow">hadoop</a>, <a title="Web Scraping of google, facebook, yahoo, twitter and more using R" href="https://www.r-bloggers.com/search/Web+Scraping" rel="nofollow">Web Scraping</a>) statistics (<a title="Regressions and ANOVA analysis tutorials" href="https://www.r-bloggers.com/search/regression" rel="nofollow">regression</a>, <a title="principal component analysis tutorial" href="https://www.r-bloggers.com/search/PCA" rel="nofollow">PCA</a>, <a title="Time series" href="https://www.r-bloggers.com/search/time+series" rel="nofollow">time series</a>, <a title="finance trading" href="https://www.r-bloggers.com/search/trading" rel="nofollow">trading</a>) and more...
</div></p><img src="http://feeds.feedburner.com/~r/RBloggers/~4/-0fc9tc7M6w" height="1" width="1" alt=""/>]]></content:encoded>
	<enclosure url="" length="" type="" />
	<post-id xmlns="com-wordpress:feed-additions:1">153062</post-id>	<feedburner:origLink>https://www.r-bloggers.com/ecosystems-chapter-added-to-empirical-software-engineering-using-r/</feedburner:origLink></item>
		<item>
		<title>Ten-HUT! The Apache Drill R interface package — sergeant — is now on CRAN</title>
		<link>http://feedproxy.google.com/~r/RBloggers/~3/OXx1lshh3_I/</link>
		<pubDate>Mon, 17 Jul 2017 22:54:57 +0000</pubDate>
		<dc:creator><![CDATA[hrbrmstr]]></dc:creator>
				<category><![CDATA[R bloggers]]></category>

		<guid isPermaLink="false">https://rud.is/b/?p=6111</guid>
		<description><![CDATA[I’m extremely pleased to announce that the sergeant package is now on CRAN or will be hitting your local CRAN mirror soon. sergeant provides JDBC, DBI and dplyr/dbplyr interfaces to Apache Drill. I’ve also wrapped a few goodies into the dplyr custom functions that work with Drill and if you have Drill UDFs that don’t... Continue reading →
]]></description>
				<content:encoded><![CDATA[<p class="syndicated-attribution"><div class="social4i" style="height:29px;"><div class="social4in" style="height:29px;float: left;"><div class="socialicons s4fblike" style="float:left;margin-right: 10px;"><div class="fb-like" data-href="https://www.r-bloggers.com/ten-hut-the-apache-drill-r-interface-package-sergeant-is-now-on-cran/" data-send="true"  data-layout="button_count" data-width="100" data-height="21"  data-show-faces="false"></div></div><div class="socialicons s4linkedin" style="float:left;margin-right: 10px;"><script type="in/share" data-url="https://www.r-bloggers.com/ten-hut-the-apache-drill-r-interface-package-sergeant-is-now-on-cran/" data-counter="right"></script></div></div><div style="clear:both"></div></div>

<div style="border: 1px solid; background: none repeat scroll 0 0 #EDEDED; margin: 1px; font-size: 12px;">
(This article was first published on  <strong><a href="https://rud.is/b/2017/07/17/ten-hut-the-apache-drill-r-interface-package-sergeant-is-now-on-cran/"> R – rud.is</a></strong>, and kindly contributed to <a href="https://www.r-bloggers.com/" rel="nofollow">R-bloggers)</a>      
</div></p>
<p>I’m extremely pleased to announce that the <a href="https://github.com/hrbrmstr/sergeant" rel="nofollow" target="_blank"><code>sergeant</code></a> package is now on CRAN or will be hitting your local CRAN mirror soon.</p>
<p><code>sergeant</code> provides JDBC, DBI and <code>dplyr</code>/<code>dbplyr</code> interfaces to <a href="https://drill.apache.org/" rel="nofollow" target="_blank">Apache Drill</a>. I’ve also wrapped a few goodies into the <code>dplyr</code> custom functions that work with Drill and if you have Drill UDFs that don’t work “out of the box” with <code>sergeant</code>‘s <code>dplyr</code> interface, file an issue and I’ll make a special one for it in the package.</p>
<p>I’ve written about drill <a href="https://rud.is/b/category/drill/" rel="nofollow" target="_blank">on the blog before</a> so check out those posts for some history and stay tuned for more examples. The <a href="https://github.com/hrbrmstr/sergeant/blob/master/README.md" rel="nofollow" target="_blank">README</a> should get you started using <code>sergeant</code> and/or Drill (if you aren’t running Drill now, take a look and you’ll likely get hooked).</p>
<p>I’d like to take a moment to call out special thanks to <a href="https://github.com/alistaire47" rel="nofollow" target="_blank">Edward Visel</a> for bootstrapping the <code>dbplyr</code> update to <code>sergeant</code> when the <code>dplyr</code>/<code>dbplyr</code> interfaces split. It saved me loads of time and really helped the progress of this package move faster towards a CRAN release.</p>

		<script type='text/javascript'>
		  var vglnk = { key: '949efb41171ac6ec1bf7f206d57e90b8' };

		  (function(d, t) {
			var s = d.createElement(t); s.type = 'text/javascript'; s.async = true;
			s.src = '//cdn.viglink.com/api/vglnk.js';
			var r = d.getElementsByTagName(t)[0]; r.parentNode.insertBefore(s, r);
		  }(document, 'script'));
		</script>		
		
<p class="syndicated-attribution">
<div style="border: 1px solid; background: none repeat scroll 0 0 #EDEDED; margin: 1px; font-size: 13px;">
<div style="text-align: center;">To <strong>leave a comment</strong> for the author, please follow the link and comment on their blog: <strong><a href="https://rud.is/b/2017/07/17/ten-hut-the-apache-drill-r-interface-package-sergeant-is-now-on-cran/"> R – rud.is</a></strong>.</div>
<hr />
<a href="https://www.r-bloggers.com/" rel="nofollow">R-bloggers.com</a> offers <strong><a href="https://feedburner.google.com/fb/a/mailverify?uri=RBloggers" rel="nofollow">daily e-mail updates</a></strong> about <a title="The R Project for Statistical Computing" href="https://www.r-project.org/" rel="nofollow">R</a> news and <a title="R tutorials" href="https://www.r-bloggers.com/search/tutorial" rel="nofollow">tutorials</a> on topics such as: <a title="Data science" href="https://www.r-bloggers.com/search/data%20science" rel="nofollow">Data science</a>, <a title="Big Data" href="https://www.r-bloggers.com/search/Big%20Data" rel="nofollow">Big Data, <a title="R jobs" href="https://www.r-users.com/" rel="nofollow">R jobs</a>, visualization (<a title="ggplot and ggplot2 tutorials" href="https://www.r-bloggers.com/search/ggplot2" rel="nofollow">ggplot2</a>, <a title="Boxplots using lattice and ggplot2 tutorials" href="https://www.r-bloggers.com/search/boxplot" rel="nofollow">Boxplots</a>, <a title="Maps and gis" href="https://www.r-bloggers.com/search/map" rel="nofollow">maps</a>, <a title="Animation in R" href="https://www.r-bloggers.com/search/animation" rel="nofollow">animation</a>), programming (<a title="RStudio IDE for R" href="https://www.r-bloggers.com/search/RStudio" rel="nofollow">RStudio</a>, <a title="Sweave and literate programming" href="https://www.r-bloggers.com/search/sweave" rel="nofollow">Sweave</a>, <a title="LaTeX in R" href="https://www.r-bloggers.com/search/LaTeX" rel="nofollow">LaTeX</a>, <a title="SQL and databases" href="https://www.r-bloggers.com/search/SQL" rel="nofollow">SQL</a>, <a title="Eclipse IDE for R" href="https://www.r-bloggers.com/search/eclipse" rel="nofollow">Eclipse</a>, <a title="git and github, Version Control System" href="https://www.r-bloggers.com/search/git" rel="nofollow">git</a>, <a title="Large data in R using Hadoop" href="https://www.r-bloggers.com/search/hadoop" rel="nofollow">hadoop</a>, <a title="Web Scraping of google, facebook, yahoo, twitter and more using R" href="https://www.r-bloggers.com/search/Web+Scraping" rel="nofollow">Web Scraping</a>) statistics (<a title="Regressions and ANOVA analysis tutorials" href="https://www.r-bloggers.com/search/regression" rel="nofollow">regression</a>, <a title="principal component analysis tutorial" href="https://www.r-bloggers.com/search/PCA" rel="nofollow">PCA</a>, <a title="Time series" href="https://www.r-bloggers.com/search/time+series" rel="nofollow">time series</a>, <a title="finance trading" href="https://www.r-bloggers.com/search/trading" rel="nofollow">trading</a>) and more...
</div></p><img src="http://feeds.feedburner.com/~r/RBloggers/~4/OXx1lshh3_I" height="1" width="1" alt=""/>]]></content:encoded>
	<enclosure url="" length="" type="" />
	<post-id xmlns="com-wordpress:feed-additions:1">153058</post-id>	<feedburner:origLink>https://www.r-bloggers.com/ten-hut-the-apache-drill-r-interface-package-sergeant-is-now-on-cran/</feedburner:origLink></item>
		<item>
		<title>Revisiting the useR!2017 conference: Recordings now available</title>
		<link>http://feedproxy.google.com/~r/RBloggers/~3/iRS2WtqDads/</link>
		<pubDate>Mon, 17 Jul 2017 20:12:50 +0000</pubDate>
		<dc:creator><![CDATA[David Smith]]></dc:creator>
				<category><![CDATA[R bloggers]]></category>

		<guid isPermaLink="false">https://www.r-bloggers.com/?guid=53ac8247fd8c073f9a3e5396d72395ee</guid>
		<description><![CDATA[The annual useR!2017 conference took place July 4-7 in Brussels, and in every dimension it was the best yet. It was the largest (with over 1,100 R users from around the world in attendance), and yet still very smoothly run with many amazing talks and lots of fun for everyone. If you weren't able to make it to Brussels, take a look at these recaps from Nick Strayer &#038; Lucy D'Agostino McGowan, Once Upon Data and DataCamp to get a sense of what it was like, or simply take a look at this recap video: From my personal point of...]]></description>
				<content:encoded><![CDATA[<p class="syndicated-attribution"><div class="social4i" style="height:29px;"><div class="social4in" style="height:29px;float: left;"><div class="socialicons s4fblike" style="float:left;margin-right: 10px;"><div class="fb-like" data-href="https://www.r-bloggers.com/revisiting-the-user2017-conference-recordings-now-available/" data-send="true"  data-layout="button_count" data-width="100" data-height="21"  data-show-faces="false"></div></div><div class="socialicons s4linkedin" style="float:left;margin-right: 10px;"><script type="in/share" data-url="https://www.r-bloggers.com/revisiting-the-user2017-conference-recordings-now-available/" data-counter="right"></script></div></div><div style="clear:both"></div></div>

<div style="border: 1px solid; background: none repeat scroll 0 0 #EDEDED; margin: 1px; font-size: 12px;">
(This article was first published on  <strong><a href="http://blog.revolutionanalytics.com/2017/07/revisiting-user2017.html"> Revolutions</a></strong>, and kindly contributed to <a href="https://www.r-bloggers.com/" rel="nofollow">R-bloggers)</a>      
</div></p>
<div>
<p>The annual useR!2017 conference took place July 4-7 in Brussels, and in every dimension it was the best yet. It was the largest (with over 1,100 R users from around the world in attendance), and yet still very smoothly run with many amazing talks and lots of fun for everyone. If you weren&#039;t able to make it to Brussels, take a look at these recaps from <a href="http://livefreeordichotomize.com/2017/07/12/user-rundown/" rel="nofollow" target="_blank">Nick Strayer & Lucy D&#039;Agostino McGowan</a>, <a href="http://www.onceupondata.com/2017/07/12/user-2017/" rel="nofollow" target="_blank">Once Upon Data</a> and <a href="https://www.datacamp.com/community/blog/user-2017-in-retrospect#gs.1TQSZdw?tap_a=5644-dce66f&tap_s=10907-287229" rel="nofollow" target="_blank">DataCamp</a> to get a sense of what it was like, or simply take a look at this recap video:</p>
<p class="asset-video"><iframe allowfullscreen="" frameborder="0" height="281" src="https://www.youtube.com/embed/YWF6nbUTRao?rel=0" width="450"></iframe></p>
<p class="asset-video">From my personal point of view, if I were to try and capture user!2017 in just one word, it would be: <strong>vibrant</strong>. With so many first-time attendees, an atmosphere of excitement was everywhere, and the conference was noticeably much more <a href="https://channel9.msdn.com/Events/useR-international-R-User-conferences/useR-International-R-User-2017-Conference/Diversity-of-the-R-Community" rel="nofollow" target="_blank">diverse</a> than in prior years — a really positive development. Kudos to the organizers for their focus on making useR!2017 a <a href="https://twitter.com/olga_mie/status/883301504991547396" rel="nofollow" target="_blank">welcoming and inclusive</a> conference, and a special shout-out to the <a href="https://twitter.com/RLadiesGlobal/status/882561056752754689" rel="nofollow" target="_blank">R-Ladies community</a> for encouraging and inspiring so many. I especially enjoyed meeting the diversity scholars and being a part of the special <a href="https://user2017.brussels/schedule" rel="nofollow" target="_blank">beginner&#039;s session</a> held before the conference officially began (and so sadly unrecorded). Judging from the 200+ attendees reactions there, many welcomed getting a jump-start on the R project, its community, and how best to participate and contribute.</p>
<p>The diversity was reflected in the content, too, with a great mix of tutorials, keynotes and talks on R packages, R applications, the R community and ecosystem, and the R project itself. With thanks to Microsoft, all of this material was recorded, andis now available to view on Channel 9: </p>
<p style="text-align: center;"><strong>useR!2017 Recordings</strong>: <a href="https://channel9.msdn.com/Events/useR-international-R-User-conferences/useR-International-R-User-2017-Conference" rel="nofollow" target="_blank">useR! International R User 2017 Conference</a></p>
<p>All recordings are streamable and downloadable, and are shared under a <a href="https://creativecommons.org/licenses/by-nc-nd/3.0/" rel="nofollow" target="_blank">Creative Commons license</a>. (Note: a few talks are still in the editing room awaiting posting, but all the content should be available at the link above by July 21.) In many cases, you can also find slides in the sessions listed in the <a href="https://user2017.brussels/schedule" rel="nofollow" target="_blank">useR!2017 schedule</a>. </p>
<p>With around 300 videos it might be tricky to find the one you want, but you can use the Filters button to reveal a search tool, and you can also filter by specific speakers:</p>
<p><a class="asset-img-link" href="https://channel9.msdn.com/Events/useR-international-R-User-conferences/useR-International-R-User-2017-Conference" style="display: inline;" rel="nofollow" target="_blank"><img alt="Filters" border="0" class="asset  asset-image at-xid-6a010534b1db25970b01b8d296ea52970c image-full img-responsive" src="http://revolution-computing.typepad.com/.a/6a010534b1db25970b01b8d296ea52970c-800wi" title="Filters"></a></p>
<p>Here are a few searches you might find useful:</p>
<ul>
<li><a href="https://channel9.msdn.com/Events/useR-international-R-User-conferences/useR-International-R-User-2017-Conference?sort=status&direction=desc&d=0&term=" rel="nofollow" target="_blank">Tutorials</a> (Day 0)</li>
<li><a href="https://channel9.msdn.com/Events/useR-international-R-User-conferences/useR-International-R-User-2017-Conference?sort=status&direction=desc&term=keynote" rel="nofollow" target="_blank">Keynote presentations</a></li>
<li><a href="https://channel9.msdn.com/Events/useR-international-R-User-conferences/useR-International-R-User-2017-Conference?sort=status&direction=desc&term=lightning" rel="nofollow" target="_blank">Lightning talk sessions</a></li>
<li><a href="https://channel9.msdn.com/Events/useR-international-R-User-conferences/useR-International-R-User-2017-Conference?sort=status&direction=desc&term=sponsor" rel="nofollow" target="_blank">Sponsor presentations</a></li>
</ul>
<p>Next year&#039;s useR! conference, <a href="https://user2018.r-project.org/" rel="nofollow" target="_blank">useR!2018</a>, will be held July 10-13 in Brisbane, Australia. The organizers have opened a <a href="https://user2018.r-project.org/blog/2017/06/14/survey/" rel="nofollow" target="_blank">survey on useR!2018</a> to give the R community an opportunity to make suggestions on the content. If you have ideas for tutorial topics and presenters, keynote speakers, services like child care, or sign language interpreters, or how scholarships should be awarded, please do contribute your ideas.</p>
<p>Looking even further out, useR!2019 will be in Toulouse (France), and useR!2020 will be in Boston (USA). That&#039;s a lot to be looking forward to, and with useR!2017 setting such a high a high bar I&#039;m sure these will be outstanding conferences as well. See you there! </p>
<p> </p>
</div>

		<script type='text/javascript'>
		  var vglnk = { key: '949efb41171ac6ec1bf7f206d57e90b8' };

		  (function(d, t) {
			var s = d.createElement(t); s.type = 'text/javascript'; s.async = true;
			s.src = '//cdn.viglink.com/api/vglnk.js';
			var r = d.getElementsByTagName(t)[0]; r.parentNode.insertBefore(s, r);
		  }(document, 'script'));
		</script>		
		
<p class="syndicated-attribution">
<div style="border: 1px solid; background: none repeat scroll 0 0 #EDEDED; margin: 1px; font-size: 13px;">
<div style="text-align: center;">To <strong>leave a comment</strong> for the author, please follow the link and comment on their blog: <strong><a href="http://blog.revolutionanalytics.com/2017/07/revisiting-user2017.html"> Revolutions</a></strong>.</div>
<hr />
<a href="https://www.r-bloggers.com/" rel="nofollow">R-bloggers.com</a> offers <strong><a href="https://feedburner.google.com/fb/a/mailverify?uri=RBloggers" rel="nofollow">daily e-mail updates</a></strong> about <a title="The R Project for Statistical Computing" href="https://www.r-project.org/" rel="nofollow">R</a> news and <a title="R tutorials" href="https://www.r-bloggers.com/search/tutorial" rel="nofollow">tutorials</a> on topics such as: <a title="Data science" href="https://www.r-bloggers.com/search/data%20science" rel="nofollow">Data science</a>, <a title="Big Data" href="https://www.r-bloggers.com/search/Big%20Data" rel="nofollow">Big Data, <a title="R jobs" href="https://www.r-users.com/" rel="nofollow">R jobs</a>, visualization (<a title="ggplot and ggplot2 tutorials" href="https://www.r-bloggers.com/search/ggplot2" rel="nofollow">ggplot2</a>, <a title="Boxplots using lattice and ggplot2 tutorials" href="https://www.r-bloggers.com/search/boxplot" rel="nofollow">Boxplots</a>, <a title="Maps and gis" href="https://www.r-bloggers.com/search/map" rel="nofollow">maps</a>, <a title="Animation in R" href="https://www.r-bloggers.com/search/animation" rel="nofollow">animation</a>), programming (<a title="RStudio IDE for R" href="https://www.r-bloggers.com/search/RStudio" rel="nofollow">RStudio</a>, <a title="Sweave and literate programming" href="https://www.r-bloggers.com/search/sweave" rel="nofollow">Sweave</a>, <a title="LaTeX in R" href="https://www.r-bloggers.com/search/LaTeX" rel="nofollow">LaTeX</a>, <a title="SQL and databases" href="https://www.r-bloggers.com/search/SQL" rel="nofollow">SQL</a>, <a title="Eclipse IDE for R" href="https://www.r-bloggers.com/search/eclipse" rel="nofollow">Eclipse</a>, <a title="git and github, Version Control System" href="https://www.r-bloggers.com/search/git" rel="nofollow">git</a>, <a title="Large data in R using Hadoop" href="https://www.r-bloggers.com/search/hadoop" rel="nofollow">hadoop</a>, <a title="Web Scraping of google, facebook, yahoo, twitter and more using R" href="https://www.r-bloggers.com/search/Web+Scraping" rel="nofollow">Web Scraping</a>) statistics (<a title="Regressions and ANOVA analysis tutorials" href="https://www.r-bloggers.com/search/regression" rel="nofollow">regression</a>, <a title="principal component analysis tutorial" href="https://www.r-bloggers.com/search/PCA" rel="nofollow">PCA</a>, <a title="Time series" href="https://www.r-bloggers.com/search/time+series" rel="nofollow">time series</a>, <a title="finance trading" href="https://www.r-bloggers.com/search/trading" rel="nofollow">trading</a>) and more...
</div></p><img src="http://feeds.feedburner.com/~r/RBloggers/~4/iRS2WtqDads" height="1" width="1" alt=""/>]]></content:encoded>
	<enclosure url="" length="" type="" />
	<post-id xmlns="com-wordpress:feed-additions:1">153051</post-id>	<feedburner:origLink>https://www.r-bloggers.com/revisiting-the-user2017-conference-recordings-now-available/</feedburner:origLink></item>
		<item>
		<title>Twitter analysis using R (Semantic analysis of French elections)</title>
		<link>http://feedproxy.google.com/~r/RBloggers/~3/hgMmkMnhfBQ/</link>
		<pubDate>Mon, 17 Jul 2017 17:51:47 +0000</pubDate>
		<dc:creator><![CDATA[EnhanceDataScience]]></dc:creator>
				<category><![CDATA[R bloggers]]></category>

		<guid isPermaLink="false">http://enhancedatascience.com/?p=2114</guid>
		<description><![CDATA[
Last month the French elections viewed through Twitter: a semantic analysis post showed how the two contenders were perceived on Twitter during three key events of the campaign (Macron leaks, presidential debate and election day). The goal of the post is to show how to perform this twitter analysis using R. Collecting tweets in real time with […]
The post Twitter analysis using R (Semantic analysis of French elections) appeared first on Enhance Data Science.
]]></description>
				<content:encoded><![CDATA[<p class="syndicated-attribution"><div class="social4i" style="height:29px;"><div class="social4in" style="height:29px;float: left;"><div class="socialicons s4fblike" style="float:left;margin-right: 10px;"><div class="fb-like" data-href="https://www.r-bloggers.com/twitter-analysis-using-r-semantic-analysis-of-french-elections/" data-send="true"  data-layout="button_count" data-width="100" data-height="21"  data-show-faces="false"></div></div><div class="socialicons s4linkedin" style="float:left;margin-right: 10px;"><script type="in/share" data-url="https://www.r-bloggers.com/twitter-analysis-using-r-semantic-analysis-of-french-elections/" data-counter="right"></script></div></div><div style="clear:both"></div></div>

<div style="border: 1px solid; background: none repeat scroll 0 0 #EDEDED; margin: 1px; font-size: 12px;">
(This article was first published on  <strong><a href="http://enhancedatascience.com/2017/07/17/twitter-analysis-using-r/"> Enhance Data Science</a></strong>, and kindly contributed to <a href="https://www.r-bloggers.com/" rel="nofollow">R-bloggers)</a>      
</div></p>
<p>Last month the <a href="http://enhancedatascience.com/2017/05/17/french-election-viewed-through-twitter-a-semantic-analysis/" rel="nofollow" target="_blank">French elections viewed through Twitter: a semantic analysis </a>post showed how the two contenders were perceived on Twitter during three key events of the campaign (Macron leaks, presidential debate and election day). The goal of the post is to show how to perform this twitter analysis using R.</p>
<h2>Collecting tweets in real time with streamR (Twitter streaming API)</h2>
<p>To perform the analysis, I needed an important number of tweets and I wanted to use all of the tweets concerning the election. The Twitter search API is limited since you only have access to a sample of tweets. On the other hand, the streaming API allows you to collect the data in real-time and to collect almost all tweets. Hence, I used the streamR package.</p>
<p>So, I collected tweets on 60 seconds batch and saved them on .json files. The use of batches instead of one large file is to improve RAM consumption (Instead of reading and then subsetting one large file, you can do the subset on each of the batches and then merge them). Here is the code to collect the data with streamR.</p>
<pre class="brush: r; title: ; notranslate">
###Loading my twitter credentials
load("oauth.Rdata")
##Collecting data
require('streamR')
i=1
while(TRUE)
{
 i=i+1
 filterStream( file=paste0("tweet_macronleaks/tweets_rstats",i,".json"),
 track=c("#MacronLeaks"), timeout=60, oauth=my_oauth,language = 'fr')
}
</pre>
<p>The code is doing an infinite loop (stopped manually), the filterStream function filters the Twitter stream according to the defined filter. Here, we only take the tweets containing #MacronLeaks which are in French.</p>
<h2>Tweets cleaning and pre-processing</h2>
<p>Now that the tweets are collected, they need to be cleaned and pre-processed. A raw tweet will contain links, tabulation, @, #, double spaces,  … that will influence the analysis. It will also contain stop words (stop words are very frequent words in the language such as ‘and’, ‘or, ‘with’, …).<br />
In addition to this, some tweets are retweeted (sometimes a lot) and may change the words and text distribution. Enough of the RT are kept to show that some tweets are more popular than others but most of them are removed to avoid them standing too much out of the crowd.</p>
<p>First, the saved tweets need to be read and merged:</p>
<pre class="brush: r; title: ; notranslate">
require(data.table)
data.tweet=NULL
i=1
while(TRUE)
{
 i=i+1
 print(i)
 print(paste0("tweet_macronleaks/tweets_rstats",i,".json"))

if (is.null(data.tweet))
 data.tweet=data.table(parseTweets(paste0("tweet_macronleaks/tweets_rstats",i,".json")))
 else
 data.tweet=rbind(data.tweet,data.table(parseTweets(paste0("tweet_macronleaks/tweets_rstats",i,".json"))))
}
</pre>
<p>Then we only keep some of the RT. The retweet count is the indices of a given retweet, hence we only keep log(1+n) of the RT.</p>
<pre class="brush: r; title: ; notranslate">
data.tweet[,min_RT:=min(retweet_count),by=text]
data.tweet[,max_RT:=max(retweet_count),by=text]
data.tweet=data.tweet[lang=='fr',]
data.tweet=data.tweet[retweet_count&amp;lt;=min_RT+log(max_RT-min_RT+1),]
</pre>
<p>Then, the text can be cleaned using function from the tm package</p>
<pre class="brush: r; title: ; notranslate">
###Unaccent and clean the text
Unaccent &lt;- function(x) {
 x=tolower(x)
 x = gsub("@\\w+", "", x) 
 x = gsub("[[:punct:]]", " ", x)
 x = gsub("[ |\t]{2,}", " ", x) 
 x = gsub("^ ", " ", x) 
 x = gsub("http\\w+", " ", x) 
 x=tolower(x)
 x=gsub('_',' ',x,fixed=T)
 x
 
}
require(tm)
###Remove accents
data.tweet$text=Unaccent(iconv(data.tweet$text,from="UTF-8",to="ASCII//TRANSLIT"))
##Remove top words
data.tweet$text=removeWords(data.tweet$text,c('rt','a',stopwords('fr'),'e','co','pr'))
##Remove double whitespaces
data.tweet$text=stripWhitespace(data.tweet$text)
</pre>
<h2>Tokenization and creation of the vocabulary</h2>
<p>Now that the tweets have been cleaned, they can be <a href="https://en.wikipedia.org/wiki/Tokenization_(lexical_analysis)" rel="nofollow" target="_blank">tokenized</a>. During this step, each tweet will be split into tokens of its different words, here each word corresponds to a token.</p>
<pre class="brush: r; title: ; notranslate">
# Create iterator over tokens
tokens &lt;- space_tokenizer(data.tweet$text)
it = itoken(tokens, progressbar = FALSE)
</pre>
<p>Now a vocabulary can be created (it is a “summary” of the words distribution) based on the corpus. Then the vocabulary is pruned (very common and rare words are removed).</p>
<pre class="brush: r; title: ; notranslate">
vocab = create_vocabulary(it)

vocab = prune_vocabulary(vocab,
 term_count_min = 5, 
 doc_proportion_max = 0.4,
 doc_proportion_min = 0.0005)
vectorizer = vocab_vectorizer(vocab, 
 grow_dtm = FALSE, 
 skip_grams_window = 5L)

tcm = create_tcm(it, vectorizer)
</pre>
<p>Now, we can create the word embedding, in this example, I used a <a href="https://nlp.stanford.edu/pubs/glove.pdf" rel="nofollow" target="_blank">glove embedding</a> to learn vectors representations of the words. The new vector space has around 200 dimensions.</p>
<pre class="brush: r; title: ; notranslate">
glove = GlobalVectors$new(word_vectors_size = 200, vocabulary = vocab, x_max = 100)
glove$fit(tcm, n_iter = 200)
word_vectors &lt;- glove$get_word_vectors()
</pre>
<h2>How to finish our twitter analysis with Tsne</h2>
<p>Now that the words are vectors, we would like to plot them in two dimensions to show the meaning of the words in an appealing (and understandable) way. The number of dimension needs to be reduced to two, to do so, we will use T-sne. <a href="https://lvdmaaten.github.io/tsne/" rel="nofollow" target="_blank">T-sne</a> is a non-parametric dimensionality reduction algorithm and tends to perform well on word embedding. R has a package (actually two) to perform Tsne, we will use the most recent one  <a href="https://cran.r-project.org/web/packages/Rtsne/Rtsne.pdf" rel="nofollow" target="_blank">Rtsne.</a><br />
To avoid overcrowding in our plot and reduce computing time, only words with more than 50 appearances will be used.</p>
<pre class="brush: r; title: ; notranslate">
require('Rtsne')
set.seed(123)
word_vectors_sne=word_vectors[which(vocab$vocab$doc_counts&gt;50&amp;!rownames(word_vectors)%in%stopwords('fr')),]
tsne_out=Rtsne(word_vectors_sne,perplexity =2,initial_dims = 200,dims = 2)
DF_proj=data.frame(x=tsne_out$Y[,1],y=tsne_out$Y[,2],word=rownames(word_vectors_sne))
</pre>
<p>Now that the projection in 2 dimensions has been done, to color the plot we’d like to know which contenders is assigned to each word. To do so, a dictionary is created with the names and pseudo of each of the contenders and the distance from every word to each of these pseudos is computed.<br />
For instance, to assign a candidate to the word ‘democracy’, the minimum distance between ‘democracy’ and ‘mlp’, ‘marine’, ‘fn will be computed. The same thing will be done between ‘democracy’ and ‘macron’, ’em’, ’emmarche’. If the first distance is the smallest then ‘democracy’ will be assigned to Marine Le Pen, otherwise, it will be assigned to Emmanuel Macron.</p>
<pre class="brush: r; title: ; notranslate">
require(ggplot2)
require(ggrepel)
DF_proj=data.table(DF_proj)
DF_proj$count=vocab$vocab$doc_counts[which(vocab$vocab$doc_counts&gt;500&amp; !(rownames(word_vectors)%in%stopwords('fr')))]
DF_proj=DF_proj[word!='NA']

distance_to_candidat=function(word_vectors,words_list,word_in)
{
 max(sim2(word_vectors[words_list,,drop=F],word_vectors[word_in,,drop=F]))
}
closest_candidat=function(word_vectors,mot_in)
{
 mot_le_pen=c('marine','pen','lepen','fn','mlp')
 mot_macron=c('macron','emmanuel','em','enmarche','emmanuelmacron')
 dist_le_pen=distance_to_candidat(word_vectors,mot_le_pen,mot_in)
 dist_macron=distance_to_candidat(word_vectors,mot_macron,mot_in)
 if (dist_le_pen&gt;dist_macron)
 'Le Pen'
 else
 'Macron'
}
DF_proj[,word:=as.character(word)]
DF_proj=DF_proj[word!=""]
DF_proj[,Candidat:=closest_candidat(word_vectors,word),by=word]

require(plotly)
gg=ggplot(DF_proj,aes(x,y,label=word,color=Candidat))+geom_text(aes(size=sqrt(count+1)))
ggplotly(gg)
</pre>
<p>You can get our latest news on Twitter:</p>
<p> </p>
<p> </p>
<p>The post <a rel="nofollow" href="http://enhancedatascience.com/2017/07/17/twitter-analysis-using-r/" target="_blank">Twitter analysis using R (Semantic analysis of French elections)</a> appeared first on <a rel="nofollow" href="http://enhancedatascience.com/" target="_blank">Enhance Data Science</a>.</p>

		<script type='text/javascript'>
		  var vglnk = { key: '949efb41171ac6ec1bf7f206d57e90b8' };

		  (function(d, t) {
			var s = d.createElement(t); s.type = 'text/javascript'; s.async = true;
			s.src = '//cdn.viglink.com/api/vglnk.js';
			var r = d.getElementsByTagName(t)[0]; r.parentNode.insertBefore(s, r);
		  }(document, 'script'));
		</script>		
		
<p class="syndicated-attribution">
<div style="border: 1px solid; background: none repeat scroll 0 0 #EDEDED; margin: 1px; font-size: 13px;">
<div style="text-align: center;">To <strong>leave a comment</strong> for the author, please follow the link and comment on their blog: <strong><a href="http://enhancedatascience.com/2017/07/17/twitter-analysis-using-r/"> Enhance Data Science</a></strong>.</div>
<hr />
<a href="https://www.r-bloggers.com/" rel="nofollow">R-bloggers.com</a> offers <strong><a href="https://feedburner.google.com/fb/a/mailverify?uri=RBloggers" rel="nofollow">daily e-mail updates</a></strong> about <a title="The R Project for Statistical Computing" href="https://www.r-project.org/" rel="nofollow">R</a> news and <a title="R tutorials" href="https://www.r-bloggers.com/search/tutorial" rel="nofollow">tutorials</a> on topics such as: <a title="Data science" href="https://www.r-bloggers.com/search/data%20science" rel="nofollow">Data science</a>, <a title="Big Data" href="https://www.r-bloggers.com/search/Big%20Data" rel="nofollow">Big Data, <a title="R jobs" href="https://www.r-users.com/" rel="nofollow">R jobs</a>, visualization (<a title="ggplot and ggplot2 tutorials" href="https://www.r-bloggers.com/search/ggplot2" rel="nofollow">ggplot2</a>, <a title="Boxplots using lattice and ggplot2 tutorials" href="https://www.r-bloggers.com/search/boxplot" rel="nofollow">Boxplots</a>, <a title="Maps and gis" href="https://www.r-bloggers.com/search/map" rel="nofollow">maps</a>, <a title="Animation in R" href="https://www.r-bloggers.com/search/animation" rel="nofollow">animation</a>), programming (<a title="RStudio IDE for R" href="https://www.r-bloggers.com/search/RStudio" rel="nofollow">RStudio</a>, <a title="Sweave and literate programming" href="https://www.r-bloggers.com/search/sweave" rel="nofollow">Sweave</a>, <a title="LaTeX in R" href="https://www.r-bloggers.com/search/LaTeX" rel="nofollow">LaTeX</a>, <a title="SQL and databases" href="https://www.r-bloggers.com/search/SQL" rel="nofollow">SQL</a>, <a title="Eclipse IDE for R" href="https://www.r-bloggers.com/search/eclipse" rel="nofollow">Eclipse</a>, <a title="git and github, Version Control System" href="https://www.r-bloggers.com/search/git" rel="nofollow">git</a>, <a title="Large data in R using Hadoop" href="https://www.r-bloggers.com/search/hadoop" rel="nofollow">hadoop</a>, <a title="Web Scraping of google, facebook, yahoo, twitter and more using R" href="https://www.r-bloggers.com/search/Web+Scraping" rel="nofollow">Web Scraping</a>) statistics (<a title="Regressions and ANOVA analysis tutorials" href="https://www.r-bloggers.com/search/regression" rel="nofollow">regression</a>, <a title="principal component analysis tutorial" href="https://www.r-bloggers.com/search/PCA" rel="nofollow">PCA</a>, <a title="Time series" href="https://www.r-bloggers.com/search/time+series" rel="nofollow">time series</a>, <a title="finance trading" href="https://www.r-bloggers.com/search/trading" rel="nofollow">trading</a>) and more...
</div></p><img src="http://feeds.feedburner.com/~r/RBloggers/~4/hgMmkMnhfBQ" height="1" width="1" alt=""/>]]></content:encoded>
	<enclosure url="" length="" type="" />
	<post-id xmlns="com-wordpress:feed-additions:1">153047</post-id>	<feedburner:origLink>https://www.r-bloggers.com/twitter-analysis-using-r-semantic-analysis-of-french-elections/</feedburner:origLink></item>
		<item>
		<title>No time wasting</title>
		<link>http://feedproxy.google.com/~r/RBloggers/~3/2vt4CgSqeH4/</link>
		<pubDate>Mon, 17 Jul 2017 16:42:59 +0000</pubDate>
		<dc:creator><![CDATA[Blog]]></dc:creator>
				<category><![CDATA[R bloggers]]></category>

		<guid isPermaLink="false">https://www.mango-solutions.com/blog/no-time-wasting</guid>
		<description><![CDATA[
                
                                
                                Ensuring your analytic IP is given the attention it deserves
It is now widely recognised that data is the key to making informed business decisions. As such, models and ...]]></description>
				<content:encoded><![CDATA[<p class="syndicated-attribution"><div class="social4i" style="height:29px;"><div class="social4in" style="height:29px;float: left;"><div class="socialicons s4fblike" style="float:left;margin-right: 10px;"><div class="fb-like" data-href="https://www.r-bloggers.com/no-time-wasting/" data-send="true"  data-layout="button_count" data-width="100" data-height="21"  data-show-faces="false"></div></div><div class="socialicons s4linkedin" style="float:left;margin-right: 10px;"><script type="in/share" data-url="https://www.r-bloggers.com/no-time-wasting/" data-counter="right"></script></div></div><div style="clear:both"></div></div>

<div style="border: 1px solid; background: none repeat scroll 0 0 #EDEDED; margin: 1px; font-size: 12px;">
(This article was first published on  <strong><a href="https://www.mango-solutions.com/blog/no-time-wasting"> Blog</a></strong>, and kindly contributed to <a href="https://www.r-bloggers.com/" rel="nofollow">R-bloggers)</a>      
</div></p>
<p>                                <img src="https://i2.wp.com/www.mango-solutions.com/blog/images/f/2/c/3/3/f2c33c43067e23917619ae347feda715238ced5a-blog-post---modspace.png?w=456&#038;ssl=1" srcset_temp="/blog/images/4/4/1/9/0/44190cec8a98be1b32141a7553cc469090794a06-blog-post---modspace3x.png 1440w, /blog/images/9/9/6/d/d/996dd00b341b3ba1748a13283b98e39e60d52594-blog-post---modspace2x.png 1000w, /blog/images/f/2/c/3/3/f2c33c43067e23917619ae347feda715238ced5a-blog-post---modspace.png 640w" sizes="100vw" data-recalc-dims="1" /></p>
<h2>Ensuring your analytic IP is given the attention it deserves</h2>
<p>It is now widely recognised that data is the key to making informed business decisions. As such, models and code are the tools used to extract insight and should be considered very valuable IP for an organisation.</p>
<p>Considering data as a valuable asset, it’s important to store it so it’s easy for others within the organisation to find, reuse and repurpose this code in other projects and areas of the business. </p>
<p>However, there are some key challenges: </p>
<p><strong>Losing code</strong></br><br />
Even if sharing code is actively encouraged inside an organisation, traditional storage platforms treat analytical code in the same way as any other file. This means that —without prior knowledge of a particular script’s existence— they can be hard to find, and in some cases lost forever in a mass of other files and scripts in the same platform.</p>
<p><strong>Reproducibility</strong></br><br />
Have you ever been asked to reproduce a piece of analysis from 6 months ago? Or how about two years ago? For many, reproducing an older piece of analysis can be a huge task. Finding the script is one thing, but then knowing which version of a script was used, the data it was run against, the versions of the software that it was originally run in make this a more complex problem that you might originally think.</p>
<p><strong>Wasting time</strong></br><br />
How many times have you written a script, only to find out a colleague has already written code which does the exact same thing? If this has happened to you, then it has probably happened to your colleagues. </p>
<p><strong>ModSpace offers the solutions to these problems and much more.</strong> </br></p>
<p>Developed by the Mango team with modellers and statisticians in mind, it is a safe place to store analytical code and models. Plus, because ModSpace has the ability to understand analytical code when code is loaded into the system, key information is collected to make it easily discoverable by other users.</p>
<p>By linking ModSpace to multiple repositories around your organisation and integrating it with analysts’ preferred tools  —such as R, SAS, Python, MATLAB, NONMEM and many others— you’re providing your teams with an enhanced workflow and analytic hub.</p>
<p><strong>If analytical code and models are a vital part of your business, please join us on 20 July for a FREE demonstration of ModSpace.</strong></br></p>
<h4><a href="https://www.mango-solutions.com/modspace/" rel="nofollow" target="_blank">Register now</a>.</h4>

		<script type='text/javascript'>
		  var vglnk = { key: '949efb41171ac6ec1bf7f206d57e90b8' };

		  (function(d, t) {
			var s = d.createElement(t); s.type = 'text/javascript'; s.async = true;
			s.src = '//cdn.viglink.com/api/vglnk.js';
			var r = d.getElementsByTagName(t)[0]; r.parentNode.insertBefore(s, r);
		  }(document, 'script'));
		</script>		
		
<p class="syndicated-attribution">
<div style="border: 1px solid; background: none repeat scroll 0 0 #EDEDED; margin: 1px; font-size: 13px;">
<div style="text-align: center;">To <strong>leave a comment</strong> for the author, please follow the link and comment on their blog: <strong><a href="https://www.mango-solutions.com/blog/no-time-wasting"> Blog</a></strong>.</div>
<hr />
<a href="https://www.r-bloggers.com/" rel="nofollow">R-bloggers.com</a> offers <strong><a href="https://feedburner.google.com/fb/a/mailverify?uri=RBloggers" rel="nofollow">daily e-mail updates</a></strong> about <a title="The R Project for Statistical Computing" href="https://www.r-project.org/" rel="nofollow">R</a> news and <a title="R tutorials" href="https://www.r-bloggers.com/search/tutorial" rel="nofollow">tutorials</a> on topics such as: <a title="Data science" href="https://www.r-bloggers.com/search/data%20science" rel="nofollow">Data science</a>, <a title="Big Data" href="https://www.r-bloggers.com/search/Big%20Data" rel="nofollow">Big Data, <a title="R jobs" href="https://www.r-users.com/" rel="nofollow">R jobs</a>, visualization (<a title="ggplot and ggplot2 tutorials" href="https://www.r-bloggers.com/search/ggplot2" rel="nofollow">ggplot2</a>, <a title="Boxplots using lattice and ggplot2 tutorials" href="https://www.r-bloggers.com/search/boxplot" rel="nofollow">Boxplots</a>, <a title="Maps and gis" href="https://www.r-bloggers.com/search/map" rel="nofollow">maps</a>, <a title="Animation in R" href="https://www.r-bloggers.com/search/animation" rel="nofollow">animation</a>), programming (<a title="RStudio IDE for R" href="https://www.r-bloggers.com/search/RStudio" rel="nofollow">RStudio</a>, <a title="Sweave and literate programming" href="https://www.r-bloggers.com/search/sweave" rel="nofollow">Sweave</a>, <a title="LaTeX in R" href="https://www.r-bloggers.com/search/LaTeX" rel="nofollow">LaTeX</a>, <a title="SQL and databases" href="https://www.r-bloggers.com/search/SQL" rel="nofollow">SQL</a>, <a title="Eclipse IDE for R" href="https://www.r-bloggers.com/search/eclipse" rel="nofollow">Eclipse</a>, <a title="git and github, Version Control System" href="https://www.r-bloggers.com/search/git" rel="nofollow">git</a>, <a title="Large data in R using Hadoop" href="https://www.r-bloggers.com/search/hadoop" rel="nofollow">hadoop</a>, <a title="Web Scraping of google, facebook, yahoo, twitter and more using R" href="https://www.r-bloggers.com/search/Web+Scraping" rel="nofollow">Web Scraping</a>) statistics (<a title="Regressions and ANOVA analysis tutorials" href="https://www.r-bloggers.com/search/regression" rel="nofollow">regression</a>, <a title="principal component analysis tutorial" href="https://www.r-bloggers.com/search/PCA" rel="nofollow">PCA</a>, <a title="Time series" href="https://www.r-bloggers.com/search/time+series" rel="nofollow">time series</a>, <a title="finance trading" href="https://www.r-bloggers.com/search/trading" rel="nofollow">trading</a>) and more...
</div></p><img src="http://feeds.feedburner.com/~r/RBloggers/~4/2vt4CgSqeH4" height="1" width="1" alt=""/>]]></content:encoded>
	<enclosure url="" length="" type="" />
	<post-id xmlns="com-wordpress:feed-additions:1">153040</post-id>	<feedburner:origLink>https://www.r-bloggers.com/no-time-wasting/</feedburner:origLink></item>
		<item>
		<title>Top reasons to send your team to EARL</title>
		<link>http://feedproxy.google.com/~r/RBloggers/~3/pcG9IGRjqzA/</link>
		<pubDate>Mon, 17 Jul 2017 16:21:37 +0000</pubDate>
		<dc:creator><![CDATA[Blog]]></dc:creator>
				<category><![CDATA[R bloggers]]></category>

		<guid isPermaLink="false">https://www.mango-solutions.com/blog/top-reasons-to-send-your-team-to-earl</guid>
		<description><![CDATA[
                
                                
                                It’s easy to get stuck in the day-to-day at the office and there’s never time to upskill or even think about career development. However, to really grow and develop ...]]></description>
				<content:encoded><![CDATA[<p class="syndicated-attribution"><div class="social4i" style="height:29px;"><div class="social4in" style="height:29px;float: left;"><div class="socialicons s4fblike" style="float:left;margin-right: 10px;"><div class="fb-like" data-href="https://www.r-bloggers.com/top-reasons-to-send-your-team-to-earl/" data-send="true"  data-layout="button_count" data-width="100" data-height="21"  data-show-faces="false"></div></div><div class="socialicons s4linkedin" style="float:left;margin-right: 10px;"><script type="in/share" data-url="https://www.r-bloggers.com/top-reasons-to-send-your-team-to-earl/" data-counter="right"></script></div></div><div style="clear:both"></div></div>

<div style="border: 1px solid; background: none repeat scroll 0 0 #EDEDED; margin: 1px; font-size: 12px;">
(This article was first published on  <strong><a href="https://www.mango-solutions.com/blog/top-reasons-to-send-your-team-to-earl"> Blog</a></strong>, and kindly contributed to <a href="https://www.r-bloggers.com/" rel="nofollow">R-bloggers)</a>      
</div></p>
<p>                                <img src="https://i1.wp.com/www.mango-solutions.com/blog/images/2/0/2/5/2/202529befdf74ff12a1e712bb963f265d1d35510-top-reasons-to-send-your-team-to-earl.png?w=456&#038;ssl=1" srcset_temp="/blog/images/1/4/6/e/2/146e219ca5f20fd042881a6b28b45d1332538d22-top-reasons-to-send-your-team-to-earl2x.png 1000w, /blog/images/a/7/b/3/a/a7b3a5050f55bf04945ab63127522fa64356b177-top-reasons-to-send-your-team-to-earl3x.png 1440w, /blog/images/2/0/2/5/2/202529befdf74ff12a1e712bb963f265d1d35510-top-reasons-to-send-your-team-to-earl.png 640w" sizes="100vw" data-recalc-dims="1" /></p>
<p>It’s easy to get stuck in the day-to-day at the office and there’s never time to upskill or even think about career development. However, to really grow and develop your organisation, it’s important to grow and develop your team.</br></p>
<p>While there are many ways to develop teams, including training and providing time to complete personal (and relevant) projects, conferences provide a range of benefits. </br></p>
<p><strong>Spark innovation</strong></br><br />
Some of the best in the business present their projects, ideas and solutions at EARL each year. It’s the perfect opportunity to see what’s trending and what’s really working. Topics at EARL Conferences this year include, best practice SAS to R; Shiny applications; using social media data; web scraping, plus presentations on R in marketing, healthcare, finance, insurance and transport. Take a look at the <a href="https://earlconf.com/downloads/london/EARL-London-2017-Agenda.pdf" rel="nofollow" target="_blank">agenda for London here</a>.</br></p>
<p>A cross-sector conference like EARL can help your organisation think outside the box because learnings are transferable, regardless of industry.</br></br></p>
<p><strong>Imbue knowledge</strong></br><br />
This brings us to knowledge. Learning from the best in the business will help employees expand their knowledgebase. This can keep them motivated and engaged in what they’re doing; and a wider knowledgebase can also inform their everyday tasks enabling them to advance the way they do their job. </br></p>
<p>When employees feel like you want to invest in them, they stay engaged and are more likely to remain in the same organisation for longer.</br></br></p>
<p><strong>Encourage networking</strong></br><br />
EARL attracts R users from all levels and industries and not just to speak. The agenda offers plenty of opportunities to network with some of the industry’s most engaged R users. This is beneficial for a number of reasons, including knowledge exchange and sharing your organisation’s values.</br></br></p>
<p><strong>Boost inspiration</strong></br><br />
We often see delegates who have come to an EARL Conference with a specific business challenge in mind. By attending, they get access to the current innovations, knowledge and networking mentioned above, and can return to their team —post-conference— with a renewed vigour to solve those problems using their new-found knowledge.</br></br></p>
<h5>Making the most out of attending EARL</br></h5>
<p>After all of that, the next step is making sure your organisation makes the most out of attending EARL. We recommend:</br></p>
<p><strong>Setting goals</strong></br><br />
Do you have a specific challenge you’re trying to solve in your organisation? Going with a set challenge in mind means your team can plan which sessions to sit in and who they should talk to during the networking sessions.  </br></br></p>
<p><strong>De-briefing</strong></br><br />
This is two-fold:</br><br />
1)  Writing a post-conference report will help your team put what they have learnt at EARL into action.</br><br />
2)  Not everyone can attend, so those who do can share their new-found knowledge with their peers who can learn second-hand from their colleague’s just be smarexperience.</br></br></p>
<p><strong>Following up</strong></br><br />
We’re all guilty of going to a conference, coming back inspired and then getting lost in the day-to-day. Assuming you’ve set goals and de-briefed, it should be easy to develop a follow up plan. </br></p>
<p>You can make the most of inspired team members to put in place new strategies, technologies and innovations through further training, contact follow-ups and new procedure development.</br></br></p>
<p><strong>EARL Conference offers a range of discounts, including package deals for organisations looking to send more than 2 delegates.</strong></p>
<p><strong><a href="https://earlconf.com/" rel="nofollow" target="_blank">Buy tickets now</a> or <a href="mailto:earl-team@mango-solutions.com" rel="nofollow" target="_blank">contact the EARL Team</a></strong></p>

		<script type='text/javascript'>
		  var vglnk = { key: '949efb41171ac6ec1bf7f206d57e90b8' };

		  (function(d, t) {
			var s = d.createElement(t); s.type = 'text/javascript'; s.async = true;
			s.src = '//cdn.viglink.com/api/vglnk.js';
			var r = d.getElementsByTagName(t)[0]; r.parentNode.insertBefore(s, r);
		  }(document, 'script'));
		</script>		
		
<p class="syndicated-attribution">
<div style="border: 1px solid; background: none repeat scroll 0 0 #EDEDED; margin: 1px; font-size: 13px;">
<div style="text-align: center;">To <strong>leave a comment</strong> for the author, please follow the link and comment on their blog: <strong><a href="https://www.mango-solutions.com/blog/top-reasons-to-send-your-team-to-earl"> Blog</a></strong>.</div>
<hr />
<a href="https://www.r-bloggers.com/" rel="nofollow">R-bloggers.com</a> offers <strong><a href="https://feedburner.google.com/fb/a/mailverify?uri=RBloggers" rel="nofollow">daily e-mail updates</a></strong> about <a title="The R Project for Statistical Computing" href="https://www.r-project.org/" rel="nofollow">R</a> news and <a title="R tutorials" href="https://www.r-bloggers.com/search/tutorial" rel="nofollow">tutorials</a> on topics such as: <a title="Data science" href="https://www.r-bloggers.com/search/data%20science" rel="nofollow">Data science</a>, <a title="Big Data" href="https://www.r-bloggers.com/search/Big%20Data" rel="nofollow">Big Data, <a title="R jobs" href="https://www.r-users.com/" rel="nofollow">R jobs</a>, visualization (<a title="ggplot and ggplot2 tutorials" href="https://www.r-bloggers.com/search/ggplot2" rel="nofollow">ggplot2</a>, <a title="Boxplots using lattice and ggplot2 tutorials" href="https://www.r-bloggers.com/search/boxplot" rel="nofollow">Boxplots</a>, <a title="Maps and gis" href="https://www.r-bloggers.com/search/map" rel="nofollow">maps</a>, <a title="Animation in R" href="https://www.r-bloggers.com/search/animation" rel="nofollow">animation</a>), programming (<a title="RStudio IDE for R" href="https://www.r-bloggers.com/search/RStudio" rel="nofollow">RStudio</a>, <a title="Sweave and literate programming" href="https://www.r-bloggers.com/search/sweave" rel="nofollow">Sweave</a>, <a title="LaTeX in R" href="https://www.r-bloggers.com/search/LaTeX" rel="nofollow">LaTeX</a>, <a title="SQL and databases" href="https://www.r-bloggers.com/search/SQL" rel="nofollow">SQL</a>, <a title="Eclipse IDE for R" href="https://www.r-bloggers.com/search/eclipse" rel="nofollow">Eclipse</a>, <a title="git and github, Version Control System" href="https://www.r-bloggers.com/search/git" rel="nofollow">git</a>, <a title="Large data in R using Hadoop" href="https://www.r-bloggers.com/search/hadoop" rel="nofollow">hadoop</a>, <a title="Web Scraping of google, facebook, yahoo, twitter and more using R" href="https://www.r-bloggers.com/search/Web+Scraping" rel="nofollow">Web Scraping</a>) statistics (<a title="Regressions and ANOVA analysis tutorials" href="https://www.r-bloggers.com/search/regression" rel="nofollow">regression</a>, <a title="principal component analysis tutorial" href="https://www.r-bloggers.com/search/PCA" rel="nofollow">PCA</a>, <a title="Time series" href="https://www.r-bloggers.com/search/time+series" rel="nofollow">time series</a>, <a title="finance trading" href="https://www.r-bloggers.com/search/trading" rel="nofollow">trading</a>) and more...
</div></p><img src="http://feeds.feedburner.com/~r/RBloggers/~4/pcG9IGRjqzA" height="1" width="1" alt=""/>]]></content:encoded>
	<enclosure url="" length="" type="" />
	<post-id xmlns="com-wordpress:feed-additions:1">153056</post-id>	<feedburner:origLink>https://www.r-bloggers.com/top-reasons-to-send-your-team-to-earl/</feedburner:origLink></item>
	</channel>
</rss>
