A friend of mine once told me a 'flawless' tactic for beating the roulette game.
1. bet on black - if you lose, bet on black again but now double the amount - if you win now, you will have not lost any money - if you lose again, bet on black again but now double the amount - if you win now, you will have not lost any money - if you lose again, bet on black again but now double the amount - if you win now, you will have not lost any money - if you lose again, bet on black again but now double the amount etc ... - if you win, profit! 2. repeat
The idea is that the probability of always getting red converges to zero and as of such you should always be able to win lost money back.
In this document I will explain why this tactic is flawed via probability theory as well as simulations. As a seperate goal, this document will also help explain simulation and lazy plotting patterns in R.
Simulating Gambles in R
Let's start by first simulating and drawing a random path. For these first simulations we will assume that everytime you gamble you win some or lose some depending on the output of a cointoss. A possible way of simulating this can be via;
randints = function(num) { sample(c(-1, 1), num, replace = TRUE) } df1 = data.frame(cs = cumsum(randints(100)), t = 1:100) ggplot() + geom_line(data = df1, aes(t, cs), alpha = 0.8)
Simulating another such line and adding that to the plot is straightforward.
df2 = data.frame(cs = cumsum(randints(100)), t = 1:100) df3 = data.frame(cs = cumsum(randints(100)), t = 1:100) ggplot() + geom_line(data = df1, aes(t, cs), alpha = 0.8) + geom_line(data = df2, aes(t, cs), alpha = 0.8) + geom_line(data = df3, aes(t, cs), alpha = 0.8)
The code can be made more elegant by exploiting the lazy evaluation of the ggplot2 library. You don't need to write a single line of ... + geom_line() + geom_line() + ...
code. You can append it to a variable instead.
df3 = data.frame(cs=cumsum(randints(100)), t=1:100) p = ggplot() p = p + geom_line(data=df1, aes(t,cs), alpha=0.8, color="red") p = p + geom_line(data=df2, aes(t,cs), alpha=0.8, color="green") p = p + geom_line(data=df3, aes(t,cs), alpha=0.8, color="blue") p
This is a more favorable pattern. You can keep adding plotting layers (from different datasets) to the variable p
and it will not draw a single pixel until you call p
at the end. It is clear for a user to see, line by line, that new layers are being added to the layer and you can use this pattern within a forloop. This means that you can do many simulations and draw them from a single function.
gamble_plot = function(nruns){ p = ggplot() for(i in 1:nruns){ df = data.frame(cs=cumsum(randints(100)), t=1:100) p = p + geom_line(data=df, aes(t,cs), alpha=0.1) } p } gamble_plot(300)
In this pattern it is also relatively easy to assign colors to series of gamblers if they result in a long term profit or loss. Also, it might make sense to have the length of the random path be an input for our simulation function.
gamble_plot = function(nruns, len){ p = ggplot() for(i in 1:nruns){ color = "darkgreen" df = data.frame(cs=cumsum(randints(len)), t=1:len) if(select(df, cs)[len,] < 0) color = "darkorange" p = p + geom_line(data=df, aes(t,cs), alpha=0.4, color=color) } p } gamble_plot(300, 200)
Casino Royale
Using this plotting pattern, let's simulate the results from the 'flawless' roulette tactic. The following functions help the simulation.
gamble = function(moneyin){ if( runif(1) < 0.5 ) return(moneyin) # result is black -moneyin # result is red } nextm = function(gamble, outcome){ if(outcome < 0) return(2*gamble) #on lose we double the money 1 } simulate = function(maxt){ df = data.frame(time=as.numeric(c()), money=as.numeric(c())) move = 0 outcome = 0 for(i in 1:maxt){ move = nextm(move, outcome) outcome = gamble(move) df = rbind(df, data.frame(time=i, money=outcome)) } df$money = cumsum(df$money) df } gamblersruin = function(num, maxsim){ p = ggplot() for(i in 1:num){ df = simulate(maxsim) p = p + geom_line(data=df, aes(time,money), alpha=0.3) } p }
Notice that I am assuming the probability of getting black (
Let's run some results.
set.seed(1) gamblersruin(1,100) + ggtitle('single simulation, 100 bets') gamblersruin(1,1000) + ggtitle('single simulation, 1000 bets') gamblersruin(1,10000) + ggtitle('single simulation, 10000 bets')
Ouch, on the long term it does seem like you are making a net profit but you are not without risk. The probability of getting red 12 times in a row is small but when you are playing this game for a very long time then this event suddenly becomes likely. And when it happens you need to compensate with
Probability Theory
So the simulations are giving us reasons to be pessimistic about the tactic, but the long term end result does seem to be positive. What does mathematics tell us?
I will be a bit formal here, but thats a given when dealing with math. Suppose that we play the roulette game until we have made a profit of 0 or 1. Let's consider this outcome to be a stochastic variable
Then also let
The variance of the game is then defined via the definition.
So what happens when we play this game an infinite amount of time?
This could feel very curious but should come with no suprise. If we play the game an infinite amount of time we will have an infinite amount of risk. In laymans terms, if you want to earn an infinite amount of money you will need an infite amount of money.
Gamblers Fail
Even if we assume that you have an infinite amount of betting money, the tactic will still fail in real life. The main reasons is that casinos tend to apply a maximum bet in all their gambling games. You can never bet above a certain amount in the casino which means that you cannot apply the 'doubling' tactic infinetely.
How likely is it to hit a casino limit during a game? Let
This can then be expanded to playing
Notice that you cannot play the game for an infinite amount of time now without risking such a loss.
f = function(k,n){ 1 - ( (2^k - 1) / ( 2^k ) )^n } df = data.frame(prob=f(9,1:10000), n=1:10000, limit=" 512") df = rbind(df, data.frame(prob=f(10,1:10000), n=1:10000, limit='1024')) df = rbind(df, data.frame(prob=f(11,1:10000), n=1:10000, limit='2048')) df = rbind(df, data.frame(prob=f(12,1:10000), n=1:10000, limit='4096')) p = ggplot() p = p + geom_line(data=df, aes(n, prob, colour=limit)) p + ggtitle("probability of hitting budget limit after 'n' gambles")
Also notice that if we consider the casino bounds then the expected value of the game
This conclusion changes further if we stop assuming that
Including Fail in Simulations
Let's run some more simulations again with these facts in mind.
nextm = function(gamble, outcome){ if(outcome < 0){ if(gamble > 500){ return(1) } return(2*gamble) } 1 } gamble = function(moneyin){ if( runif(1) < 0.4865 ) return(moneyin) # result is black -moneyin # result is red } gamblersruin(50,2500) + ggtitle("50 more realistic simulations")
And we can see that these outcomes seem to spread out. Making a histogram of the endgame states shows us how flawed the tactic really is.
gamblersend = function(num, maxsim){ df = data.frame(time=as.numeric(c()), money=as.numeric(c())) for(i in 1:num){ if( i %% 5 == 0 ){ cat(i, 'simulations have now run\n') } df = rbind(df, simulate(maxsim)[maxsim,]) } df } ends = gamblersend(200,2500) # big simulation, warning, takes long p = ggplot() p = p + geom_histogram(data=ends, aes(money), alpha=0.9, binwidth=1000) p + ggtitle("histogram of casino outcomes after 2500 bets")
The tactic simply isn't leading to a lead positive result, we are definately making an average loss with this tactic.
> mean(ends$money) [1] -336.2
Conclusion
These infinite betting strategies where the outcome seems riskless are known as martingales, you can find more information about them on wikipedia. Mathematics is a nice tool and should not be ignored when tackling these strategies but to confirm their flaws with something maybe a little more tangible R can be a great tool. Creating and running simulations in R is easier if you split your simulation into smaller functional bits. The lazy evaluation of ggplot can also be of help to keep the code you need clean.
You should also be able to find this blog on: r-bloggers