Randomness in R

Randomness is a double-edged sword: sometimes you want it, sometimes you want to control it. Many processes in R include an element of randomness:

performing random picks from a collection of values
dimensionality reductions like PCA, t-SNE and UMAP
clustering

Because these processes involve random choices, running the same code twice can yield slightly different results.
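As a minimal sketch of this effect, the snippet below draws two sets of normal values without fixing the random number generator. The variable names first and second are chosen only for illustration.

```r
# No seed is set here, so each call continues from wherever
# the random number generator currently is.
first  <- rnorm(5)   # 5 values from a standard normal distribution
second <- rnorm(5)   # 5 more values from the same distribution

# The two draws come from the same distribution but hold different numbers.
print(first)
print(second)
identical(first, second)
```

The last line should report FALSE in practice, and the printed values will change from one R session to the next.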

Ensuring reproducibility

To ensure that random processes behave consistently, R allows you to fix the state of the random number generator:

set.seed(n)

As long as you use the same seed value n and run the same random calls in the same order, R will generate the same sequence of random numbers, and thus the same results, every time the code is executed.
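The following sketch illustrates this for two of the processes listed earlier: a random pick and a clustering. The seed value 123, the built-in iris data set, and the choice of 3 clusters are assumptions made only for this example.

```r
# Random picks: resetting the seed before each draw gives the same sample.
set.seed(123)                    # 123 is an arbitrary illustrative seed
first  <- sample(1:100, 5)
set.seed(123)
second <- sample(1:100, 5)
identical(first, second)         # TRUE

# Clustering: kmeans() chooses its starting centers at random,
# so fixing the seed also fixes the resulting cluster assignment.
set.seed(123)
fit1 <- kmeans(iris[, 1:4], centers = 3)
set.seed(123)
fit2 <- kmeans(iris[, 1:4], centers = 3)
identical(fit1$cluster, fit2$cluster)   # TRUE
```

Note that the seed has to be set again before the second run: calling set.seed() once and then running the random step twice would give two different results.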

Generating reproducible random data

To create a reproducible random data set, first set a seed and then call functions that draw random numbers, for example:

set.seed(42)
x <- rnorm(10)          # 10 values from a standard normal distribution
y <- runif(10)          # 10 values from a uniform distribution on [0, 1]
z <- sample(1:20, 10)   # 10 distinct integers between 1 and 20 (drawn without replacement)
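One way to make such a data set easy to regenerate is to wrap the seed and the draws in a function. The sketch below assumes a hypothetical helper called make_data, which is not part of R or of this lesson.

```r
# Hypothetical helper: builds the same three vectors for a given seed.
make_data <- function(seed) {
  set.seed(seed)
  list(
    x = rnorm(10),          # 10 values from a standard normal distribution
    y = runif(10),          # 10 values from a uniform distribution on [0, 1]
    z = sample(1:20, 10)    # 10 distinct integers between 1 and 20
  )
}

a <- make_data(42)
b <- make_data(42)
identical(a, b)             # TRUE: same seed, same calls, same order
```

Because the draws always happen in the same order after the seed is set, every call with the same seed returns the same data. Changing the order of the draws, or inserting another random call between them, would change the values even with an unchanged seed.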

