<?xml version="1.0" encoding="utf-8" standalone="yes" ?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Gutenbergr on rostrum.blog</title>
    <link>https://www.rostrum.blog/tags/gutenbergr/</link>
    <description>Recent content in Gutenbergr on rostrum.blog</description>
    <generator>Hugo -- gohugo.io</generator>
    <language>en-gb</language>
    <lastBuildDate>Sun, 12 Sep 2021 00:00:00 +0000</lastBuildDate>
    
	<atom:link href="https://www.rostrum.blog/tags/gutenbergr/index.xml" rel="self" type="application/rss+xml" />
    
    
    <item>
      <title>Extract punctuation from books with R</title>
      <link>https://www.rostrum.blog/2021/09/12/extract-punct/</link>
      <pubDate>Sun, 12 Sep 2021 00:00:00 +0000</pubDate>
      
      <guid>https://www.rostrum.blog/2021/09/12/extract-punct/</guid>
      <description>The start of ‘Moby Dick’ by Herman Melville  tl;dr I wrote an R function to extract only the punctuation marks from a provided text. It prints prettily to the console, but you can also take a character vector away for further analysis.
 Punct rock A few years ago Adam J Calhoun did a small but really neat thing: extracted and presented only the punctuation from some books. It appeared again recently in my Twitter timeline.</description>
    </item>
    
  </channel>
</rss>