<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>R | Reprex</title><link>https://reprex-next.netlify.app/tag/r/</link><atom:link href="https://reprex-next.netlify.app/tag/r/index.xml" rel="self" type="application/rss+xml"/><description>R</description><generator>Wowchemy (https://wowchemy.com)</generator><language>en-us</language><lastBuildDate>Fri, 07 Oct 2022 12:35:00 +0200</lastBuildDate><image><url>https://reprex-next.netlify.app/media/icon_hub9491570ac57158c0eeecc95c95b13e5_20247_512x512_fill_lanczos_center_3.png</url><title>R</title><link>https://reprex-next.netlify.app/tag/r/</link></image><item><title>Learn R with Reprex</title><link>https://reprex-next.netlify.app/slides/learn-with-reprex/</link><pubDate>Fri, 07 Oct 2022 12:35:00 +0200</pubDate><guid>https://reprex-next.netlify.app/slides/learn-with-reprex/</guid><description>&lt;h1 id="big-data-creates-inequalities">Big Data Creates Inequalities&lt;/h1>
&lt;p>Only the largest corporations, best-endowed universities, and rich governments can afford data collection and processing capacities that are large enough to harness the advantages of AI.&lt;/p>
&lt;hr>
&lt;h2 id="slide-navigation">Slide navigation&lt;/h2>
&lt;p>Fullscreen: &lt;code>F&lt;/code>&lt;/p>
&lt;ul>
&lt;li>Next: &lt;code>️&amp;gt;&lt;/code> or &lt;code>Space&lt;/code> | Previous :️&lt;code>&amp;lt;&lt;/code>&lt;/li>
&lt;li>Start: &lt;code>Home&lt;/code> | Finish: &lt;code>End&lt;/code>&lt;/li>
&lt;li>Overview: &lt;code>Esc&lt;/code>| Speaker notes: &lt;code>S&lt;/code>&lt;/li>
&lt;li>Zoom: &lt;code>Alt + Click 🖱️&lt;/code>&lt;/li>
&lt;/ul>
&lt;hr>
&lt;h2 id="big-data-that-works-for-all">Big data that works for all&lt;/h2>
&lt;ul>
&lt;li>
&lt;p style="font-size:75%">No matter how big is the problem or how small is your team, `Reprex` fill your reports, dashboards, newsletters, books with data and its visualization.
&lt;/li>
&lt;li>
&lt;p style="font-size:75%">Learn R with us: you can reduce the inequalities by joining the open source movement, learning to run open source software, ask for help, improve the tutorials, the documentation, and eventually learn to make the computer work for you.
&lt;/li>
&lt;li>
&lt;p style="font-size:75%">Contributor Covenant: Participating in open source is often a highly collaborative experience. We’re encouraged to create in public view, and we’re incentivized to welcome contributions of all kinds from people around the world. This makes the practice of open source as much social as it is technical.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;hr>
&lt;h2 id="get-inspired">Get Inspired&lt;/h2>
&lt;ul>
&lt;li>&lt;a href="https://curators.dataobservatory.eu/inspiration.html" target="_blank" rel="noopener">Find more interesting and better data&lt;/a>: you don&amp;rsquo;t have to be a data scientist or write code to contribute to our projects.&lt;/li>
&lt;li>&lt;a href="https://data-feminism.mitpress.mit.edu/" target="_blank" rel="noopener">Data feminism&lt;/a>: Catherine D&amp;rsquo;Ignazio and Lauren Klein present a new way of thinking about data science and data ethics—one that is informed by intersectional feminist thought. Highly inspirational, free, open-source book.&lt;/li>
&lt;li>&lt;a href="https://rladies.org/" target="_blank" rel="noopener">RLadies&lt;/a> is a world-wide organization to promote gender diversity in the R community.&lt;/li>
&lt;/ul>
&lt;hr>
&lt;h2 id="contributor-covenant">Contributor Covenant&lt;/h2>
&lt;ul>
&lt;li>
&lt;p style="font-size:75%">We as members, contributors, and leaders pledge to make participation in our community a harassment-free experience for everyone, regardless of age, body size, visible or invisible disability, ethnicity, sex characteristics, gender identity and expression, level of experience, education, socio-economic status, nationality, personal appearance, race, caste, color, religion, or sexual identity and orientation.&lt;/p>
&lt;/li>
&lt;li>
&lt;p style="font-size:75%">We pledge to act and interact in ways that contribute to an open, welcoming, diverse, inclusive, and healthy community.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;hr>
&lt;section data-noprocess data-shortcode-slide
data-background-image="retroharmonize_example_1.webp"
>
&lt;hr>
&lt;section data-noprocess data-shortcode-slide
data-background-image="retroharmonize_example_2.webp"
>
&lt;hr>
&lt;h2 id="run-code-from-tutorials">Run code from tutorials&lt;/h2>
&lt;p>&lt;a href="https://retroharmonize.dataobservatory.eu/" target="_blank" rel="noopener">retroharmonize.dataobservatory.eu&lt;/a>&lt;/br>
&lt;a href="https://retroharmonize.dataobservatory.eu/articles/retroharmonize.htmll" target="_blank" rel="noopener">🖱 Get started&lt;/a>&lt;/br>
[🖱️ Articles](&lt;a href="https://retroharmonize.dataobservatory.eu/articles/index.htm" target="_blank" rel="noopener">https://retroharmonize.dataobservatory.eu/articles/index.htm&lt;/a>&lt;/p>
&lt;hr>
&lt;section data-noprocess data-shortcode-slide
data-background-image="retroharmonize_readme.webp"
>
&lt;hr>
&lt;section data-noprocess data-shortcode-slide
data-background-image="github_issues_spotifyR.webp"
>
&lt;h2 id="find-help-ask-for-help-reprex">Find help, ask for help: reprex&lt;/h2>
&lt;hr>
&lt;section data-noprocess data-shortcode-slide
data-background-image="retroharmonize_tutorials.webp"
>
&lt;h2 id="documentation-for-better-tutorials">Documentation for better tutorials&lt;/h2>
&lt;hr>
&lt;section data-noprocess data-shortcode-slide
data-background-image="retroharmonize_r_testthat.webp"
>
&lt;h2 id="debugging-and-testing-code">Debugging and testing code&lt;/h2>
&lt;hr>
&lt;section data-noprocess data-shortcode-slide
data-background-image="retroharmonize_r_documentation.webp"
>
&lt;h2 id="contribute-to-documentation">Contribute to documentation&lt;/h2>
&lt;hr>
&lt;h2 id="r-is-a-functional-language">R is a functional language&lt;/h2>
&lt;ul>
&lt;li>R is both a statistical environment and a programming language&lt;/li>
&lt;li>R, the open source and further developed version of the S language, is mainly functional&lt;/li>
&lt;li>If you did a task at least twice, the 3rd time you better write a function script to keep doing it forever.&lt;/li>
&lt;li>Most of your effort will be to find a well-written function for your work&lt;/li>
&lt;li>If you cannot find a function, you will modify somebody else&amp;rsquo;s function, or eventually write your own&lt;/li>
&lt;/ul>
&lt;hr>
&lt;section data-noprocess data-shortcode-slide
data-background-image="retroharmonize_r_code.webp"
>
&lt;hr>
&lt;section data-noprocess data-shortcode-slide
data-background-image="rmd_example.webp"
>
&lt;h2 id="r--yaml--markdown--web-ready">R + YAML + markdown = web ready&lt;/h2>
&lt;hr>
&lt;ul>
&lt;li>&lt;a href="https://learnxinyminutes.com/docs/yaml/" target="_blank" rel="noopener">Learn YAML in Y minutes&lt;/a>: tell the computer what you want to do with a document&lt;/li>
&lt;li>&lt;a href="https://rmarkdown.rstudio.com/authoring_basics.html" target="_blank" rel="noopener">R Markdown basics&lt;/a>: it is just a plain markdown that allows you to insert little R program &amp;lsquo;chunks&amp;rsquo;.&lt;/li>
&lt;li>&lt;a href="https://github.com/mundimark/awesome-markdown-editors" target="_blank" rel="noopener">Awesome markdown editors and pre-writers&lt;/a>: find a convenient tool&lt;/li>
&lt;li>&lt;a href="https://workspace.google.com/marketplace/app/docs_to_markdown/700168918607" target="_blank" rel="noopener">Google Docs to markdown&lt;/a>: practice by translating your Google Docs text to markdown. It is &lt;em>very&lt;/em> easy.&lt;/li>
&lt;/ul>
&lt;hr>
&lt;section data-noprocess data-shortcode-slide
data-background-image="retroharmonize_website.webp"
>
&lt;h2 id="package-and-release-a-team-effort">Package and release: a team effort&lt;/h2>
&lt;hr>
&lt;h2 id="our-open-source-development-projects">Our open source development projects&lt;/h2>
&lt;p>🔢 &lt;a href="https://dataset.dataobservatory.eu/" target="_blank" rel="noopener">dataset&lt;/a>: Synchronize datasets with global knowledge hubs #️⃣ &lt;a href="https://statcodelists.dataobservatory.eu/" target="_blank" rel="noopener">statcodelists&lt;/a>: Make your data codes understood globally ♻️ &lt;a href="https://iotables.dataobservatory.eu/" target="_blank" rel="noopener">iotables&lt;/a>: Create economic or environmental impact assessments in any EU country 🌍 &lt;a href="https://regions.dataobservatory.eu/" target="_blank" rel="noopener">regions&lt;/a>: Create from raw survey data more granular statistics in any EU country ✅ &lt;a href="https://retroharmonize.dataobservatory.eu/" target="_blank" rel="noopener">retroharmonize&lt;/a>: Harmonize questions banks, recycle answers from past surveys ⏭️ &lt;a href="https://reprex.nl/#releases" target="_blank" rel="noopener">all in on one page&lt;/a>&lt;/p>
&lt;hr>
&lt;section data-noprocess data-shortcode-slide
data-background-image="create_with_reprex.webp"
>
&lt;h2 id="create-with-us">Create with us&lt;/h2>
&lt;hr>
&lt;h1 id="questions">Questions?&lt;/h1>
&lt;p>&lt;a href="https://reprex.nl/#contact" target="_blank" rel="noopener">Email&lt;/a> | &lt;a href="https://keybase.io/team/reprexcommunity" target="_blank" rel="noopener">Keybase&lt;/a>&lt;/p>
&lt;p>LinkedIn: &lt;a href="https://www.linkedin.com/in/antaldaniel/" target="_blank" rel="noopener">Daniel Antal&lt;/a> - &lt;a href="https://www.linkedin.com/company/68855596" target="_blank" rel="noopener">Reprex&lt;/a> | &lt;a href="https://reprex.nl/" target="_blank" rel="noopener">Home&lt;/a>&lt;/p></description></item><item><title>Learn R with Reprex</title><link>https://reprex-next.netlify.app/slides/learnr-with-reprex/</link><pubDate>Fri, 07 Oct 2022 12:35:00 +0200</pubDate><guid>https://reprex-next.netlify.app/slides/learnr-with-reprex/</guid><description>&lt;h1 id="big-data-creates-inequalities">Big Data Creates Inequalities&lt;/h1>
&lt;p>Only the largest corporations, best-endowed universities, and rich governments can afford data collection and processing capacities that are large enough to harness the advantages of AI.&lt;/p>
&lt;hr>
&lt;h2 id="slide-navigation">Slide navigation&lt;/h2>
&lt;p>Fullscreen: &lt;code>F&lt;/code>&lt;/p>
&lt;ul>
&lt;li>Next: &lt;code>️&amp;gt;&lt;/code> or &lt;code>Space&lt;/code> | Previous :️&lt;code>&amp;lt;&lt;/code>&lt;/li>
&lt;li>Start: &lt;code>Home&lt;/code> | Finish: &lt;code>End&lt;/code>&lt;/li>
&lt;li>Overview: &lt;code>Esc&lt;/code>| Speaker notes: &lt;code>S&lt;/code>&lt;/li>
&lt;li>Zoom: &lt;code>Alt + Click 🖱️&lt;/code>&lt;/li>
&lt;/ul>
&lt;hr>
&lt;h2 id="big-data-that-works-for-all">Big data that works for all&lt;/h2>
&lt;ul>
&lt;li>
&lt;p style="font-size:75%">No matter how big is the problem or how small is your team, `Reprex` fill your reports, dashboards, newsletters, books with data and its visualization.
&lt;/li>
&lt;li>
&lt;p style="font-size:75%">Learn R with us: you can reduce the inequalities by joining the open source movement, learning to run open source software, ask for help, improve the tutorials, the documentation, and eventually learn to make the computer work for you.
&lt;/li>
&lt;li>
&lt;p style="font-size:75%">Contributor Covenant: Participating in open source is often a highly collaborative experience. We’re encouraged to create in public view, and we’re incentivized to welcome contributions of all kinds from people around the world. This makes the practice of open source as much social as it is technical.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;hr>
&lt;h2 id="data-feminism">Data Feminism&lt;/h2>
&lt;hr>
&lt;h2 id="get-inspired">Get Inspired&lt;/h2>
&lt;ul>
&lt;li>&lt;a href="https://curators.dataobservatory.eu/inspiration.html" target="_blank" rel="noopener">Find more interesting and better data&lt;/a>: you don&amp;rsquo;t have to be a data scientist or write code to contribute to our projects.&lt;/li>
&lt;li>&lt;a href="https://data-feminism.mitpress.mit.edu/" target="_blank" rel="noopener">Data feminism&lt;/a>: Catherine D&amp;rsquo;Ignazio and Lauren Klein present a new way of thinking about data science and data ethics—one that is informed by intersectional feminist thought. Highly inspirational, free, open-source book.&lt;/li>
&lt;li>&lt;a href="https://rladies.org/" target="_blank" rel="noopener">RLadies&lt;/a> is a world-wide organization to promote gender diversity in the R community.&lt;/li>
&lt;/ul>
&lt;hr>
&lt;h2 id="contributor-covenant">Contributor Covenant&lt;/h2>
&lt;ul>
&lt;li>
&lt;p style="font-size:75%">We as members, contributors, and leaders pledge to make participation in our community a harassment-free experience for everyone, regardless of age, body size, visible or invisible disability, ethnicity, sex characteristics, gender identity and expression, level of experience, education, socio-economic status, nationality, personal appearance, race, caste, color, religion, or sexual identity and orientation.&lt;/p>
&lt;/li>
&lt;li>
&lt;p style="font-size:75%">We pledge to act and interact in ways that contribute to an open, welcoming, diverse, inclusive, and healthy community.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;hr>
&lt;section data-noprocess data-shortcode-slide
data-background-image="retroharmonize_example_1.webp"
>
&lt;hr>
&lt;section data-noprocess data-shortcode-slide
data-background-image="retroharmonize_example_2.webp"
>
&lt;hr>
&lt;h2 id="run-code-from-tutorials">Run code from tutorials&lt;/h2>
&lt;p>&lt;a href="https://retroharmonize.dataobservatory.eu/" target="_blank" rel="noopener">retroharmonize.dataobservatory.eu&lt;/a>&lt;/br>
&lt;a href="https://retroharmonize.dataobservatory.eu/articles/retroharmonize.htmll" target="_blank" rel="noopener">🖱 Get started&lt;/a>&lt;/br>
&lt;a href="https://retroharmonize.dataobservatory.eu/articles/index.html" target="_blank" rel="noopener">🖱️ Articles&lt;/a>&lt;/p>
&lt;hr>
&lt;section data-noprocess data-shortcode-slide
data-background-image="retroharmonize_readme.webp"
>
&lt;hr>
&lt;section data-noprocess data-shortcode-slide
data-background-image="github_issues_spotifyR.webp"
>
&lt;h2 id="find-help-ask-for-help-reprex">Find help, ask for help: reprex&lt;/h2>
&lt;hr>
&lt;section data-noprocess data-shortcode-slide
data-background-image="retroharmonize_tutorials.webp"
>
&lt;h2 id="documentation-for-better-tutorials">Documentation for better tutorials&lt;/h2>
&lt;hr>
&lt;section data-noprocess data-shortcode-slide
data-background-image="retroharmonize_r_testthat.webp"
>
&lt;h2 id="debugging-and-testing-code">Debugging and testing code&lt;/h2>
&lt;hr>
&lt;section data-noprocess data-shortcode-slide
data-background-image="retroharmonize_r_documentation.webp"
>
&lt;h2 id="contribute-to-documentation">Contribute to documentation&lt;/h2>
&lt;hr>
&lt;h2 id="r-is-a-functional-language">R is a functional language&lt;/h2>
&lt;ul>
&lt;li>R is both a statistical environment and a programming language&lt;/li>
&lt;li>R, the open source and further developed version of the S language, is mainly functional&lt;/li>
&lt;li>If you did a task at least twice, the 3rd time you better write a function script to keep doing it forever.&lt;/li>
&lt;li>Most of your effort will be to find a well-written function for your work&lt;/li>
&lt;li>If you cannot find a function, you will modify somebody else&amp;rsquo;s function, or eventually write your own&lt;/li>
&lt;/ul>
&lt;hr>
&lt;section data-noprocess data-shortcode-slide
data-background-image="retroharmonize_r_code.webp"
>
&lt;hr>
&lt;section data-noprocess data-shortcode-slide
data-background-image="rmd_example.webp"
>
&lt;h2 id="r--yaml--markdown--web-ready">R + YAML + markdown = web ready&lt;/h2>
&lt;hr>
&lt;ul>
&lt;li>&lt;a href="https://learnxinyminutes.com/docs/yaml/" target="_blank" rel="noopener">Learn YAML in Y minutes&lt;/a>: tell the computer what you want to do with a document&lt;/li>
&lt;li>&lt;a href="https://rmarkdown.rstudio.com/authoring_basics.html" target="_blank" rel="noopener">R Markdown basics&lt;/a>: it is just a plain markdown that allows you to insert little R program &amp;lsquo;chunks&amp;rsquo;.&lt;/li>
&lt;li>&lt;a href="https://github.com/mundimark/awesome-markdown-editors" target="_blank" rel="noopener">Awesome markdown editors and pre-writers&lt;/a>: find a convenient tool&lt;/li>
&lt;li>&lt;a href="https://workspace.google.com/marketplace/app/docs_to_markdown/700168918607" target="_blank" rel="noopener">Google Docs to markdown&lt;/a>: practice by translating your Google Docs text to markdown. It is &lt;em>very&lt;/em> easy.&lt;/li>
&lt;/ul>
&lt;hr>
&lt;section data-noprocess data-shortcode-slide
data-background-image="retroharmonize_website.webp"
>
&lt;h2 id="package-and-release-a-team-effort">Package and release: a team effort&lt;/h2>
&lt;hr>
&lt;h2 id="our-open-source-development-projects">Our open source development projects&lt;/h2>
&lt;p>🔢 &lt;a href="https://dataset.dataobservatory.eu/" target="_blank" rel="noopener">dataset&lt;/a>: Synchronize datasets with global knowledge hubs #️⃣ &lt;a href="https://statcodelists.dataobservatory.eu/" target="_blank" rel="noopener">statcodelists&lt;/a>: Make your data codes understood globally ♻️ &lt;a href="https://iotables.dataobservatory.eu/" target="_blank" rel="noopener">iotables&lt;/a>: Create economic or environmental impact assessments in any EU country 🌍 &lt;a href="https://regions.dataobservatory.eu/" target="_blank" rel="noopener">regions&lt;/a>: Create from raw survey data more granular statistics in any EU country ✅ &lt;a href="https://retroharmonize.dataobservatory.eu/" target="_blank" rel="noopener">retroharmonize&lt;/a>: Harmonize questions banks, recycle answers from past surveys ⏭️ &lt;a href="https://reprex.nl/#releases" target="_blank" rel="noopener">all in on one page&lt;/a>&lt;/p>
&lt;hr>
&lt;section data-noprocess data-shortcode-slide
data-background-image="create_with_reprex.webp"
>
&lt;h2 id="create-with-us">Create with us&lt;/h2>
&lt;hr>
&lt;h1 id="questions">Questions?&lt;/h1>
&lt;p>&lt;a href="https://reprex.nl/#contact" target="_blank" rel="noopener">Email&lt;/a> | &lt;a href="https://keybase.io/team/reprexcommunity" target="_blank" rel="noopener">Keybase&lt;/a>&lt;/p>
&lt;p>LinkedIn: &lt;a href="https://www.linkedin.com/in/antaldaniel/" target="_blank" rel="noopener">Daniel Antal&lt;/a> - &lt;a href="https://www.linkedin.com/company/68855596" target="_blank" rel="noopener">Reprex&lt;/a> | &lt;a href="https://reprex.nl/" target="_blank" rel="noopener">Home&lt;/a>&lt;/p></description></item><item><title>stacodelists: use standard, language-independent variable codes to help international data interoperability and machine reuse in R</title><link>https://reprex-next.netlify.app/post/2022-06-29-statcodelists/</link><pubDate>Wed, 29 Jun 2022 08:12:00 +0100</pubDate><guid>https://reprex-next.netlify.app/post/2022-06-29-statcodelists/</guid><description>&lt;td style="text-align: center;">
&lt;figure id="figure-visit-the-documentation-website-of-statcodelists-on-statcodelistsdataobservatoryeuhttpsstatcodelistsdataobservatoryeu">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img alt="Visit the documentation website of statcodelists on [statcodelists.dataobservatory.eu/](https://statcodelists.dataobservatory.eu/)." srcset="
/media/img/blogposts_2022/statcodelists_website_huef7e1379be389a62e3a47c5a8502e55c_102481_0b514d80337ede30bff4c26cee6a6f11.webp 400w,
/media/img/blogposts_2022/statcodelists_website_huef7e1379be389a62e3a47c5a8502e55c_102481_1416f7a0950b1cecac8097850d995432.webp 760w,
/media/img/blogposts_2022/statcodelists_website_huef7e1379be389a62e3a47c5a8502e55c_102481_1200x1200_fit_q75_h2_lanczos_3.webp 1200w"
src="https://reprex-next.netlify.app/media/img/blogposts_2022/statcodelists_website_huef7e1379be389a62e3a47c5a8502e55c_102481_0b514d80337ede30bff4c26cee6a6f11.webp"
width="760"
height="428"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption>
Visit the documentation website of statcodelists on &lt;a href="https://statcodelists.dataobservatory.eu/" target="_blank" rel="noopener">statcodelists.dataobservatory.eu/&lt;/a>.
&lt;/figcaption>&lt;/figure>&lt;/td>
&lt;!-- badges: start -->
&lt;p>&lt;a href="https://dataobservatory.eu/" target="_blank" rel="noopener">
&lt;figure >
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img src="https://img.shields.io/badge/ecosystem-dataobservatory.eu-3EA135.svg" alt="dataobservatory" loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;/figure>
&lt;/a>
&lt;a href="https://doi.org/10.5281/zenodo.6751783" target="_blank" rel="noopener">
&lt;figure >
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img src="https://zenodo.org/badge/DOI/10.5281/zenodo.6751783.svg" alt="DOI" loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;/figure>
&lt;/a>&lt;/p>
&lt;!-- badges: end -->
&lt;p>The goal of &lt;code>statcodelists&lt;/code> is to promote the reuse and exchange of statistical information and related metadata with making the internationally standardized SDMX code lists available for the R user. SDMX – the &lt;a href="https://sdmx.org/" target="_blank" rel="noopener">Statistical Data and Metadata eXchange&lt;/a> has been published as an ISO International Standard (ISO 17369). The metadata definitions, including the codelists are updated regularly according to the standard. The authoritative version of the code lists made available in this package is &lt;a href="https://sdmx.org/?page_id=3215/" target="_blank" rel="noopener">https://sdmx.org/?page_id=3215/&lt;/a>.&lt;/p>
&lt;h2 id="purpose">Purpose&lt;/h2>
&lt;p>Cross-domain concepts in the SDMX framework describe concepts relevant to many, if not all, statistical domains. SDMX recommends using these concepts whenever feasible in SDMX structures and messages to promote the reuse and exchange of statistical information and related metadata between organisations.&lt;/p>
&lt;p>Code lists are predefined sets of terms from which some statistical coded concepts take their values. SDMX cross-domain code lists are used to support cross-domain concepts. What are these cross-domain coded concepts?&lt;/p>
&lt;ul>
&lt;li>Geographical codes, like &lt;code>NL&lt;/code>: the Netherlands in the &lt;a href="https://statcodelists.dataobservatory.eu/reference/CL_AREA.html" target="_blank" rel="noopener">CL_AREA&lt;/a> code list.&lt;/li>
&lt;li>Standard industry codes &lt;code>J631&lt;/code> for Data processing, hosting and related activities in Europe. (&lt;a href="https://statcodelists.dataobservatory.eu/reference/CL_ACTIVITY_NACE2.html" target="_blank" rel="noopener">NACE Rev 2&lt;/a> in Europe, beware, it is &lt;code>J592&lt;/code>in Australia and New Zealand, see &lt;a href="https://statcodelists.dataobservatory.eu/reference/CL_ACTIVITY_ANZSIC06.html" target="_blank" rel="noopener">CL_ACTIVITY_ANZSIC06&lt;/a>.)&lt;/li>
&lt;li>Occupations, like &lt;code>OC2521&lt;/code> for &lt;code>Database designers and administrators&lt;/code> in &lt;a href="https://statcodelists.dataobservatory.eu/reference/CL_OCCUPATION.html" target="_blank" rel="noopener">CL_OCCUPATIONS&lt;/a>&lt;/li>
&lt;li>Time fomatting standards, like &lt;code>CCYY&lt;/code> for annual data series in &lt;a href="https://statcodelists.dataobservatory.eu/reference/CL_TIME_FORMAT.html" target="_blank" rel="noopener">CL_TIME_FORMAT&lt;/a>.&lt;/li>
&lt;/ul>
&lt;p>Check out the available codlists on the &lt;a href="https://statcodelists.dataobservatory.eu/reference/index.html" target="_blank" rel="noopener">package homepage&lt;/a>.&lt;/p>
&lt;p>The use of common code lists will help users to work even more efficiently, easing the maintenance of and reducing the need for mapping systems and interfaces delivering data and metadata to them. A very obvious advantage of using the code systems is that you can retrieve data from national sources indifferent of the natural language used in North Macedonia, Japan, the U.S. or the Netherlands. While the data labels may change to be locally human-readable, computers and geeks can read the codes and understand them immediately. Provided that they use the standard codes.&lt;/p>
&lt;p>Our data observatories are rolling out SDMX coding across all datasets to help data ingestion and interoperability, data findability and data reuse. &lt;code>statcodelists&lt;/code> can help the use of standard SDMX codes in your R workflow&amp;ndash;both for downloading data from statistical agencies and to produce publication-ready datasets that the rest of the world (and even APIs) will understand.&lt;/p>
&lt;h2 id="installation">Installation&lt;/h2>
&lt;p>You can install &lt;code>statcodelists&lt;/code> from CRAN:&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-fallback" data-lang="fallback">&lt;span class="line">&lt;span class="cl">install.packages(&amp;#34;statcodelists&amp;#34;)
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>Further recommended code values for expressing general statistical concepts like &lt;code>not applicable&lt;/code>, etc., can be found in section &lt;code>Generic codes&lt;/code> of the &lt;a href="https://sdmx.org/?page_id=4345" target="_blank" rel="noopener">Guidelines for the creation and management of SDMX Cross-Domain Code Lists&lt;/a>.&lt;/p>
&lt;p>For further codelists used by reliable statistical agency but not harmonized on SDMX level please consult the &lt;a href="https://registry.sdmx.org/" target="_blank" rel="noopener">SDMX Global Registry&lt;/a> &lt;a href="https://registry.sdmx.org/items/codelist.html" target="_blank" rel="noopener">Codelists&lt;/a> page.&lt;/p>
&lt;p>The creator of this package is not affiliated with SDMX, and this package was has not been endorsed by SDMX.&lt;/p>
&lt;h2 id="code-of-conduct">Code of Conduct&lt;/h2>
&lt;p>Please note that the &lt;code>statcodelists&lt;/code> project is released with a &lt;a href="https://contributor-covenant.org/version/2/1/CODE_OF_CONDUCT.html" target="_blank" rel="noopener">Contributor Code of Conduct&lt;/a>. By contributing to this project, you agree to abide by its terms.&lt;/p></description></item><item><title>Including Indicators from Arab Barometer in Our Observatory</title><link>https://reprex-next.netlify.app/post/2021-06-28-arabbarometer/</link><pubDate>Mon, 28 Jun 2021 09:00:00 +0000</pubDate><guid>https://reprex-next.netlify.app/post/2021-06-28-arabbarometer/</guid><description>&lt;p>&lt;em>A new version of the retroharmonize R package – which is working with retrospective, ex post harmonization of survey data – was released yesterday after peer-review on CRAN. It allows us to compare opinion polling data from the Arab Barometer with the Eurobarometer and Afrorbarometer. This is the first version that is released in the rOpenGov community, a community of R package developers on open government data analytics and related topics.&lt;/em>&lt;/p>
&lt;p>Surveys are the most important data sources in social and economic
statistics – they ask people about their lives, their attitudes and
self-reported actions, or record data from companies and NGOs. Survey
harmonization makes survey data comparable across time and countries. It
is very important, because often we do not know without comparison if an
indicator value is &lt;em>low&lt;/em> or &lt;em>high&lt;/em>. If 40% of the people think that
&lt;em>climate change is a very serious problem&lt;/em>, it does not really tell us
much without knowing what percentage of the people answered this
question similarly a year ago, or in other parts of the world.&lt;/p>
&lt;p>With the help of Ahmed Shabani and Yousef Ibrahim, we created a third
case study after the
&lt;a href="https://retroharmonize.dataobservatory.eu/articles/eurobarometer.html" target="_blank" rel="noopener">Eurobarometer&lt;/a>,
and
&lt;a href="https://retroharmonize.dataobservatory.eu/articles/afrobarometer.html" target="_blank" rel="noopener">Afrobarometer&lt;/a>,
about working with the &lt;a href="https://retroharmonize.dataobservatory.eu/articles/arabbarometer.html" target="_blank" rel="noopener">Arab
Barometer&lt;/a>
harmonized survey data files.&lt;/p>
&lt;p>&lt;em>Ex ante&lt;/em> survey harmonization means that researchers design
questionnaires that are asking the same questions with the same survey
methodology in repeated, distinct times (waves), or across different
countries with carefully harmonized question translations. &lt;em>Ex post&lt;/em>
harmonizations means that the resulting data has the same variable
names, same variable coding, and can be joined into a tidy data frame
for joint statistical analysis. While seemingly a simple task, it
involves plenty of metadata adjustments, because established survey
programs like Eurobarometer, Afrobarometer or Arab Barometer have
several decades of history, and several decades of coding practices and
file formatting legacy.&lt;/p>
&lt;ul>
&lt;li>&lt;em>Variable harmonization&lt;/em> means that if the same question is called
in one microdata source &lt;code>Q108&lt;/code> and the other &lt;code>eval-parl-elections&lt;/code>
then we make sure that they get a harmonize and machine readable
name without spaces and special characters.&lt;/li>
&lt;li>&lt;em>Variable label harmonization&lt;/em> means that the same questionnaire
items get the same numeric coding and same categorical labels.&lt;/li>
&lt;li>&lt;em>Missing case harmonization&lt;/em> means that various forms of missingness
are treated the same way.&lt;/li>
&lt;/ul>
&lt;figure id="figure-for-the-evaluation-of-the-economic-situation-dataset-get-the-country-averages-and-aggregates-from-zenodohttpdoiorg105281zenodo5036432-and-the-plot-in-jpg-or-png-from-figsharehttpsfigsharecomarticlesfigurearab_barometer_5_econ_eval_by_country_png14865498">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img src="https://reprex-next.netlify.app/img/blogposts_2021/arab_barometer_5_evon_eval_by_country.png" alt="For the evaluation of the economic situation dataset, get the country averages and aggregates from [Zenodo](http://doi.org/10.5281/zenodo.5036432), and the plot in `jpg` or `png` from [figshare](https://figshare.com/articles/figure/arab_barometer_5_econ_eval_by_country_png/14865498)." loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption data-pre="Figure&amp;nbsp;" data-post=":&amp;nbsp;" class="numbered">
For the evaluation of the economic situation dataset, get the country averages and aggregates from &lt;a href="http://doi.org/10.5281/zenodo.5036432" target="_blank" rel="noopener">Zenodo&lt;/a>, and the plot in &lt;code>jpg&lt;/code> or &lt;code>png&lt;/code> from &lt;a href="https://figshare.com/articles/figure/arab_barometer_5_econ_eval_by_country_png/14865498" target="_blank" rel="noopener">figshare&lt;/a>.
&lt;/figcaption>&lt;/figure>
&lt;p>In our new &lt;a href="https://retroharmonize.dataobservatory.eu/articles/arabbarometer.html" target="_blank" rel="noopener">Arab Barometer case
study&lt;/a>,
the evaulation of parliamentary elections has the following labels. We
code them consistently &lt;code>1: free_and_fair&lt;/code>, &lt;code>2: some_minor_problems&lt;/code>,
&lt;code>3: some_major_problems&lt;/code> and &lt;code>4: not_free&lt;/code>.&lt;/p>
&lt;table>
&lt;colgroup>
&lt;col style="width: 50%" />
&lt;col style="width: 50%" />
&lt;/colgroup>
&lt;tbody>
&lt;tr class="odd">
&lt;td style="text-align: left;">“0. missing”&lt;/td>
&lt;td style="text-align: left;">“1. they were completely free and fair”&lt;/td>
&lt;/tr>
&lt;tr class="even">
&lt;td style="text-align: left;">“2. they were free and fair, with some minor problems”&lt;/td>
&lt;td style="text-align: left;">“3. they were free and fair, with some major problems”&lt;/td>
&lt;/tr>
&lt;tr class="odd">
&lt;td style="text-align: left;">“4. they were not free and fair”&lt;/td>
&lt;td style="text-align: left;">“8. i don’t know”&lt;/td>
&lt;/tr>
&lt;tr class="even">
&lt;td style="text-align: left;">“9. declined to answer”&lt;/td>
&lt;td style="text-align: left;">“Missing”&lt;/td>
&lt;/tr>
&lt;tr class="odd">
&lt;td style="text-align: left;">“They were completely free and fair”&lt;/td>
&lt;td style="text-align: left;">“They were free and fair, with some minor breaches”&lt;/td>
&lt;/tr>
&lt;tr class="even">
&lt;td style="text-align: left;">“They were free and fair, with some major breaches”&lt;/td>
&lt;td style="text-align: left;">“They were not free and fair”&lt;/td>
&lt;/tr>
&lt;tr class="odd">
&lt;td style="text-align: left;">“Don’t know”&lt;/td>
&lt;td style="text-align: left;">“Refuse”&lt;/td>
&lt;/tr>
&lt;tr class="even">
&lt;td style="text-align: left;">“Completely free and fair”&lt;/td>
&lt;td style="text-align: left;">“Free and fair, but with minor problems”&lt;/td>
&lt;/tr>
&lt;tr class="odd">
&lt;td style="text-align: left;">“Free and fair, with major problems”&lt;/td>
&lt;td style="text-align: left;">“Not free or fair”&lt;/td>
&lt;/tr>
&lt;tr class="even">
&lt;td style="text-align: left;">“Don’t know (Do not read)”&lt;/td>
&lt;td style="text-align: left;">“Decline to answer (Do not read)”&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>Of course, this harmonization is essential to get clean results like this:&lt;/p>
&lt;figure id="figure-for-evaluation-or-reuse-of-parliamentary-elections-dataset-get-the-replication-data-and-the-code-from-the-zenodohhttpsdoiorg105281zenodo5034759-open-repository">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img alt="For evaluation or reuse of parliamentary elections dataset get the replication data and the code from the [Zenodo](hhttps://doi.org/10.5281/zenodo.5034759) open repository." srcset="
/media/img/blogposts_2021/arabb-comparison-country-chart_hu876e56138097bf35e9ab80c0a7351314_159521_30b9d9bccbe8f347c912dbe10ef5159c.webp 400w,
/media/img/blogposts_2021/arabb-comparison-country-chart_hu876e56138097bf35e9ab80c0a7351314_159521_f7e62366b8310160e9cdd16714a5ac44.webp 760w,
/media/img/blogposts_2021/arabb-comparison-country-chart_hu876e56138097bf35e9ab80c0a7351314_159521_1200x1200_fit_q75_h2_lanczos_3.webp 1200w"
src="https://reprex-next.netlify.app/media/img/blogposts_2021/arabb-comparison-country-chart_hu876e56138097bf35e9ab80c0a7351314_159521_30b9d9bccbe8f347c912dbe10ef5159c.webp"
width="506"
height="760"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption data-pre="Figure&amp;nbsp;" data-post=":&amp;nbsp;" class="numbered">
For evaluation or reuse of parliamentary elections dataset get the replication data and the code from the &lt;a href="hhttps://doi.org/10.5281/zenodo.5034759">Zenodo&lt;/a> open repository.
&lt;/figcaption>&lt;/figure>
&lt;p>In our case study, we had three forms of missingness: the respondent
&lt;em>did not know&lt;/em> the answer, the respondent &lt;em>did not want&lt;/em> to answer, and
at last, in some cases the &lt;em>respondent was not asked&lt;/em>, because the
country held no parliamentary elections. While in numerical processing,
all these answers must be left out from calculating averages, for
example, in a more detailed, categorical analysis they represent very
different cases. A high level of refusal to answer may be an indicator
of surpressing democratic opinion forming in itself.&lt;/p>
&lt;p>Survey harmonization with many countries entails tens of thousands of
small data management task, which, unless automatically documented,
logged, and created with a reproducible code, is a helplessly
error-prone process. We believe that our open-source software will bring
many new statistical information to the light, which, while legally
open, was never processed due to the large investment needed.&lt;/p>
&lt;p>We also started building experimental APIs data is running
&lt;a href="https://retroharmonize.dataobservatory.eu/" target="_blank" rel="noopener">retroharmonize&lt;/a> regularly.
We will place cultural access and participation data in the &lt;a href="https://music.dataobservatory.eu/" target="_blank" rel="noopener">Digital
Music Observatory&lt;/a>, climate
awareness, policy support and self-reported mitigation strategies into
the &lt;a href="https://greendeal.dataobservatory.eu/" target="_blank" rel="noopener">Green Deal Data
Observatory&lt;/a>, and economy and
well-being data into our &lt;a href="https://economy.dataobservatory.eu/" target="_blank" rel="noopener">Economy Data
Observatory&lt;/a>.&lt;/p>
&lt;h2 id="further-plans">Further plans&lt;/h2>
&lt;p>Retrospective survey harmonization is a far more complex task than this
blogpost suggest. Retrospective survey harmonization is a far more complex task than this blogpost suggest, because established survey programs have gathered decades of legacy data in legacy coding schemes and legacy file formats. Putting the data right, and especially putting the invaluable descriptive and administrative (processing) metadata right is a huge undertaking. We are releasing example codes, datasets and charts for researchers to comapre our harmonized results with theirs, and improve our software. We are releasing example codes, datasets and charts
for researchers to comapre our harmonized results with theirs, and
improve our software.&lt;/p>
&lt;h3 id="use-our-software">Use our software&lt;/h3>
&lt;p>The &lt;code>retroharmonize&lt;/code> R package can be freely used, modified and
distributed under the GPL-3 license. For the main developer and
contributors, see the
&lt;a href="https://retroharmonize.dataobservatory.eu/" target="_blank" rel="noopener">package&lt;/a> homepage. If you
use it for your work, please kindly cite it as:&lt;/p>
&lt;p>Daniel Antal (2021). retroharmonize: Ex Post Survey Data Harmonization.
R package version 0.1.17. &lt;a href="https://doi.org/10.5281/zenodo.5034752" target="_blank" rel="noopener">https://doi.org/10.5281/zenodo.5034752&lt;/a>&lt;/p>
&lt;p>Download the &lt;a href="https://reprex-next.netlify.app/media/bibliography/cite-retroharmonize.bib" target="_blank">BibLaTeX entry&lt;/a>.&lt;/p>
&lt;h3 id="tutorial-to-work-with-the-arab-barometer-survey-data">Tutorial to work with the Arab Barometer survey data&lt;/h3>
&lt;p>Daniel Antal, &amp;amp; Ahmed Shaibani. (2021, June 26). Case Study: Working
With Arab Barometer Surveys for the retroharmonize R package (Version
0.1.6). Zenodo. &lt;a href="https://doi.org/10.5281/zenodo.5034759" target="_blank" rel="noopener">https://doi.org/10.5281/zenodo.5034759&lt;/a>&lt;/p>
&lt;p>For the replication data to report potential
&lt;a href="https://github.com/rOpenGov/retroharmonize/issues" target="_blank" rel="noopener">issues&lt;/a> and
improvement suggestions with the code:&lt;/p>
&lt;p>Daniel Antal, &amp;amp; Ahmed Shaibani. (2021). Replication Data for the
retroharmonize R Package Case Study: Working With Arab Barometer Surveys
(Version 0.1.6) [Data set]. Zenodo.
&lt;a href="https://doi.org/10.5281/zenodo.5034741" target="_blank" rel="noopener">https://doi.org/10.5281/zenodo.5034741&lt;/a>&lt;/p>
&lt;h3 id="experimental-api">Experimental API&lt;/h3>
&lt;p>We are also experimenting with the automated placement of authoritative
and citeable figures and datasets in open repositories. For the climate
awareness dataset get the country averages and aggregates from
&lt;a href="http://doi.org/10.5281/zenodo.5036432" target="_blank" rel="noopener">Zenodo&lt;/a>, and the plot in &lt;code>jpg&lt;/code>
or &lt;code>png&lt;/code> from &lt;a href="https://figshare.com/articles/figure/arab_barometer_5_econ_eval_by_country_png/14865498" target="_blank" rel="noopener">figshare&lt;/a>.
Our plan is to release open data in a modern API with rich descriptive
metadata meeting the &lt;em>Dublin Core&lt;/em> and &lt;em>DataCite&lt;/em> standards, and further
administrative metadata for correct coding, joining and further
manipulating or data, or for easy import into your database.&lt;/p>
&lt;h3 id="join-our-open-source-effort">Join our open source effort&lt;/h3>
&lt;p>Want to help us improve our open data service? Include
&lt;a href="https://www.latinobarometro.org/lat.jsp" target="_blank" rel="noopener">Lationbarómetro&lt;/a> and the
&lt;a href="https://caucasusbarometer.org/en/datasets/" target="_blank" rel="noopener">Caucasus Barometer&lt;/a> in our
offering? Join the rOpenGov community of R package developers, an our
open collaboration to create the automated data observatories. We are
not only looking for
&lt;a href="https://reprex-next.netlify.app/authors/developer/">developers&lt;/a>,
but &lt;a href="https://reprex-next.netlify.app/authors/curator/">data
curators&lt;/a> and
&lt;a href="https://reprex-next.netlify.app/authors/team/">service design
associates&lt;/a>, too.&lt;/p></description></item><item><title>Open Data - The New Gold Without the Rush</title><link>https://reprex-next.netlify.app/post/2021-06-18-gold-without-rush/</link><pubDate>Fri, 18 Jun 2021 17:00:00 +0000</pubDate><guid>https://reprex-next.netlify.app/post/2021-06-18-gold-without-rush/</guid><description>&lt;p>&lt;em>If open data is the new gold, why even those who release fail to reuse it? We created an open collaboration of data curators and open-source developers to dig into novel open data sources and/or increase the usability of existing ones. We transform reproducible research software into research- as-service.&lt;/em>&lt;/p>
&lt;p>Every year, the EU announces that billions and billions of data are now “open” again, but this is not gold. At least not in the form of nicely minted gold coins, but in gold dust and nuggets found in the muddy banks of chilly rivers. There is no rush for it, because panning out its value requires a lot of hours of hard work. Our goal is to automate this work to make open data usable at scale, even in trustworthy AI solutions.&lt;/p>
&lt;figure id="figure-there-is-no-rush-for-it-because-panning-out-its-value-requires-a-lot-of-hours-of-hard-work-our-goal-is-to-automate-this-work-to-make-open-data-usable-at-scale-even-in-trustworthy-ai-solutions">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img alt="There is no rush for it, because panning out its value requires a lot of hours of hard work. Our goal is to automate this work to make open data usable at scale, even in trustworthy AI solutions." srcset="
/media/img/slides/gold_panning_slide_notitle_hu8f7296f20da8c17f972a0534c44322c2_1382486_b042523dffe8143dea3d8c8c9c3262f4.webp 400w,
/media/img/slides/gold_panning_slide_notitle_hu8f7296f20da8c17f972a0534c44322c2_1382486_faa00e96d3d0b700cfcf1daa513f3ad2.webp 760w,
/media/img/slides/gold_panning_slide_notitle_hu8f7296f20da8c17f972a0534c44322c2_1382486_1200x1200_fit_q75_h2_lanczos_3.webp 1200w"
src="https://reprex-next.netlify.app/media/img/slides/gold_panning_slide_notitle_hu8f7296f20da8c17f972a0534c44322c2_1382486_b042523dffe8143dea3d8c8c9c3262f4.webp"
width="760"
height="428"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption data-pre="Figure&amp;nbsp;" data-post=":&amp;nbsp;" class="numbered">
There is no rush for it, because panning out its value requires a lot of hours of hard work. Our goal is to automate this work to make open data usable at scale, even in trustworthy AI solutions.
&lt;/figcaption>&lt;/figure>
&lt;p>Most open data is not public, it is not downloadable from the Internet – in the EU parlance, “open” only means a legal entitlement to get access to it. And even in the rare cases when data is open and public, often it is mired by data quality issues. We are working on the prototypes of a data-as-service and research-as-service built with open-source statistical software that taps into various and often neglected open data sources.&lt;/p>
&lt;p>We are in the prototype phase in June and our intentions are to have a well-functioning service by the time of the conference, because we are working only with open-source software elements; our technological readiness level is already very high. The novelty of our process is that we are trying to further develop and integrate a few open-source technology items into technologically and financially sustainable data-as-service and even research-as-service solutions.&lt;/p>
&lt;figure id="figure-our-review-of-about-80-eu-un-and-oecd-data-observatories-reveals-that-most-of-them-do-not-use-these-organizationss-open-data---instead-they-use-various-and-often-not-well-processed-proprietary-sources">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img alt="Our review of about 80 EU, UN and OECD data observatories reveals that most of them do not use these organizations&amp;#39;s open data - instead they use various, and often not well processed proprietary sources." srcset="
/media/img/observatory_screenshots/observatory_collage_16x9_800_hu47f74f5cdae63c7248c2367b9d148671_353025_0079ea9844f6c5e52b52fd0e627467a2.webp 400w,
/media/img/observatory_screenshots/observatory_collage_16x9_800_hu47f74f5cdae63c7248c2367b9d148671_353025_ecd6d08ba5e9bac19c8173546f036651.webp 760w,
/media/img/observatory_screenshots/observatory_collage_16x9_800_hu47f74f5cdae63c7248c2367b9d148671_353025_1200x1200_fit_q75_h2_lanczos_3.webp 1200w"
src="https://reprex-next.netlify.app/media/img/observatory_screenshots/observatory_collage_16x9_800_hu47f74f5cdae63c7248c2367b9d148671_353025_0079ea9844f6c5e52b52fd0e627467a2.webp"
width="760"
height="428"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption data-pre="Figure&amp;nbsp;" data-post=":&amp;nbsp;" class="numbered">
Our review of about 80 EU, UN and OECD data observatories reveals that most of them do not use these organizations&amp;rsquo;s open data - instead they use various, and often not well processed proprietary sources.
&lt;/figcaption>&lt;/figure>
&lt;p>We are taking a new and modern approach to the &lt;code>data observatory&lt;/code> concept, and modernizing it with the application of 21st century data and metadata standards, the new results of reproducible research and data science. Various UN and OECD bodies, and particularly the European Union support or maintain more than 60 data observatories, or permanent data collection and dissemination points, but even these do not use these organizations and their members open data. We are building open-source data observatories, which run open-source statistical software that automatically processes and documents reusable public sector data (from public transport, meteorology, tax offices, taxpayer funded satellite systems, etc.) and reusable scientific data (from EU taxpayer funded research) into new, high quality statistical indicators.&lt;/p>
&lt;figure id="figure-we-are-taking-a-new-and-modern-approach-to-the-data-observatory-concept-and-modernizing-it-with-the-application-of-21st-century-data-and-metadata-standards-the-new-results-of-reproducible-research-and-data-science">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img src="https://reprex-next.netlify.app/img/slides/automated_observatory_value_chain.jpg" alt="We are taking a new and modern approach to the ‘data observatory’ concept, and modernizing it with the application of 21st century data and metadata standards, the new results of reproducible research and data science" loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption data-pre="Figure&amp;nbsp;" data-post=":&amp;nbsp;" class="numbered">
We are taking a new and modern approach to the ‘data observatory’ concept, and modernizing it with the application of 21st century data and metadata standards, the new results of reproducible research and data science
&lt;/figcaption>&lt;/figure>
&lt;ul>
&lt;li>We are building various open-source data collection tools in R and Python to bring up data from big data APIs and legally open, but not public, and not well served data sources. For example, we are working on capturing representative data from the Spotify API or creating harmonized datasets from the Eurobarometer and Afrobarometer survey programs.&lt;/li>
&lt;li>Open data is usually not public; whatever is legally accessible is usually not ready to use for commercial or scientific purposes. In Europe, almost all taxpayer funded data is legally open for reuse, but it is usually stored in heterogeneous formats, processed into an original government or scientific need, and with various and low documentation standards. Our expert data curators are looking for new data sources that should be (re-) processed and re-documented to be usable for a wider community. We would like to introduce our service flow, which touches upon many important aspects of data scientist, data engineer and data curatorial work.&lt;/li>
&lt;li>We believe that even such generally trusted data sources as Eurostat often need to be reprocessed, because various legal and political constraints do not allow the common European statistical services to provide optimal quality data – for example, on the regional and city levels.&lt;/li>
&lt;li>With &lt;a href="https://reprex-next.netlify.app/authors/ropengov/">rOpenGov&lt;/a> and other partners, we are creating open-source statistical software in R to re-process these heterogenous and low-quality data into tidy statistical indicators to automatically validate and document it.&lt;/li>
&lt;li>We are carefully documenting and releasing administrative, processing, and descriptive metadata, following international metadata standards, to make our data easy to find and easy to use for data analysts.&lt;/li>
&lt;li>We are automatically creating depositions and authoritative copies marked with an individual digital object identifier (DOI) to maintain data integrity.&lt;/li>
&lt;li>We are building simple databases and supporting APIs that release the data without restrictions, in a tidy format that is easy to join with other data, or easy to join into databases, together with standardized metadata.&lt;/li>
&lt;li>We maintain observatory websites (see: &lt;a href="https://music.dataobservatory.eu/" target="_blank" rel="noopener">Digital Music Observatory&lt;/a>, &lt;a href="https://greendeal.dataobservatory.eu/" target="_blank" rel="noopener">Green Deal Data Observatory&lt;/a>, &lt;a href="https://economy.dataobservatory.eu/" target="_blank" rel="noopener">Economy Data Observatory&lt;/a>) where not only the data is available, but we provide tutorials and use cases to make it easier to use them. Our mission is to show a modern, 21st century reimagination of the data observatory concept developed and supported by the UN, EU and OECD, and we want to show that modern reproducible research and open data could make the existing 60 data observatories and the planned new ones grow faster into data ecosystems.&lt;/li>
&lt;/ul>
&lt;p>We are working around the open collaboration concept, which is well-known in open source software development and reproducible science, but we try to make this agile project management methodology more inclusive, and include data curators, and various institutional partners into this approach. Based around our early-stage startup, Reprex, and the open-source developer community rOpenGov, we are working together with other developers, data scientists, and domain specific data experts in climate change and mitigation, antitrust and innovation policies, and various aspects of the music and film industry.&lt;/p>
&lt;figure id="figure-our-open-collaboration-is-truly-open-new-data-curatorsauthorscuratordevelopersauthorsdeveloper-and-service-designersauthorsteam-even-volunteers-and-citizen-scientists-are-welcome-to-join">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img alt="Our open collaboration is truly open: new [data curators](/authors/curator/),[developers](/authors/developer/) and [service designers](/authors/team/), even volunteers and citizen scientists are welcome to join." srcset="
/media/img/observatory_screenshots/dmo_contributors_hua4f41ef7327b64bb97f169af135070bd_140729_a07a8e618fa7317f6f8256b9a334262e.webp 400w,
/media/img/observatory_screenshots/dmo_contributors_hua4f41ef7327b64bb97f169af135070bd_140729_3a4ae7f72478fd880961b08e1f7075dd.webp 760w,
/media/img/observatory_screenshots/dmo_contributors_hua4f41ef7327b64bb97f169af135070bd_140729_1200x1200_fit_q75_h2_lanczos_3.webp 1200w"
src="https://reprex-next.netlify.app/media/img/observatory_screenshots/dmo_contributors_hua4f41ef7327b64bb97f169af135070bd_140729_a07a8e618fa7317f6f8256b9a334262e.webp"
width="760"
height="427"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption data-pre="Figure&amp;nbsp;" data-post=":&amp;nbsp;" class="numbered">
Our open collaboration is truly open: new &lt;a href="https://reprex-next.netlify.app/authors/curator/">data curators&lt;/a>,&lt;a href="https://reprex-next.netlify.app/authors/developer/">developers&lt;/a> and &lt;a href="https://reprex-next.netlify.app/authors/team/">service designers&lt;/a>, even volunteers and citizen scientists are welcome to join.
&lt;/figcaption>&lt;/figure>
&lt;p>Our open collaboration is truly open: new &lt;a href="https://reprex-next.netlify.app/authors/curator/">data curators&lt;/a>, data scientists and data engineers are welcome to join. We develop open-source software in an agile way, so you can join in with an intermediate programming skill to build unit tests or add new functionality, and if you are a beginner, you can start with documentation and testing our tutorials. For business, policy, and scientific data analysts, we provide unexploited, exciting new datasets. Advanced developers can &lt;a href="https://reprex-next.netlify.app/authors/developer/">join&lt;/a> our development team: the statistical data creation is mainly made in the R language, and the service infrastructure in Python and Go components.&lt;/p></description></item><item><title>There are Numerous Advantages of Switching from a National Level of the Analysis to a Sub National Level</title><link>https://reprex-next.netlify.app/post/2021-06-16-regions-release/</link><pubDate>Wed, 16 Jun 2021 12:00:00 +0000</pubDate><guid>https://reprex-next.netlify.app/post/2021-06-16-regions-release/</guid><description>
&lt;figure >
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img src="https://reprex-next.netlify.app/img/package_screenshots/regions_017_169.png" alt="" loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;/figure>
&lt;p>The new version of our &lt;a href="https://ropengov.org/" target="_blank" rel="noopener">rOpenGov&lt;/a> R package
&lt;a href="https://regions.dataobservatory.eu/" target="_blank" rel="noopener">regions&lt;/a> was released today on
CRAN. This package is one of the engines of our experimental open
data-as-service &lt;a href="https://greendeal.dataobservatory.eu/" target="_blank" rel="noopener">Green Deal Data
Observatory&lt;/a> , &lt;a href="https://economy.dataobservatory.eu/" target="_blank" rel="noopener">Economy Data
Observatory&lt;/a> , &lt;a href="https://music.dataobservatory.eu/" target="_blank" rel="noopener">Digital Music
Observatory&lt;/a> prototypes, which aim to
place open data packages into open-source applications.&lt;/p>
&lt;p>In international comparison the use of nationally aggregated indicators
often have many disadvantages: they inhibit very different levels of
homogeneity, and data is often very limited in number of observations
for a cross-sectional analysis. When comparing European countries, a few
missing cases can limit the cross-section of countries to around 20
cases which disallows the use of many analytical methods. Working with
sub-national statistics has many advantages: the similarity of the
aggregation level and high number of observations can allow more precise
control of model parameters and errors, and the number of observations
grows from 20 to 200-300.&lt;/p>
&lt;figure id="figure-the-change-from-national-to-sub-national-level-comes-with-a-huge-data-processing-price-internal-administrative-boundaries-their-names-codes-codes-change-very-frequently">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img alt="The change from national to sub-national level comes with a huge data processing price: internal administrative boundaries, their names, codes codes change very frequently." srcset="
/media/img/blogposts_2021/indicator_with_map_hue9f606f6489f63a22f67aeb7e2b3402b_98843_df043b13fb62aa7b45aa15fad51f4229.webp 400w,
/media/img/blogposts_2021/indicator_with_map_hue9f606f6489f63a22f67aeb7e2b3402b_98843_09a0d6124e334c5f1727420a059512a9.webp 760w,
/media/img/blogposts_2021/indicator_with_map_hue9f606f6489f63a22f67aeb7e2b3402b_98843_1200x1200_fit_q75_h2_lanczos_3.webp 1200w"
src="https://reprex-next.netlify.app/media/img/blogposts_2021/indicator_with_map_hue9f606f6489f63a22f67aeb7e2b3402b_98843_df043b13fb62aa7b45aa15fad51f4229.webp"
width="760"
height="428"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption data-pre="Figure&amp;nbsp;" data-post=":&amp;nbsp;" class="numbered">
The change from national to sub-national level comes with a huge data processing price: internal administrative boundaries, their names, codes codes change very frequently.
&lt;/figcaption>&lt;/figure>
&lt;p>Yet the change from national to sub-national level comes with a huge
data processing price. While national boundaries are relatively stable,
with only a handful of changes in each recent decade. The change of
national boundaries requires a more-or-less global consensus. But states
are free to change their internal administrative boundaries, and they do
it with large frequency. This means that the names, identification codes
and boundary definitions of sub-national regions change very frequently.
Joining data from different sources and different years can be very
difficult.&lt;/p>
&lt;figure id="figure-our-regions-r-packagehttpsregionsdataobservatoryeu-helps-the-data-processing-validation-and-imputation-of-sub-national-regional-datasets-and-their-coding">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img alt="Our [regions R package](https://regions.dataobservatory.eu/) helps the data processing, validation and imputation of sub-national, regional datasets and their coding." srcset="
/media/img/blogposts_2021/recoded_indicator_with_map_hubda8124fbfd6305eacfd3d4f0fcd06cc_71873_65df57cf4311bb2623535a1a5be044c0.webp 400w,
/media/img/blogposts_2021/recoded_indicator_with_map_hubda8124fbfd6305eacfd3d4f0fcd06cc_71873_81a53fd42fac7f0c3fe4e1a89d5b7892.webp 760w,
/media/img/blogposts_2021/recoded_indicator_with_map_hubda8124fbfd6305eacfd3d4f0fcd06cc_71873_1200x1200_fit_q75_h2_lanczos_3.webp 1200w"
src="https://reprex-next.netlify.app/media/img/blogposts_2021/recoded_indicator_with_map_hubda8124fbfd6305eacfd3d4f0fcd06cc_71873_65df57cf4311bb2623535a1a5be044c0.webp"
width="760"
height="428"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption data-pre="Figure&amp;nbsp;" data-post=":&amp;nbsp;" class="numbered">
Our &lt;a href="https://regions.dataobservatory.eu/" target="_blank" rel="noopener">regions R package&lt;/a> helps the data processing, validation and imputation of sub-national, regional datasets and their coding.
&lt;/figcaption>&lt;/figure>
&lt;p>There are numerous advantages of switching from a national level of the
analysis to a sub-national level comes with a huge price in data
processing, validation and imputation, and the
&lt;a href="https://regions.dataobservatory.eu/" target="_blank" rel="noopener">regions&lt;/a> package aims to help this
process.&lt;/p>
&lt;p>You can review the problem, and the code that created the two map
comparisons, in the &lt;a href="https://regions.dataobservatory.eu/articles/maping.html" target="_blank" rel="noopener">Maping Regional Data, Maping Metadata
Problems&lt;/a>
vignette article of the package. A more detailed problem description can
be found in &lt;a href="https://regions.dataobservatory.eu/articles/Regional_stats.html" target="_blank" rel="noopener">Working With Regional, Sub-National Statistical
Products&lt;/a>.&lt;/p>
&lt;p>This package is an offspring of the
&lt;a href="https://ropengov.github.io/eurostat/" target="_blank" rel="noopener">eurostat&lt;/a> package on
&lt;a href="https://ropengov.github.io/" target="_blank" rel="noopener">rOpenGov&lt;/a>. It started as a tool to
validate and re-code regional Eurostat statistics, but it aims to be a
general solution for all sub-national statistics. It will be developed
parallel with other rOpenGov packages.&lt;/p>
&lt;h2 id="get-the-package">Get the Package&lt;/h2>
&lt;p>You can install the development version from
&lt;a href="https://github.com/" target="_blank" rel="noopener">GitHub&lt;/a> with:&lt;/p>
&lt;pre>&lt;code>devtools::install_github(&amp;quot;rOpenGov/regions&amp;quot;)
&lt;/code>&lt;/pre>
&lt;p>or the released version from CRAN:&lt;/p>
&lt;pre>&lt;code>install.packages(&amp;quot;regions&amp;quot;)
&lt;/code>&lt;/pre>
&lt;p>You can review the complete package documentation on
&lt;a href="https://regions.dataobservatory.eu/" target="_blank" rel="noopener">regions.dataobservaotry.eu&lt;/a>. If
you find any problems with the code, please raise an issue on
&lt;a href="https://github.com/rOpenGov/regions" target="_blank" rel="noopener">Github&lt;/a>. Pull requests are welcome
if you agree with the &lt;a href="https://contributor-covenant.org/version/2/0/CODE_OF_CONDUCT.html" target="_blank" rel="noopener">Contributor Code of
Conduct&lt;/a>&lt;/p>
&lt;p>If you use &lt;code>regions&lt;/code> in your work, please cite the
package as:
Daniel Antal, Kasia Kulma, Istvan Zsoldos, &amp;amp; Leo Lahti. (2021, June 16). regions (Version 0.1.7). CRAN. &lt;a href="%28https://doi.org/10.5281/zenodo.4965909%29">http://doi.org/10.5281/zenodo.4965909&lt;/a>&lt;/p>
&lt;p>&lt;a href="https://cran.r-project.org/package=regions" target="_blank" rel="noopener">
&lt;figure >
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img src="https://www.r-pkg.org/badges/version/regions" alt="CRAN_Status_Badge" loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;/figure>
&lt;/a>&lt;/p>
&lt;h2 id="join-us">Join us&lt;/h2>
&lt;p>&lt;em>Join our open collaboration Economy Data Observatory team as a &lt;a href="https://reprex-next.netlify.app/authors/curator">data curator&lt;/a>, &lt;a href="https://reprex-next.netlify.app/authors/developer">developer&lt;/a> or &lt;a href="https://reprex-next.netlify.app/authors/team">business developer&lt;/a>. More interested in environmental impact analysis? Try our &lt;a href="https://greendeal.dataobservatory.eu/#contributors" target="_blank" rel="noopener">Green Deal Data Observatory&lt;/a> team! Or your interest lies more in data governance, trustworthy AI and other digital market problems? Check out our &lt;a href="https://music.dataobservatory.eu/#contributors" target="_blank" rel="noopener">Digital Music Observatory&lt;/a> team!&lt;/em>&lt;/p>
&lt;p>&lt;a href="https://twitter.com/intent/follow?screen_name=EconDataObs" target="_blank" rel="noopener">
&lt;figure >
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img src="https://img.shields.io/twitter/follow/EconDataObs.svg?style=social" alt="Follow GreenDealObs" loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;/figure>
&lt;/a>&lt;/p></description></item><item><title>Creating Algorithmic Tools to Interpret and Communicate Open Data Efficiently</title><link>https://reprex-next.netlify.app/post/2021-06-04-developer-leo-lahti/</link><pubDate>Fri, 04 Jun 2021 10:00:00 +0000</pubDate><guid>https://reprex-next.netlify.app/post/2021-06-04-developer-leo-lahti/</guid><description>&lt;p>&lt;strong>As a developer at rOpenGov, what type of data do you usually use in your work?&lt;/strong>&lt;/p>
&lt;p>As an academic data scientist whose research focuses on the development of general-purpose algorithmic methods, I work with a range of applications from life sciences to humanities. Population studies play a big role in our research, and often the information that we can draw from public sources - geospatial, demographic, environmental - provides invaluable support. We typically use open data in combination with sensitive research data but some of the research questions can be readily addressed based on open data from statistical authorities such as Statistics Finland or Eurostat.&lt;/p>
&lt;figure >
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img src="https://reprex-next.netlify.app/img/partners/rOpenGov-intro.png" alt="" loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;/figure>
&lt;p>&lt;strong>In your ideal data world, what would be the ultimate dataset, or datasets that you would like to see in the Music Data Observatory?&lt;/strong>&lt;/p>
&lt;p>One line of our research analyses the historical trends and spread of knowledge production, in particular book printing based on large-scale metadata collections. It would be interesting to extend this research to music, to understand the contemporary trends as well as the broader historical developments. Gaining access to a large systematic collection of music and composition data from different countries across long periods of time would make this possible.&lt;/p>
&lt;p>&lt;strong>Why did you decide to join the challenge and why do you think that this would be a game changer for researchers and policymakers?&lt;/strong>&lt;/p>
&lt;p>Joining the challenge was a natural development based on our overall activities in this area; &lt;a href="http://ropengov.org/community/" target="_blank" rel="noopener">the rOpenGov project&lt;/a> has been around for a decade now, since the early days of the broader open data movement. This has also created an active international developer network and we felt well equipped for picking up the challenge. The game changer for researchers is that the project highlights the importance of data quality, even when dealing with official statistics, and provides new methods to solve these issues efficiently through the open collaboration model. For policymakers, this provides access to new high-quality curated data and case studies that can support evidence-based decision-making.&lt;/p>
&lt;p>&lt;strong>Do you have a favorite, or most used open governmental or open science data source? What do you think about it? Could it be improved?&lt;/strong>&lt;/p>
&lt;p>Regarding open government data, one of my favorites is not a single data source but a data representation standard. The &lt;a href="https://www.scb.se/en/services/statistical-programs-for-px-files/#:~:text=PX%20is%20a%20standard%20format,and%20data." target="_blank" rel="noopener">px format&lt;/a> is widely used by statistical authorities in various countries, and this has allowed us to create R tools that allow the retrieval and analysis of official statistics from many countries across Europe, spanning dozens of statistical institutions. Standardization of open data formats allows us to build robust algorithmic tools for downstream data analysis and visualization. Open government data is still too often shared in obscure, non-standard or closed-source file formats and this is creating significant bottlenecks for the development of scalable and interoperable AI and machine learning methods that can harness the full potential of open data.&lt;/p>
&lt;figure id="figure-regarding-open-government-data-one-of-my-favorites-is-not-a-single-data-source-but-a-data-representation-standard-the-px-format">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img src="https://reprex-next.netlify.app/img/developers/PxWeb.png" alt="Regarding open government data, one of my favorites is not a single data source but a data representation standard, the Px format." loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption data-pre="Figure&amp;nbsp;" data-post=":&amp;nbsp;" class="numbered">
Regarding open government data, one of my favorites is not a single data source but a data representation standard, the Px format.
&lt;/figcaption>&lt;/figure>
&lt;p>&lt;strong>From your perspective, what do you see being the greatest problem with open data in 2021?&lt;/strong>&lt;/p>
&lt;p>Although there are a variety of open data sources available (and the numbers continue to increase), the availability of open algorithmic tools to interpret and communicate open data efficiently is lagging behind. One of the greatest challenges for open data in 2021 is to demonstrate how we can maximize the potential of open data by designing smart tools for open data analytics.&lt;/p>
&lt;p>&lt;strong>What can our automated data observatories do to make open data more credible in the European economic policy community and be accepted as verified information?&lt;/strong>&lt;/p>
&lt;p>The role of the professional network backing up the project, and the possibility of getting critical feedback and later adoption by the academic communities will support the efforts. Transparency of the data harmonization operations is the key to credibility, and will be further supported by concrete benchmarks that highlight the critical differences in drawing conclusions based on original sources versus the harmonized high-quality data sets.&lt;/p>
&lt;figure id="figure-we-need-to-get-critical-feedback-and-later-adoption-by-the-academic-communities">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img alt="We need to get critical feedback and later adoption by the academic communities." srcset="
/media/img/observatory_screenshots/greendeal_and_zenodo_huddcd7485e56cb33c97d3e664ae383275_281994_debfc54dcf2193c7c800dab0f36de429.webp 400w,
/media/img/observatory_screenshots/greendeal_and_zenodo_huddcd7485e56cb33c97d3e664ae383275_281994_3b536090581f2795373e801d65371e20.webp 760w,
/media/img/observatory_screenshots/greendeal_and_zenodo_huddcd7485e56cb33c97d3e664ae383275_281994_1200x1200_fit_q75_h2_lanczos_3.webp 1200w"
src="https://reprex-next.netlify.app/media/img/observatory_screenshots/greendeal_and_zenodo_huddcd7485e56cb33c97d3e664ae383275_281994_debfc54dcf2193c7c800dab0f36de429.webp"
width="760"
height="507"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption data-pre="Figure&amp;nbsp;" data-post=":&amp;nbsp;" class="numbered">
We need to get critical feedback and later adoption by the academic communities.
&lt;/figcaption>&lt;/figure>
&lt;p>&lt;strong>How we can ensure the long-term sustainability of the efforts?&lt;/strong>&lt;/p>
&lt;p>The extent of open data space is such that no single individual or institution can address all the emerging needs in this area. The open developer networks play a huge role in the development of algorithmic methods, and strong communities have developed around specific open data analytical environments such as R, Python, and Julia. These communities support networked collaboration and provide services such as software peer review. The long-term sustainability will depend on the support that such developer communities can receive, both from individual contributors as well as from institutions and governments.&lt;/p>
&lt;figure id="figure-join-our-open-collaboration-economy-data-observatory-team-as-a-data-curatorauthorscurator-developerauthorsdeveloper-or-business-developerauthorsteam-or-share-your-data-in-our-public-repository-economy-data-observatory-on-zenodohttpszenodoorgcommunitieseconomy_observatory">
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img src="https://reprex-next.netlify.app/img/observatory_screenshots/edo_and_zenodo.png" alt="Join our open collaboration Economy Data Observatory team as a [data curator](/authors/curator), [developer](/authors/developer) or [business developer](/authors/team), or share your data in our public repository [Economy Data Observatory on Zenodo](https://zenodo.org/communities/economy_observatory/)" loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;figcaption data-pre="Figure&amp;nbsp;" data-post=":&amp;nbsp;" class="numbered">
Join our open collaboration Economy Data Observatory team as a &lt;a href="https://reprex-next.netlify.app/authors/curator">data curator&lt;/a>, &lt;a href="https://reprex-next.netlify.app/authors/developer">developer&lt;/a> or &lt;a href="https://reprex-next.netlify.app/authors/team">business developer&lt;/a>, or share your data in our public repository &lt;a href="https://zenodo.org/communities/economy_observatory/" target="_blank" rel="noopener">Economy Data Observatory on Zenodo&lt;/a>
&lt;/figcaption>&lt;/figure>
&lt;h2 id="join-us">Join us&lt;/h2>
&lt;p>&lt;em>Join our open collaboration Economy Data Observatory team as a &lt;a href="https://reprex-next.netlify.app/authors/curator">data curator&lt;/a>, &lt;a href="https://reprex-next.netlify.app/authors/developer">developer&lt;/a> or &lt;a href="https://reprex-next.netlify.app/authors/team">business developer&lt;/a>. More interested in environmental impact analysis? Try our &lt;a href="https://greendeal.dataobservatory.eu/#contributors" target="_blank" rel="noopener">Green Deal Data Observatory&lt;/a> team! Or your interest lies more in data governance, trustworthy AI and other digital market problems? Check out our &lt;a href="https://music.dataobservatory.eu/#contributors" target="_blank" rel="noopener">Digital Music Observatory&lt;/a> team!&lt;/em>&lt;/p></description></item><item><title>Reproducible Survey Harmonization: retroharmonize Is Released</title><link>https://reprex-next.netlify.app/post/2020-09-21-retroharmonize_release/</link><pubDate>Mon, 21 Sep 2020 11:31:39 +0000</pubDate><guid>https://reprex-next.netlify.app/post/2020-09-21-retroharmonize_release/</guid><description>&lt;p>Our original intention was to make surveying more accessible for music and creative industry partners, by relying more on already existing survey data, and better designing complementary, smaller surveys, becasue surveying, opinion polling is becoming increasingly expensive in the develop world. People are less and less likely to sit down for an interview in their houses. We have tried to harmonize our custom surveys, particuarly with Kantar in Hungary and Focus in Slovakia with exisiting EU projects. But we ended up making a part of international survey harmonization across countries and throughout years easier to automate.&lt;/p>
&lt;p>
&lt;figure >
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img src="https://reprex-next.netlify.app/img/packages/ab_plot1.png" alt="Harmonized results from Afrobarometer" loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;/figure>
&lt;/p>
&lt;p>Surveys are like sensors for natural sciences and industrial production. They are essential for almost any social and economic statistical indicator, for calculating the inflation, parts of the GDP, participation in education programs. Making surveys easier to harmonize and exploit more already existing survey data can bring down research cost, and can increase research value at the same time. (See our earlier blog post &lt;a href="https://dataobservatory.eu/post/2020-07-10-retroharmonize/" target="_blank" rel="noopener">Increase The Value Of Market Research With Open Data And Survey Harmonization&lt;/a>.)&lt;/p>
&lt;p>So, if you are an R user, you can use &lt;code>install.packages(“retroharmonize”)&lt;/code> to get the released 0.1.13 version and make tutorials with real Eurobarometer or Afrobarometer microdata. With &lt;code>devtools::install_github(&amp;quot;antaldaniel/retroharmonize&amp;quot;)&lt;/code> you can already install the current development version 0.1.14, which handles perl-like regex, which will be necessary for our next tutorial in the making for &lt;a href="https://www.arabbarometer.org/" target="_blank" rel="noopener">Arab Barometer&lt;/a>.&lt;/p>
&lt;p>&lt;strong>Related&lt;/strong>:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>&lt;a href="https://retroharmonize.dataobservatory.eu/" target="_blank" rel="noopener">retroharmonize package website&lt;/a>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;a href="https://github.com/antaldaniel/retroharmonize/" target="_blank" rel="noopener">retroharmonize on github&lt;/a>&lt;/p>
&lt;/li>
&lt;/ul></description></item><item><title>Launching Our Demo Music Observatory</title><link>https://reprex-next.netlify.app/post/2020-09-15-music-observatory-launch/</link><pubDate>Tue, 15 Sep 2020 08:00:39 +0000</pubDate><guid>https://reprex-next.netlify.app/post/2020-09-15-music-observatory-launch/</guid><description>&lt;p>Today, on 15 September 2020, we officially launched our &lt;code>minimal viable product&lt;/code> as we promised to partners back in February. This was a particularly difficult period for everybody. We aspired to deliver by September in a very different environment, our hopes for commissioned work went up in flames with the pandemic, and our targeted users, musicians and music entrepreneurs, talent managers, music venues lost most of their income. The organizations helping them, granting authorities, export offices and collective management societies are overwhelmed with the problem. During these troublesome times, our team expanded, attracted great new talent, and kept working.&lt;/p>
&lt;p>Our first product is the &lt;a href="https://music.dataobservatory.eu/" target="_blank" rel="noopener">Demo Music Observatory&lt;/a>, a collaborative, automated research-based &lt;a href="https://dataobservatory.eu/faq/observatories/" target="_blank" rel="noopener">observatory&lt;/a> for the music industry, one that is particularly hard hit by the COVID19 crisis. Not only great artists, composers, technicians, managers fell victim to the virus, but musicians lost about 50–90% of their income from live music. This translates to a 100% loss for the live music technicians and managers.&lt;/p>
&lt;p>
&lt;div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
&lt;iframe src="https://www.youtube.com/embed/fQJHflWPS34" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video">&lt;/iframe>
&lt;/div>
See our &lt;a href="https://dataobservatory.eu/post/2020-09-11-creating-automated-observatory/" target="_blank" rel="noopener">earlier blogpost&lt;/a> on what you see on the video.&lt;/p>
&lt;p>The music industry was never a place for great job security. For putting up a show, you usually need a network of 10–200 artists, technicians and managers to work together as freelancers without all those social benefits that many people enjoy in other walks of life. We have been trying to figure out how to help this microenterprise and freelancer-network based industry with research for five years. Our aim is to make them competitive when they are talking with their buyers: Google, Apple, Spotify, who are really heavy-weight data and AI pros. Our better plan their tours, when they will be back on the road, to understand what sort of audiences and purchasing power waits for them in different European cities.&lt;/p>
&lt;p>We are launching at a time when the music industry is crying for help.Therefore, we have decided to make our demo observatory open and unfinished. Over the last 7 years, we have built up about 2000 music and creative sector indicators to be used for business KPIs, forecasting targets, grant evaluations, royalty valuations, concert demography target group analysis and other professional uses. We would like to open up, based on your needs, about 50 well-designed indicators, and pledge to keep it daily refreshed, corrected, documented, citaable, downloadable. Also, feel free to use our most valuable source code—use it for your own purposes, even modify it, as long as you keep it open.&lt;/p>
&lt;p>For our smaller partners, we follow what musicians do these days on Bandcamp: name your price. We make a pledge to our small partners: if you need reliable data to plan your next grant calls, calculate royalties, compensations, predict hit candidates, give us the job—and name your price. Post-corona, you can take for a dollar the best music from Bandcamp. You can take our research products, for a limited period, for any amount you name, as long as it is for a good cause and serves the industry, musicians, technicians or managers. In return, we ask for your feedback. Help us validate whether we are on the right track, tell us how we can cooperate after the pandemic, in better times.&lt;/p>
&lt;p>Our larger and better funded partners? We ask you to pay the price we name, because we believe that it is a well-justified, fair and competitive price, set by pricing experts.&lt;/p>
&lt;p>We appreciate it if you take a look at our offering, or if you pass this blogpost on to your colleagues in the industry. Our main target audience initially are music professional in broader Europe, but we are planning to cover all major global markets very soon, too. Feedback from the U.S., Australia, Canada, Colombia, Brazil &amp;amp; Argentina is particularly welcome as we have great plans over there!&lt;/p>
&lt;h2 id="who-we-are">Who we are?&lt;/h2>
&lt;p>We &lt;a href="https://dataobservatory.eu/post/2020-08-24-start-up/" target="_blank" rel="noopener">started&lt;/a> our operations on 1 September 2020 on the basis of &lt;a href="http://documentation.ceemid.eu/" target="_blank" rel="noopener">CEEMID&lt;/a>, a pan-European data observatory that created about 2000 music and creative industry indicators for its users. In the coming days, we are gradually opening up about 50 &lt;a href="https://music.dataobservatory.eu/" target="_blank" rel="noopener">music industry&lt;/a> and 50 broader creative industry indicators in a fully reproducible workflow, with daily re-freshed, re-processed, well-formatted and documented indicators for business and policy decisions.&lt;/p>
&lt;p>We would like to validate this approach in one of the world&amp;rsquo;s most prestigious university-backed incubator programs, in the &lt;a href="https://www.yesdelft.com/yes-programs/ai-blockchain-validation-lab/" target="_blank" rel="noopener">Yes!Delft AI/Blockchain Validation Lab&lt;/a>. We&amp;rsquo;re finalist on their selection, and all help before 23 September from our friends in the music industry is more than appreciated. If we get there, we can rely on probably the best pros in Europe to make our offering better tailored and financially sustainable.&lt;/p>
&lt;h2 id="get-in-touch">Get in touch!&lt;/h2>
&lt;p>We use the very simple and extremely secure &lt;strong>keybase.io&lt;/strong>, a kind of mix of Whatsapp, Skype, Google Drive, One Drive and zoom. You can get in touch on that platform with us in anytime &lt;a href="https://keybase.io/team/reprexcommunity" target="_blank" rel="noopener">here&lt;/a>.&lt;/p>
&lt;p>You can easily contact on LinkedIn &lt;a href="https://www.linkedin.com/in/antaldaniel/" target="_blank" rel="noopener">Daniel&lt;/a> or &lt;a href="https://www.linkedin.com/in/k%C3%A1tya-nagy-a9447730/" target="_blank" rel="noopener">Kátya&lt;/a> and of course, we have a usually working &lt;a href="https://dataobservatory.eu/#about" target="_blank" rel="noopener">email contact form&lt;/a>, too. Our email is name.surname at our main domain.&lt;/p>
&lt;h2 id="video-credits">Video credits&lt;/h2>
&lt;ul>
&lt;li>Data acquisition and processing: Daniel Antal, CFA and Marta Kołczyńska, PhD (&lt;a href="https://music.dataobservatory.eu/economy.html#demand" target="_blank" rel="noopener">survey data&lt;/a>).&lt;/li>
&lt;li>Documentation automation: Sandor Budai&lt;/li>
&lt;li>Video art: Line Matson&lt;/li>
&lt;li>Music: &lt;a href="https://www.youtube.com/moonmoonmoon" target="_blank" rel="noopener">Moon Moon Moon&lt;/a>.&lt;/li>
&lt;/ul></description></item><item><title>Creating An Automated Data Observatory</title><link>https://reprex-next.netlify.app/post/2020-09-11-creating-automated-observatory/</link><pubDate>Fri, 11 Sep 2020 16:00:39 +0000</pubDate><guid>https://reprex-next.netlify.app/post/2020-09-11-creating-automated-observatory/</guid><description>&lt;p>We are building data ecosystems, so called observatories, where scientific, business, policy and civic users can find factual information, data, evidence for their domain. Our open source, open data, open collaboration approach allows to connect various open and proprietary data sources, and our reproducible research workflows allow us to automate data collection, processing, publication, documentation and presentation.&lt;/p>
&lt;p>Our scripts are checking data sources, such as Eurostat&amp;rsquo;s Eurobase, Spotify&amp;rsquo;s API and other music industry sources every day for new information, and process any data corrections or new disclosure, interpolate, backcast or forecast missing values, make currency translations and unit conversions. This is shown illustrated with an &lt;a href="https://dataobservatory.eu/post/2020-07-25-reproducible_ingestion/" target="_blank" rel="noopener">earlier post&lt;/a>.&lt;/p>
&lt;div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
&lt;iframe src="https://www.youtube.com/embed/fQJHflWPS34" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video">&lt;/iframe>
&lt;/div>
&lt;p>For direct access to the file visit &lt;a href="https://dataobservatory.eu/video/making-of-dmo.mp4" target="_blank" rel="noopener">this link&lt;/a>.&lt;/p>
&lt;p>In the video we show automated the creation of an observatory website with well-formatted, statistical data dissemination, a technical document in PDF and an ebook can be automated. In our view, our technology is particularly useful technology in business and scientific researech projects, where it is important that always the most timely and correct data is being analyzed, and remains automatically documented and cited. We are ready deploy public, collaborative, or private data observatories in short time.&lt;/p>
&lt;p>Data processing costs can be as high as 80% for any in-house AI deployment project. We work mainly with organization that do not have in house data science team, and acquire their data anyway from outside the organization. In their case, this rate can be as high as 95%, meaning that getting and processing the data for deploying AI can be 20x more expensive than the AI solution itself.&lt;/p>
&lt;p>AI solutions require a large amount of standardized, well processed data to learn from. We want to radically decrease the cost of data acquisition and processing for our users so that exploiting AI becomes in their reach. This is particularly important in one of our target industries, the music industries, where most of the global sales is algorithmic and AI-driven. Artists, bands, small labels, publishers, even small country national associations cannot remain competitive if they cannot participate in this technological revolution.&lt;/p>
&lt;p>We &lt;a href="https://dataobservatory.eu/post/2020-08-24-start-up/" target="_blank" rel="noopener">started&lt;/a> our operations on 1 September 2020 on the basis of &lt;a href="http://documentation.ceemid.eu/" target="_blank" rel="noopener">CEEMID&lt;/a>, a pan-European data observatory that created about 2000 music and creative industry indicators for its users. In the coming days, we are gradually opening up about 50 &lt;a href="https://music.dataobservatory.eu/" target="_blank" rel="noopener">music industry&lt;/a> and 50 broader creative industry indicators in a fully reproducible workflow, with daily re-freshed, re-processed, well-formatted and documented indicators for business and policy decisions.&lt;/p>
&lt;p>We would like to validate this approach in one of the world&amp;rsquo;s most prestigious university-backed incubator programs, in the &lt;a href="https://www.yesdelft.com/yes-programs/ai-blockchain-validation-lab/" target="_blank" rel="noopener">Yes!Delft AI/Blockchain Validation Lab&lt;/a>.&lt;/p>
&lt;h2 id="video-credits">Video credits&lt;/h2>
&lt;ul>
&lt;li>Data acquisition and processing: Daniel Antal, CFA and Marta Kołczyńska, PhD (&lt;a href="https://music.dataobservatory.eu/economy.html#demand" target="_blank" rel="noopener">survey data&lt;/a>).&lt;/li>
&lt;li>Documentation automation: Sandor Budai&lt;/li>
&lt;li>Video art: Line Matson&lt;/li>
&lt;li>Music: &lt;a href="https://www.youtube.com/moonmoonmoon" target="_blank" rel="noopener">Moon Moon Moon&lt;/a>.&lt;/li>
&lt;/ul></description></item><item><title>Starting-up</title><link>https://reprex-next.netlify.app/post/2020-08-24-start-up/</link><pubDate>Mon, 24 Aug 2020 10:15:00 +0000</pubDate><guid>https://reprex-next.netlify.app/post/2020-08-24-start-up/</guid><description>&lt;p>The big day has come: the co-founders singed off the documents at the public notary and started the registration of a reproducible research start-up in Leiden. We got a lot of support from our friends! Your encouragement gives us a lot of energy to accomplish our first milestones, and to get Reprex B.V. going!&lt;/p>
&lt;blockquote>
&lt;p>Reprex means &amp;lsquo;reproducible example&amp;rsquo; in data science. When you are stuck with a problem, creating a reproducible example allows other computer scientists, statisticians, programmers or data users to solve it. In 80% of the cases, you usually find the solution while creating a generalized example. In the 20% other cases, you can reach out for help easily.&lt;/p>
&lt;/blockquote>
&lt;p>In the coming days, we are launching demo versions of our headline products, data observatories. &lt;a href="https://music.dataobservatory.eu/index.html" target="_blank" rel="noopener">music.dataobservatory.eu&lt;/a> will be a fully automated online service that every day collects, processes, cleans, and publishes scientifically valid data about European music. Very soon after we will launch two other observatories.&lt;/p>
&lt;p>The creative and cultural sector, NGOs, most research institutions, data journalism teams are usually very small, and they do not have internal IT or data science capacities. We would like to provide them a transparent, high quality, and fully open source solution to acquire data, process it without errors, document it and make sense of it. We would like to embrace the idea of open collaboration among creative enterprises, scientific researchers, NGOs, data journalists and policymakers with our work.&lt;/p>
&lt;p>Our work will comply with the &lt;a href="https://www.bitss.org/opa/" target="_blank" rel="noopener">Open Policy Analysis&lt;/a> standards developed by the &lt;a href="https://www.bitss.org/" target="_blank" rel="noopener">Berkeley Initiative for Transparency in the Social Sciences&lt;/a> &amp;amp; &lt;a href="https://cega.berkeley.edu/" target="_blank" rel="noopener">Center for Effective Global Action&lt;/a> and the four principles of &lt;a href="http://dataobservatory.eu/reproducible/" target="_blank" rel="noopener">reproducible research&lt;/a>: reviewability, replicability, confirmability and auditability. We believe that these standards apply in reproducible finance, empirical evidence presentation in courts, or advocating sound policies and producing high-quality journalism.&lt;/p>
&lt;h2 id="help">Do you want to help our start?&lt;/h2>
&lt;p>We would like to enter into the Validation Lab of one of the best artificial intelligence incubators in early September. Talented team members, letters of intents and assignments from organizations will give a lot of credibility to our start &lt;a href="http://dataobservatory.eu/team/" target="_blank" rel="noopener">Meet our team »&lt;/a>.&lt;/p>
&lt;ul>
&lt;li>
&lt;p>Put as in contact with people who love to write code in R and interested in automating business and social science research and primary data collection such as surveying. &lt;a href="http://dataobservatory.eu/#featured" target="_blank" rel="noopener">Check out what sort of code we create »&lt;/a>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Introduce us to people who need data and information to make better informed decision and analysis in music, film, book publishing, photography services or socially responsible finance.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Share contacts of data journalists who would like to develop stories from big survey programs like &lt;a href="https://ec.europa.eu/commfrontoffice/publicopinion/index.cfm" target="_blank" rel="noopener">Eurobarometer&lt;/a>, &lt;a href="https://www.afrobarometer.org/" target="_blank" rel="noopener">Afrobarometer&lt;/a> and &lt;a href="https://www.latinobarometro.org/lat.jsp" target="_blank" rel="noopener">Lationbarometro&lt;/a>, or base their storytelling on data and its visualizations. &lt;a href="http://retroharmonize.satellitereport.com/" target="_blank" rel="noopener">See our survey harmonization examples »&lt;/a>&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>Do you know such people? Send over this post or connect us in an email or social media message!&lt;/p>
&lt;p>&lt;em>Thanks again for your good wishes and encouragements, and hope to hear from you soon!&lt;/em>&lt;/p></description></item></channel></rss>