\chapter{Datasets}
%general
% from archive.org \cite{archivestackexchange}
% list of datasets
% selected largest datasets; smaller datasets are too sparse to draw conclusions, statistical chance of outliers too big, outliers would affect the outcome too much
% larger datasets yield more consistent results
% datasets include data since inception of community until some date
%TODO find last dates

%sections 1 per site
\section{StackOverflow.com}
%TODO insert values
StackOverflow is the largest and oldest community of the StackExchange platform.
The community has 165567 registered users, of whom 3467 were active in May of 2019. Members asked 116797 questions in total and gave 202751 answers, with an average answer density of 1.73 answers per question. New users asked 42996 questions, with an average of 1.129 questions per new user during their first week after registration.

\begin{figure}[H]
  \begin{subfigure}[c]{0.5\textwidth}
    \includegraphics[scale=0.35]{../stackoverflow.com/output/posthist/activeusers-i3.png}
    \subcaption{Active users with activity in the last 3 months}
    \label{so_activeusers}
  \end{subfigure}
  \begin{subfigure}[c]{0.5\textwidth}
    \includegraphics[scale=0.35]{../stackoverflow.com/output/posthist/postsanswers-i3.png}
    \subcaption{Question and answer counts over time}
    \label{so_postsanswers}
  \end{subfigure}
\end{figure}

\section{math.stackexchange.com}
``Mathematics Stack Exchange is a question and answer site for people studying math at any level and professionals in related fields.'' \cite{mathstackexchangecom}
The community has 551397 registered users, of whom 18080 were active in May of 2019. Members asked 1066979 questions in total and gave 1440948 answers, with an average answer density of 1.35 answers per question. New users asked 248867 questions, with an average of 1.34 questions per new user during their first week after registration.
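
As a quick consistency check, the answer density reported for each community is assumed here to be the total number of answers divided by the total number of questions. The symbols $d$, $N_a$ and $N_q$ are introduced only for this illustration. For math.stackexchange.com this gives
\[
  d = \frac{N_a}{N_q} = \frac{1440948}{1066979} \approx 1.35 ,
\]
which agrees with the value stated for this community.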

\begin{figure}[H]
  \begin{subfigure}[c]{0.5\textwidth}
    \includegraphics[scale=0.35]{../math.stackexchange.com/output/posthist/activeusers-i3.png}
    \subcaption{Active users with activity in the last 3 months}
    \label{math_activeusers}
  \end{subfigure}
  \begin{subfigure}[c]{0.5\textwidth}
    \includegraphics[scale=0.35]{../math.stackexchange.com/output/posthist/postsanswers-i3.png}
    \subcaption{Question and answer counts over time}
    \label{math_postsanswers}
  \end{subfigure}
\end{figure}

\section{MathOverflow.net}
MathOverflow.net is a rather small community for professional mathematicians.
The community has 94559 registered users, of whom 1718 were active in May of 2019. Members asked 100922 questions in total and gave 139077 answers, with an average answer density of 1.378 answers per question. New users asked 22794 questions, with an average of 1.134 questions per new user during their first week after registration.

\begin{figure}[H]
  \begin{subfigure}[c]{0.5\textwidth}
    \includegraphics[scale=0.35]{../mathoverflow.net/output/posthist/activeusers-i3.png}
    \subcaption{Active users with activity in the last 3 months}
    \label{matho_activeusers}
  \end{subfigure}
  \begin{subfigure}[c]{0.5\textwidth}
    \includegraphics[scale=0.35]{../mathoverflow.net/output/posthist/postsanswers-i3.png}
    \subcaption{Question and answer counts over time}
    \label{matho_postsanswers}
  \end{subfigure}
\end{figure}

\section{AskUbuntu.com}
AskUbuntu.com is a rather small community for Ubuntu users and developers.
The community has 783614 registered users, of whom 7033 were active in February of 2020. Members asked 334194 questions in total and gave 418051 answers, with an average answer density of 1.25 answers per question. New users asked 157018 questions, with an average of 1.101 questions per new user during their first week after registration.

\begin{figure}[H]
  \begin{subfigure}[c]{0.5\textwidth}
    \includegraphics[scale=0.35]{../askubuntu.com/output/posthist/activeusers-i3.png}
    \subcaption{Active users with activity in the last 3 months}
    \label{ubuntu_activeusers}
  \end{subfigure}
  \begin{subfigure}[c]{0.5\textwidth}
    \includegraphics[scale=0.35]{../askubuntu.com/output/posthist/postsanswers-i3.png}
    \subcaption{Question and answer counts over time}
    \label{ubuntu_postsanswers}
  \end{subfigure}
\end{figure}

\section{ServerFault.com}
%TODO insert values
ServerFault.com is a rather small community for system and network administrators.
The community has 165567 registered users, of whom 3467 were active in May of 2019. Members asked 116797 questions in total and gave 202751 answers, with an average answer density of 1.73 answers per question. New users asked 42996 questions, with an average of 1.129 questions per new user during their first week after registration.

\begin{figure}[H]
  \begin{subfigure}[c]{0.5\textwidth}
    \includegraphics[scale=0.35]{../serverfault.com/output/posthist/activeusers-i3.png}
    \subcaption{Active users with activity in the last 3 months}
    \label{fault_activeusers}
  \end{subfigure}
  \begin{subfigure}[c]{0.5\textwidth}
    \includegraphics[scale=0.35]{../serverfault.com/output/posthist/postsanswers-i3.png}
    \subcaption{Question and answer counts over time}
    \label{fault_postsanswers}
  \end{subfigure}
\end{figure}

\section{SuperUser.com}
SuperUser.com is a rather small community for computer enthusiasts and power users.
The community has 766028 registered users, of whom 11643 were active in May of 2019. Members asked 396611 questions in total and gave 561645 answers, with an average answer density of 1.416 answers per question. New users asked 147080 questions, with an average of 1.091 questions per new user during their first week after registration.

\begin{figure}[H]
  \begin{subfigure}[c]{0.5\textwidth}
    \includegraphics[scale=0.35]{../superuser.com/output/posthist/activeusers-i3.png}
    \subcaption{Active users with activity in the last 3 months}
    \label{super_activeusers}
  \end{subfigure}
  \begin{subfigure}[c]{0.5\textwidth}
    \includegraphics[scale=0.35]{../superuser.com/output/posthist/postsanswers-i3.png}
    \subcaption{Question and answer counts over time}
    \label{super_postsanswers}
  \end{subfigure}
\end{figure}

\section{electronics.stackexchange.com}
electronics.stackexchange.com is a rather small community for electrical engineering.
The community has 165567 registered users, of whom 3467 were active in May of 2019. Members asked 116797 questions in total and gave 202751 answers, with an average answer density of 1.73 answers per question. New users asked 42996 questions, with an average of 1.129 questions per new user during their first week after registration.

\begin{figure}[H]
  \begin{subfigure}[c]{0.5\textwidth}
    \includegraphics[scale=0.35]{../electronics.stackexchange.com/output/posthist/activeusers-i3.png}
    \subcaption{Active users with activity in the last 3 months}
    \label{elec_activeusers}
  \end{subfigure}
  \begin{subfigure}[c]{0.5\textwidth}
    \includegraphics[scale=0.35]{../electronics.stackexchange.com/output/posthist/postsanswers-i3.png}
    \subcaption{Question and answer counts over time}
    \label{elec_postsanswers}
  \end{subfigure}
\end{figure}

\section{stats.stackexchange.com (Cross Validated)}
``Cross Validated is a question and answer site for people interested in statistics, machine learning, data analysis, data mining, and data visualization.'' \cite{statsstackexchangecom}
The community has 202879 registered users, of whom 5252 were active in May of 2019. Members asked 137318 questions in total and gave 135350 answers, with an average answer density of 0.985 answers per question. New users asked 52588 questions, with an average of 1.113 questions per new user during their first week after registration.

\begin{figure}[H]
  \begin{subfigure}[c]{0.5\textwidth}
    \includegraphics[scale=0.35]{../stats.stackexchange.com/output/posthist/activeusers-i3.png}
    \subcaption{Active users with activity in the last 3 months}
    \label{stats_activeusers}
  \end{subfigure}
  \begin{subfigure}[c]{0.5\textwidth}
    \includegraphics[scale=0.35]{../stats.stackexchange.com/output/posthist/postsanswers-i3.png}
    \subcaption{Question and answer counts over time}
    \label{stats_postsanswers}
  \end{subfigure}
\end{figure}

\section{tex.stackexchange.com}
tex.stackexchange.com is a rather small community for \TeX{} and related typesetting systems.
The community has 155352 registered users, of whom 3630 were active in May of 2019. Members asked 173991 questions in total and gave 135350 answers, with an average answer density of 0.777 answers per question. New users asked 55313 questions, with an average of 1.195 questions per new user during their first week after registration.

\begin{figure}[H]
  \begin{subfigure}[c]{0.5\textwidth}
    \includegraphics[scale=0.35]{../tex.stackexchange.com/output/posthist/activeusers-i3.png}
    \subcaption{Active users with activity in the last 3 months}
    \label{tex_activeusers}
  \end{subfigure}
  \begin{subfigure}[c]{0.5\textwidth}
    \includegraphics[scale=0.35]{../tex.stackexchange.com/output/posthist/postsanswers-i3.png}
    \subcaption{Question and answer counts over time}
    \label{tex_postsanswers}
  \end{subfigure}
\end{figure}

\section{unix.stackexchange.com}
unix.stackexchange.com is a rather small community for Linux and Unix-like operating systems.
The community has 316144 registered users, of whom 4624 were active in May of 2019. Members asked 158714 questions in total and gave 236797 answers, with an average answer density of 1.492 answers per question. New users asked 56211 questions, with an average of 1.128 questions per new user during their first week after registration.

\begin{figure}[H]
  \begin{subfigure}[c]{0.5\textwidth}
    \includegraphics[scale=0.35]{../unix.stackexchange.com/output/posthist/activeusers-i3.png}
    \subcaption{Active users with activity in the last 3 months}
    \label{unix_activeusers}
  \end{subfigure}
  \begin{subfigure}[c]{0.5\textwidth}
    \includegraphics[scale=0.35]{../unix.stackexchange.com/output/posthist/postsanswers-i3.png}
    \subcaption{Question and answer counts over time}
    \label{unix_postsanswers}
  \end{subfigure}
\end{figure}

% general information
% dataset from to dates
% #user, #questions, #answers, #votes, #avg answer/question
%plots
% #users
% #questions, #answers