This commit is contained in:
wea_ondara
2020-08-13 17:52:15 +02:00
parent d018ead5b5
commit b2a8d59ddc
5 changed files with 82 additions and 31 deletions

View File

@@ -13,16 +13,16 @@ StackExchange is a Q\&A platform and consists of 174 communities \cite{stackexch
% write new numbers about so %TODO % write new numbers about so %TODO
In August of 2018, the StackExchange team introduced a small change which may have had a huge impact on the platform. They added a new feature to visibly mark questions from new contributors, as part of their effort to make the site more welcoming for new users \cite{post2018come}. Specifically members who want to answer a question created by a new contributor are shown a notification in the answer box that this question is from a new contributor. The StackExchange team hopes that this little change encourages members to be more friendly and forgiving toward new users. In August of 2018, the StackExchange team introduced a small change which may have had a huge impact on the platform. They added a new feature to visibly mark questions from new contributors, as part of their effort to make the site more welcoming for new users \cite{post2018come}. Specifically, members who want to answer a question created by a new contributor are shown a notification in the answer box that this question is from a new contributor. The StackExchange team hopes that this little change encourages members to be more friendly and forgiving toward new users.
% write about the change investigated % write about the change investigated
% stackexchange new contriutor post: https://meta.stackexchange.com/questions/314287/come-take-a-look-at-our-new-contributor-indicator?cb=1 % stackexchange new contriutor post: https://meta.stackexchange.com/questions/314287/come-take-a-look-at-our-new-contributor-indicator?cb=1
% what did change intend? % what did change intend?
This thesis evaluates whether this change has a real impact on the community and if so in how the community reacts. For this analysis, this thesis utilizes Vader \cite{hutto2014vader}, a sentiment analysis tool, to measure the sentiments of the answers submitted to questions of new contributors. An interrupted time series is then applied to these values to evalutate whether the change achieved its purpose of making the platform more welcoming. This thesis evaluates whether this change has a real impact on the community and if so in how the community reacts. For this analysis, this thesis utilizes Vader \cite{hutto2014vader}, a sentiment analysis tool, to measure the sentiments of the answers submitted to questions of new contributors. Furhermore, this thesis includes the votes these questions recieve and the number of questions new contributors ask. Interrupted time series are then applied to these values to evalutate whether the change achieved its purpose of making the platform more welcoming.
% how is change investigated by this thesis % how is change investigated by this thesis
% vader library % vader library
This thesis investigates the ten largest communities of the StackExchange platform measured by number of posts. This includes prominent communities, for instance, StackOverflow, MathOverflow, Math, AskUbuntu, and SuperUser as well as some lesser known communities. This thesis investigates the ten largest communities of the StackExchange platform measured by number of posts. This includes most prominent communities, for instance, StackOverflow, MathOverflow, Math, AskUbuntu, and SuperUser as well as some lesser known communities.
% write about other communities (e.g. i investigated) % write about other communities (e.g. i investigated)
@@ -30,7 +30,7 @@ This thesis investigates the ten largest communities of the StackExchange platfo
%write a bit about results %write a bit about results
%write about next chapters %write about next chapters
The remaining part of this thesis is structured as follows: Section 2 explains StackExchange, how it works, and shows related work. Section 3 shows the method this thesis uses for analysis. Section 4 contains the investigated datasets. Results are presented in Section 5 and discussed in Section 6. Section 7 conculdes this thesis. The remaining part of this thesis is structured as follows: Section 2 explains StackExchange, how it works, and shows related work. Section 3 shows the method this thesis uses for analysis in detail. Section 4 contains the investigated datasets. Results are presented in Section 5 and discussed in Section 6. Section 7 conculdes this thesis.

View File

@@ -1,6 +1,6 @@
\chapter{Method} \chapter{Method}
StackExchange introduced a \emph{new contributor} indicator to all communities on $21^{st}$ of August in 2018 at 9 pm UTC \cite{post2018come}. This step is one of many StackExchange took to make the platform and its members more welcoming towards new users. This indicator is shown to potential answerers in the answer text box of a question flagged as from a new contributor as shown in figure \ref{newcontributor}. The indicator is added to a question if the question is the first contribution of a user or if the first contribution (question or answer) of the user was less than 7 days ago \cite{sonic2018what}. The indicator is then shown for 7 days from the creation date of the question. Note that the user can be registered for a long time and then post their first question and it is counted as a question from a new contributor. Also, if a user decides to delete all their contributions from the site and then creates a new question this question will have the \emph{new contributor} indicator attached. The sole deciding factor for the indicator is the date and time of the first non-deleted contribution and the 7-day window afterward. StackExchange introduced a \emph{new contributor} indicator to all communities on $21^{st}$ of August in 2018 at 9 pm UTC \cite{post2018come}. This step is one of many StackExchange took to make the platform and its members more welcoming towards new users. This indicator is shown to potential answerers in the answer text box of a question flagged as from a new contributor as shown in figure \ref{newcontributor}. The indicator is added to a question if the question is the first contribution of the user or if the first contribution (question or answer) of the user was less than 7 days ago \cite{sonic2018what}. The indicator is then shown for 7 days from the creation date of the question. Note that the user can be registered for a long time and then post their first question and it is counted as a question from a new contributor. Also, if a user decides to delete all their existing contributions from the site and then creates a new question this question will have the \emph{new contributor} indicator attached. The sole deciding factor for the indicator is the date and time of the first non-deleted contribution and the 7-day window afterward.
\begin{figure} \begin{figure}
\centering\includegraphics[scale=0.47]{figures/new_contributor} \centering\includegraphics[scale=0.47]{figures/new_contributor}
@@ -26,7 +26,7 @@ StackExchange provides anonymized data dumps of all their communities for resear
% broken entries, missing user id % broken entries, missing user id
% answers in html -> strip html and remove code sections, no contribution to sentiment % answers in html -> strip html and remove code sections, no contribution to sentiment
After preprocessing the raw data, relevant data is filtered and computed. Questions and answers in the data are mixed together and have to be separated and answers have to be linked to their questions. Also, questions in these datasets do not have the \emph{new contributor} indicator attached to them and neither do users. So, the first contribution date and time of users have to be calculated via the creation dates of the questions and answers the user has posted. Then, questions are filtered per user and by whether they are created within the 7-day window after the first contribution of the user. These questions were created during the period where the \emph{new contributor} indicator would have been displayed, in case the questions had been posted before the change, or has been displayed after the change. From these questions, all answers which arrived within the 7-day window are considered for the analysis. Answers which arrived at a later point are excluded as the answerer most likely has not seen the disclaimer shown in figure \ref{newcontributor}. Included answers are then analyzed with Vader and the resulting sentiments are stored. After preprocessing the raw data, relevant data is filtered and computed. Questions and answers in the data are mixed together and have to be separated and answers have to be linked to their questions. Also, questions in these datasets do not have the \emph{new contributor} indicator attached to them and neither do users. So, the first contribution date and time of users have to be calculated via the creation dates of the questions and answers the user has posted. Then, questions are filtered per user and by whether they are created within the 7-day window after the first contribution of the user. These questions were created during the period where the \emph{new contributor} indicator would have been displayed, in case the questions had been posted before the change, or has been displayed after the change. From these questions, all answers which arrived within the 7-day window are considered for the analysis. Answers which arrived at a later point are excluded as the answerer most likely has not seen the disclaimer shown in figure \ref{newcontributor}. Included answers are then analyzed with Vader and the resulting sentiments are stored. Furhtermore, votes to questions of new contributors are counted if they arrived within the 7-day window and count 1 if it is an upvote and -1 if it is a downvote. Moreover, number of questions new contributors ask are counted and divided into two classes: 1st-question of a user and follow up questions of a new contributor.
% calc sentiment for answers % calc sentiment for answers
% questions do not have a tag if from a new contribtor -> calc first contributor % questions do not have a tag if from a new contribtor -> calc first contributor
@@ -43,7 +43,7 @@ After preprocessing the raw data, relevant data is filtered and computed. Questi
\section{Analysis} \section{Analysis}
An interrupted time series (ITS) analysis captures trends before and after a change in a system and fits very well with the question this thesis investigates. ITS can be applied to a large variety of data if the data contains the same kind of data points before and after the change and when the change date and time are known. \citeauthor{bernal2017interrupted} published a paper on how ITS works \cite{bernal2017interrupted}. ITS performes well on medical data, for instance, when a new treatment is introduced ITS can visualize if the treatment improves a condition. For ITS no control group is required and often control groups are not feasible. ITS only works with the before and after data and a point in time where a change was introduced. An interrupted time series (ITS) analysis captures trends before and after a change in a system and fits very well with the question this thesis investigates. ITS can be applied to a large variety of data if the data contains the same kind of data points before and after the change and when the change date and time are known. \citeauthor{bernal2017interrupted} published a paper on how ITS works \cite{bernal2017interrupted}. ITS performes well on medical data, for instance, when a new treatment is introduced ITS can visualize if the treatment improves a condition. For ITS no control group is required and often control groups are not feasible. ITS only works with the before and after data and a point in time where a change was introduced.
ITS relies on linear regression and tries to fit a three-segment linear function to the data. The authors also described cases where more than three segments are used but these models quickly raise the complexity of the analysis and for this thesis a three-segment linear regression is sufficient. The three segments are lines to fit the data before and after the change as well as one line to connect the other two lines at the change date. Figure \ref{itsexample} shows an example of an ITS. Each segment is captured by a tensor of the following formula $Y_t = \beta_0 + \beta_1T + \beta_2X_t + \beta_3TX_t$, where $T$ represents time as a number, for instance, number of months since the start of data recording, $X_t$ represents 0 or 1 depending on whether the change is in effect, $\beta_0$ represents the value at $T = 0$, $\beta_1$ represents the slope before the change, $\beta_2$ represents the value when the change is introduced, and $\beta_3$ represents the slope after the change. Contrary to the method in \cite{bernal2017interrupted} where the ITS is performed on aggregated values per month, this thesis performs the ITS on single data points, as the premise that the aggregated values all have the same weight within a certain margin is not fulfilled. Performing the ITS with aggregated values would skew the linear regression more towards data points with less weight. Single data point fitting prevents this, as weight is taken into account with more data points. ITS relies on linear regression and tries to fit a three-segment linear function to the data. The authors also described cases where more than three segments are used but these models quickly raise the complexity of the analysis and for this thesis a three-segment linear regression is sufficient. The three segments are lines to fit the data before and after the change as well as one line to connect the other two lines at the change date. Figure \ref{itsexample} shows an example of an ITS. Each segment is captured by a tensor of the following formula $Y_t = \beta_0 + \beta_1T + \beta_2X_t + \beta_3TX_t$, where $T$ represents time as a number, for instance, number of months since the start of data recording, $X_t$ represents 0 or 1 depending on whether the change is in effect, $\beta_0$ represents the value at $T = 0$, $\beta_1$ represents the slope before the change, $\beta_2$ represents the value when the change is introduced, and $\beta_3$ represents the slope after the change. Contrary to the method in \cite{bernal2017interrupted} where the ITS is performed on aggregated values per month, this thesis performs the ITS on single data points, as the premise that the aggregated values all have the same weight within a certain margin is not fulfilled for sentiment and vote score values. Performing the ITS with aggregated values would skew the linear regression more towards data points with less weight. Single data point fitting prevents this, as weight is taken into account with more data points.
\begin{figure} \begin{figure}

View File

@@ -1,7 +1,7 @@
\chapter{Results} \chapter{Results}
%TODO some text here %TODO some text here
This section shows the results of the experiments described in section 3 on the data sets described in section 4. In the following diagrams, the blue line states the (a) average sentiment of the answers to questions from new contributors/(b) average vote score of questions from new contributors. This line also has numbers attached to it at every datapoint and shows (a) the number of answers that formed the sentiment average/(b) the number of questions that formed the average vote score. The orange line shows ITS analysis as a 3-segment line. This section shows the results of the experiments described in section 3 on the data sets described in section 4. In the following diagrams, the blue line states the (a) average sentiment of the answers to questions from new contributors/(b) average vote score of questions from new contributors and (c) the number of 1st and follow up questions of new contributors. These lines also have numbers attached to it at every datapoint and shows (a) the number of answers that formed the sentiment average/(b) the number of questions that formed the average vote score. The orange line shows ITS analysis as a 3-segment line.
% pvalues ... % pvalues ...
@@ -9,7 +9,9 @@ This section shows the results of the experiments described in section 3 on the
%TODO write some text to each result %TODO write some text to each result
\pagebreak
\section{StackOverflow.com} \section{StackOverflow.com}
StackOverflow shows a very slight decrease in average sentiment of time before the change had been introduced. When the change occured the average sentiment jumped up. After the change the sentiments reached higher levels and kept rising. The average vote score rose right before and stay fairly constant after the change. this indicates that the vote score was not affected by the change. However, the number of questions from new contributors increased after the change while prir to the change the was fairly constant. The number of follow up questions from new contributors were declinied prior to the change and rose after the change.
\begin{figure}[H] \begin{figure}[H]
\begin{subfigure}[t]{0.5\textwidth} \begin{subfigure}[t]{0.5\textwidth}
\includegraphics[scale=0.37]{../stackoverflow.com/output/its/average_sentiments-i1.png} \includegraphics[scale=0.37]{../stackoverflow.com/output/its/average_sentiments-i1.png}
@@ -20,15 +22,19 @@ This section shows the results of the experiments described in section 3 on the
\includegraphics[scale=0.37]{../stackoverflow.com/output/votesits/average_votes-i1.png} \includegraphics[scale=0.37]{../stackoverflow.com/output/votesits/average_votes-i1.png}
\caption{An interrupted time series analysis of the vote score of questions created by new contributors on StackOverflow.com} \caption{An interrupted time series analysis of the vote score of questions created by new contributors on StackOverflow.com}
\label{stackoverflow_votesits} \label{stackoverflow_votesits}
\end{subfigure}\\
\begin{subfigure}[c]{0.5\textwidth}
\includegraphics[scale=0.37]{../stackoverflow.com/output/questionits/average_questions-i1.png}
\caption{An interrupted time series analysis of the number of questions created by new contributors on StackOverflow.com}
\label{stackoverflow_questionsits}
\end{subfigure} \end{subfigure}
\end{figure} \end{figure}
StackOverflow shows a very slight decrease in average sentiment of time before the change had been introduced. When the change occured the average sentiment jumped up. After the change the sentiments reached higher levels and kept rising.
% sentiment falling prior to change % sentiment falling prior to change
% jump upward at the change % jump upward at the change
% sentiments rising after change % sentiments rising after change
\section{math.stackexchange.com} \section{math.stackexchange.com}
The math.stackexchange.com community shows a decrease in average sentiments prior to the change. The sentiment make a small jump upward when the change is introduced, however, the sentiments decrease faster after the introduction of the change compared to before the change. The math.stackexchange.com community shows a decrease in average sentiments, vote score, and number of questions prior to the change. The measurements make a small jump upward when the change is introduced, however, they continue their downward trend after the introduction of the change. Only the number of follow up questions stabilized and began increasing after the change.
\begin{figure}[H] \begin{figure}[H]
\begin{subfigure}[t]{0.5\textwidth} \begin{subfigure}[t]{0.5\textwidth}
\includegraphics[scale=0.37]{../math.stackexchange.com/output/its/average_sentiments-i1.png} \includegraphics[scale=0.37]{../math.stackexchange.com/output/its/average_sentiments-i1.png}
@@ -39,30 +45,40 @@ The math.stackexchange.com community shows a decrease in average sentiments prio
\includegraphics[scale=0.37]{../math.stackexchange.com/output/votesits/average_votes-i1.png} \includegraphics[scale=0.37]{../math.stackexchange.com/output/votesits/average_votes-i1.png}
\caption{An interrupted time series analysis of the vote score of questions created by new contributors on math.stackexchange.com} \caption{An interrupted time series analysis of the vote score of questions created by new contributors on math.stackexchange.com}
\label{math_votesits} \label{math_votesits}
\end{subfigure}\\
\begin{subfigure}[c]{0.5\textwidth}
\includegraphics[scale=0.37]{../math.stackexchange.com/output/questionits/average_questions-i1.png}
\caption{An interrupted time series analysis of the number of questions created by new contributors on math.stackexchange.com}
\label{math_questionsits}
\end{subfigure} \end{subfigure}
\end{figure} \end{figure}
% sentiments falling prior to the change % sentiments falling prior to the change
% sentiments falling faster than before the change % sentiments falling faster than before the change
\section{MathOverflow.net} \section{MathOverflow.net}
MathOverflow shows a constant regresssion before the change, however, average sentiments are low at about 10 months before the change and spiked high directly before the change. When the change is introduced regression makes a small jump up and decreases thereafter. This data set is sparse compared to the other datasets. MathOverflow shows a constant regresssion before the change, however, average sentiments are low at about 10 months before the change and spiked high directly before the change. When the change is introduced regression makes a small jump up and decreases thereafter. The votes score steadily increased prior to the change and then quickly returned to the level from 3 years before the change. The number of 1st questions are falling prior the change and stabilized thereafter. This data set is sparse compared to the other datasets. Also the vote scores are high compared to other datasets.
\begin{figure}[H] \begin{figure}[H]
\begin{subfigure}[t]{0.5\textwidth} \begin{subfigure}[t]{0.5\textwidth}
\includegraphics[scale=0.37]{../mathoverflow.net/output/its/average_sentiments-i1.png} \includegraphics[scale=0.37]{../mathoverflow.net/output/its/average_sentiments-i1.png}
\caption{An interrupted time series analysis of the sentiments of answer to questions created by new contributors on MathOverflow.com} \caption{An interrupted time series analysis of the sentiments of answer to questions created by new contributors on MathOverflow.net}
\label{matho_its} \label{matho_its}
\end{subfigure} \end{subfigure}
\begin{subfigure}[t]{0.5\textwidth} \begin{subfigure}[t]{0.5\textwidth}
\includegraphics[scale=0.37]{../mathoverflow.net/output/votesits/average_votes-i1.png} \includegraphics[scale=0.37]{../mathoverflow.net/output/votesits/average_votes-i1.png}
\caption{An interrupted time series analysis of the vote score of questions created by new contributors on MathOverflow.com} \caption{An interrupted time series analysis of the vote score of questions created by new contributors on MathOverflow.net}
\label{math_votesits} \label{matho_votesits}
\end{subfigure}\\
\begin{subfigure}[c]{0.5\textwidth}
\includegraphics[scale=0.37]{../mathoverflow.net/output/questionits/average_questions-i1.png}
\caption{An interrupted time series analysis of the number of questions created by new contributors on MathOverflow.net}
\label{matho_questionsits}
\end{subfigure} \end{subfigure}
\end{figure} \end{figure}
% senitments stable/constant prior to the change % senitments stable/constant prior to the change
% falling after the change % falling after the change
\section{AskUbuntu.com} \section{AskUbuntu.com}
AskUbuntu saw a decrease in average sentiments prior to the change. After the introduction of the change the regression dipped but sentiments keep rising drastically since then. AskUbuntu saw a decrease in average sentiments prior to the change. After the introduction of the change the regression dipped but sentiments keep rising drastically since then. The vote score has a huge range of values prior and after the change, hoewever the graph indicates the the vote score is declining after the change. The number of 1st questions slightly decreased prior the change and started rising after the change.
\begin{figure}[H] \begin{figure}[H]
\begin{subfigure}[t]{0.5\textwidth} \begin{subfigure}[t]{0.5\textwidth}
\includegraphics[scale=0.37]{../askubuntu.com/output/its/average_sentiments-i1.png} \includegraphics[scale=0.37]{../askubuntu.com/output/its/average_sentiments-i1.png}
@@ -72,7 +88,12 @@ AskUbuntu saw a decrease in average sentiments prior to the change. After the in
\begin{subfigure}[t]{0.5\textwidth} \begin{subfigure}[t]{0.5\textwidth}
\includegraphics[scale=0.37]{../askubuntu.com/output/votesits/average_votes-i1.png} \includegraphics[scale=0.37]{../askubuntu.com/output/votesits/average_votes-i1.png}
\caption{An interrupted time series analysis of the vote score of questions created by new contributors on AskUbuntu.com} \caption{An interrupted time series analysis of the vote score of questions created by new contributors on AskUbuntu.com}
\label{math_votesits} \label{ubuntu_votesits}
\end{subfigure}\\
\begin{subfigure}[c]{0.5\textwidth}
\includegraphics[scale=0.37]{../askubuntu.com/output/questionits/average_questions-i1.png}
\caption{An interrupted time series analysis of the number of questions created by new contributors on AskUbuntu.com}
\label{ubuntu_questionsits}
\end{subfigure} \end{subfigure}
\end{figure} \end{figure}
%senitments have gradually fallen prior to the change %senitments have gradually fallen prior to the change
@@ -80,7 +101,7 @@ AskUbuntu saw a decrease in average sentiments prior to the change. After the in
%maybe: sentiments did not change drastically as seen in maths communities %maybe: sentiments did not change drastically as seen in maths communities
\section{ServerFault.com} \section{ServerFault.com}
ServerFault shows gradually rising average sentiments prior to the change. At the time of the change the regession makes a jump upward and the average sentiment decrease slowly afterward. ServerFault shows gradually rising average sentiments prior to the change. At the time of the change the regession makes a jump upward and the average sentiment decrease slowly afterward. The vote score fell prior to the change, made a huge jump upward and quickly returned to the levels just prior to the change. The number of 1st questions, however, saw a drastic change. Prio to the change the number of 1st questions decreased steadily, while after the change the numbers increase at the same pace as they were falling prior to the change. The number of follow up questions also saw the same course direction, falling prior and raising after the change.
\begin{figure}[H] \begin{figure}[H]
\begin{subfigure}[t]{0.5\textwidth} \begin{subfigure}[t]{0.5\textwidth}
\includegraphics[scale=0.37]{../serverfault.com/output/its/average_sentiments-i1.png} \includegraphics[scale=0.37]{../serverfault.com/output/its/average_sentiments-i1.png}
@@ -90,14 +111,19 @@ ServerFault shows gradually rising average sentiments prior to the change. At th
\begin{subfigure}[t]{0.5\textwidth} \begin{subfigure}[t]{0.5\textwidth}
\includegraphics[scale=0.37]{../serverfault.com/output/votesits/average_votes-i1.png} \includegraphics[scale=0.37]{../serverfault.com/output/votesits/average_votes-i1.png}
\caption{An interrupted time series analysis of the vote score of questions created by new contributors on ServerFault.com} \caption{An interrupted time series analysis of the vote score of questions created by new contributors on ServerFault.com}
\label{math_votesits} \label{fault_votesits}
\end{subfigure}\\
\begin{subfigure}[c]{0.5\textwidth}
\includegraphics[scale=0.37]{../serverfault.com/output/questionits/average_questions-i1.png}
\caption{An interrupted time series analysis of the number of questions created by new contributors on ServerFault.com}
\label{fault_questionsits}
\end{subfigure} \end{subfigure}
\end{figure} \end{figure}
%sentiments fairly stable before and after the change %sentiments fairly stable before and after the change
% small jump in avg sentiments at change date % small jump in avg sentiments at change date
\section{SuperUser.com} \section{SuperUser.com}
SuperUser shows only sightly decreasing average sentiment up to the change. At the change time the regression takes a dip down and the regression shows a downward trend after the change. Indeed the average sentiments dipped considerably when the change is introducted the average sentiment recovers about 13 months later. Data available in the future will show if the recovery is persistent. SuperUser shows only sightly decreasing average sentiment and vote score up to the change. At the change time the regressions take a dip down and the regression shows a downward trend after the change. Indeed the average sentiments and vote score dipped considerably when the change is introducted. The average sentiment recovers about 13 months later, while the vote score does not recover as well. The number of 1st questions are decrasing prior to the change and then went through the roof indicating a huge wave of new users. This drastic influx of new users may explain the crash of the average sentiment and vote score that occured at the same time. Data available in the future will show if the recovery is persistent.
\begin{figure}[H] \begin{figure}[H]
\begin{subfigure}[t]{0.5\textwidth} \begin{subfigure}[t]{0.5\textwidth}
\includegraphics[scale=0.37]{../superuser.com/output/its/average_sentiments-i1.png} \includegraphics[scale=0.37]{../superuser.com/output/its/average_sentiments-i1.png}
@@ -107,7 +133,12 @@ SuperUser shows only sightly decreasing average sentiment up to the change. At t
\begin{subfigure}[t]{0.5\textwidth} \begin{subfigure}[t]{0.5\textwidth}
\includegraphics[scale=0.37]{../superuser.com/output/votesits/average_votes-i1.png} \includegraphics[scale=0.37]{../superuser.com/output/votesits/average_votes-i1.png}
\caption{An interrupted time series analysis of the vote score of questions created by new contributors on SuperUser.com} \caption{An interrupted time series analysis of the vote score of questions created by new contributors on SuperUser.com}
\label{math_votesits} \label{super_votesits}
\end{subfigure}\\
\begin{subfigure}[c]{0.5\textwidth}
\includegraphics[scale=0.37]{../superuser.com/output/questionits/average_questions-i1.png}
\caption{An interrupted time series analysis of the number of questions created by new contributors on SuperUser.com}
\label{super_questionsits}
\end{subfigure} \end{subfigure}
\end{figure} \end{figure}
%sentiments fairly stable until the change date %sentiments fairly stable until the change date
@@ -115,7 +146,7 @@ SuperUser shows only sightly decreasing average sentiment up to the change. At t
% recovery after after 13 months to not quite the previous levels % recovery after after 13 months to not quite the previous levels
\section{electronics.stackexchange.com} \section{electronics.stackexchange.com}
On electronics.stackexchange.com the average sentiment decreases continuously prior to the change. At the change date the regression makes a little jump upward but the trend from before the change continues afterward. Similarly to SuperUser, the average sentiment recover at about 12 months after the change is introduced and future data will be necessary to determine if the recovery is persistent.s On electronics.stackexchange.com the average sentiment and votes decrease continuously prior to the change. At the change date the regressions makes a little jump upward but the trend from before the change continues afterward. Similarly to SuperUser, the average sentiment recover at about 12 months after the change is introduced and future data will be necessary to determine if the recovery is persistent. The number of 1st questions rose continuously prior the the change and decreased thereafter. The number of follow up questions fell slightly prior to the change and stabilized afterward.
\begin{figure}[H] \begin{figure}[H]
\begin{subfigure}[t]{0.5\textwidth} \begin{subfigure}[t]{0.5\textwidth}
\includegraphics[scale=0.37]{../electronics.stackexchange.com/output/its/average_sentiments-i1.png} \includegraphics[scale=0.37]{../electronics.stackexchange.com/output/its/average_sentiments-i1.png}
@@ -125,7 +156,12 @@ On electronics.stackexchange.com the average sentiment decreases continuously pr
\begin{subfigure}[t]{0.5\textwidth} \begin{subfigure}[t]{0.5\textwidth}
\includegraphics[scale=0.37]{../electronics.stackexchange.com/output/votesits/average_votes-i1.png} \includegraphics[scale=0.37]{../electronics.stackexchange.com/output/votesits/average_votes-i1.png}
\caption{An interrupted time series analysis of the vote score of questions created by new contributors on electronics.stackexchange.com} \caption{An interrupted time series analysis of the vote score of questions created by new contributors on electronics.stackexchange.com}
\label{math_votesits} \label{ele_votesits}
\end{subfigure}\\
\begin{subfigure}[c]{0.5\textwidth}
\includegraphics[scale=0.37]{../electronics.stackexchange.com/output/questionits/average_questions-i1.png}
\caption{An interrupted time series analysis of the number of questions created by new contributors on electronics.stackexchange.com}
\label{ele_questionsits}
\end{subfigure} \end{subfigure}
\end{figure} \end{figure}
%sentiments were falling continuously before and after the change %sentiments were falling continuously before and after the change
@@ -133,7 +169,7 @@ On electronics.stackexchange.com the average sentiment decreases continuously pr
% more data in the future will be required to determine if upward trend in the end continues % more data in the future will be required to determine if upward trend in the end continues
\section{stats.stackexchange.com} \section{stats.stackexchange.com}
On stats.stackexchange.com the average sentiment is steadily decreasing prior to the change. The regression dips when the change is introduced. However, the average sentiment after the change indicate a slight upward trend. On stats.stackexchange.com the average sentiment is steadily decreasing prior to the change. The regression dips when the change is introduced. However, the average sentiment after the change indicate a slight upward trend. The vote score also decreased prior to the change but does not recover afterward. However, the number of 1st questions and follow up questions are raising prior to the change and increase even faster after the change.
\begin{figure}[H] \begin{figure}[H]
\begin{subfigure}[t]{0.5\textwidth} \begin{subfigure}[t]{0.5\textwidth}
\includegraphics[scale=0.37]{../stats.stackexchange.com/output/its/average_sentiments-i1.png} \includegraphics[scale=0.37]{../stats.stackexchange.com/output/its/average_sentiments-i1.png}
@@ -143,7 +179,12 @@ On stats.stackexchange.com the average sentiment is steadily decreasing prior to
\begin{subfigure}[t]{0.5\textwidth} \begin{subfigure}[t]{0.5\textwidth}
\includegraphics[scale=0.37]{../stats.stackexchange.com/output/votesits/average_votes-i1.png} \includegraphics[scale=0.37]{../stats.stackexchange.com/output/votesits/average_votes-i1.png}
\caption{An interrupted time series analysis of the vote score of questions created by new contributors on stats.stackexchange.com} \caption{An interrupted time series analysis of the vote score of questions created by new contributors on stats.stackexchange.com}
\label{math_votesits} \label{stats_votesits}
\end{subfigure}\\
\begin{subfigure}[c]{0.5\textwidth}
\includegraphics[scale=0.37]{../stats.stackexchange.com/output/questionits/average_questions-i1.png}
\caption{An interrupted time series analysis of the number of questions created by new contributors on stats.stackexchange.com}
\label{stats_questionsits}
\end{subfigure} \end{subfigure}
\end{figure} \end{figure}
% sentiments steadily decreasing prior to the change % sentiments steadily decreasing prior to the change
@@ -151,7 +192,7 @@ On stats.stackexchange.com the average sentiment is steadily decreasing prior to
% sight upward trend after the change % sight upward trend after the change
\section{tex.stackexchange.com} \section{tex.stackexchange.com}
On tex.stackexchange.com the average sentiment is low comapred to the other investigated data sets. Prior to the change the average sentiment only slightly decreases. When the change is introduced the regreesion takes a dip down. After the change the analysis indicates a strong increase in average sentiment. Future data will be required to see if this upward trend continues or evens out. On tex.stackexchange.com the average sentiment is low compared to the other investigated data sets. Prior to the change the average sentiment only slightly decreases. When the change is introduced the regression takes a dip down and after the change the average sentiment increases drastically. Future data will be required to see if this upward trend continues or evens out. In stark contrast, the vote score shows a downward trend, although there is a short window around the change date where vote scores are higher compared to before and after change. The number of 1st questions have downward trend before the change and an upward trend afterward. The downward trend of the number of follow up questions is uninterrupted by the change.
\begin{figure}[H] \begin{figure}[H]
\begin{subfigure}[t]{0.5\textwidth} \begin{subfigure}[t]{0.5\textwidth}
\includegraphics[scale=0.37]{../tex.stackexchange.com/output/its/average_sentiments-i1.png} \includegraphics[scale=0.37]{../tex.stackexchange.com/output/its/average_sentiments-i1.png}
@@ -161,7 +202,12 @@ On tex.stackexchange.com the average sentiment is low comapred to the other inve
\begin{subfigure}[t]{0.5\textwidth} \begin{subfigure}[t]{0.5\textwidth}
\includegraphics[scale=0.37]{../tex.stackexchange.com/output/votesits/average_votes-i1.png} \includegraphics[scale=0.37]{../tex.stackexchange.com/output/votesits/average_votes-i1.png}
\caption{An interrupted time series analysis of the vote score of questions created by new contributors on tex.stackexchange.com} \caption{An interrupted time series analysis of the vote score of questions created by new contributors on tex.stackexchange.com}
\label{math_votesits} \label{tex_votesits}
\end{subfigure}\\
\begin{subfigure}[c]{0.5\textwidth}
\includegraphics[scale=0.37]{../tex.stackexchange.com/output/questionits/average_questions-i1.png}
\caption{An interrupted time series analysis of the number of questions created by new contributors on tex.stackexchange.com}
\label{tex_questionsits}
\end{subfigure} \end{subfigure}
\end{figure} \end{figure}
%avg sentiment fairly low compared to the other investigated communities %avg sentiment fairly low compared to the other investigated communities
@@ -170,7 +216,7 @@ On tex.stackexchange.com the average sentiment is low comapred to the other inve
% trend after change strongly upward % trend after change strongly upward
\section{unix.stackexchange.com} \section{unix.stackexchange.com}
On unix.stackexchange.com the average sentiment is decreasing prior to the change. When the change is introduced the regreesion take a small dip down, however, the average sentiment increases fast after the change. On unix.stackexchange.com the average sentiment is decreasing prior to the change. When the change is introduced the regression take a small dip down, however, the average sentiment increases fast after the change. The vote score shows continuous downward trend. The number of 1st and follow up questions are falling slightly prio to the change and increase afterward.
\begin{figure}[H] \begin{figure}[H]
\begin{subfigure}[t]{0.5\textwidth} \begin{subfigure}[t]{0.5\textwidth}
\includegraphics[scale=0.37]{../unix.stackexchange.com/output/its/average_sentiments-i1.png} \includegraphics[scale=0.37]{../unix.stackexchange.com/output/its/average_sentiments-i1.png}
@@ -180,7 +226,12 @@ On unix.stackexchange.com the average sentiment is decreasing prior to the chang
\begin{subfigure}[t]{0.5\textwidth} \begin{subfigure}[t]{0.5\textwidth}
\includegraphics[scale=0.37]{../unix.stackexchange.com/output/votesits/average_votes-i1.png} \includegraphics[scale=0.37]{../unix.stackexchange.com/output/votesits/average_votes-i1.png}
\caption{An interrupted time series analysis of the vote score of questions created by new contributors on unix.stackexchange.com} \caption{An interrupted time series analysis of the vote score of questions created by new contributors on unix.stackexchange.com}
\label{math_votesits} \label{unix_votesits}
\end{subfigure}\\
\begin{subfigure}[c]{0.5\textwidth}
\includegraphics[scale=0.37]{../unix.stackexchange.com/output/questionits/average_questions-i1.png}
\caption{An interrupted time series analysis of the number of questions created by new contributors on unix.stackexchange.com}
\label{unix_questionsits}
\end{subfigure} \end{subfigure}
\end{figure} \end{figure}
%sentiments decreasing prior to the change %sentiments decreasing prior to the change

View File

@@ -160,7 +160,7 @@
\newcommand{\myhomestreet}{Petergasse~23/13} %% your home street (with house number) \newcommand{\myhomestreet}{Petergasse~23/13} %% your home street (with house number)
\newcommand{\myhometown}{Graz} %% your home town \newcommand{\myhometown}{Graz} %% your home town
\newcommand{\myhomepostalnumber}{8010} %% your postal number of home town \newcommand{\myhomepostalnumber}{8010} %% your postal number of home town
\newcommand{\mysubmissionmonth}{May} %% month you are handing in \newcommand{\mysubmissionmonth}{August} %% month you are handing in
\newcommand{\mysubmissionyear}{2020} %% year you are handing in \newcommand{\mysubmissionyear}{2020} %% year you are handing in
\newcommand{\mysubmissiontown}{\myhometown} %% town of handing in (or \myhometown) \newcommand{\mysubmissiontown}{\myhometown} %% town of handing in (or \myhometown)

View File

@@ -423,8 +423,8 @@ BCOR=\myBCOR, %% binding correction (depends on how you bind
%doc% package for defining header and footer information. Please refer to the \myacro{KOMA} %doc% package for defining header and footer information. Please refer to the \myacro{KOMA}
%doc% script documentation how to use this package. %doc% script documentation how to use this package.
%doc% %doc%
\usepackage{scrpage2} %% advanced page style using KOMA \usepackage[headsepline]{scrlayer-scrpage} %% advanced page style using KOMA
\KOMAoptions{headsepline=false}
%doc% %doc%
%doc% \subsection{References}\myimportant %doc% \subsection{References}\myimportant