<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>BullyAlert- A Mobile Application for Adaptive Cyberbullying Detection</title></titleStmt>
			<publicationStmt>
				<publisher></publisher>
				<date>2020 Winter</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10226740</idno>
					<idno type="doi"></idno>
					<title level='j'>International Conference on Mobile Computing Applications and Services</title>
<idno>2411-7080</idno>
<biblScope unit="volume"></biblScope>
<biblScope unit="issue"></biblScope>					

					<author>R Rafiq</author><author>R Han</author><author>Q Lv</author><author>S Mishra</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[Due to the prevalence and severe consequences of cyberbullying, numerous research works have focused on mining and analyzing social network data to understand cyberbullying behavior and then using the gathered insights to develop accurate classifiers to detect cyberbullying. Some recent works have been proposed to leverage the detection classifiers in a centralized cyberbullying detection system and send notifications to the concerned authority whenever a person is perceived to be victimized. However, two concerns limit the effectiveness of a centralized cyberbullying detection system. First, a centralized detection system gives a uniform severity level of alerts to everyone, even though individual guardians might have different tolerance levels when it comes to what constitutes cyberbullying. Second, the volume of data being generated by old and new social media makes it computationally prohibitive for a centralized cyberbullying detection system to be a viable solution. In this work, we propose BullyAlert, an android mobile application for guardians that allows the computations to be delegated to the hand-held devices. In addition to that, we incorporate an adaptive classification mechanism to accommodate the dynamic tolerance level of guardians when receiving cyberbullying alerts. Finally, we include a preliminary user analysis of guardians and monitored users using the data collected from BullyAlert usage.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Cyberbullying in Online Social Networks (OSNs) has seen an unprecedented rise in recent years. Continued democratization of internet-usage, advancements, and innovations in the area of online social networking and lack of diligent steps to mitigate the effect of cyberbullying by the concerned entities, all have contributed to this unfortunate consequence. The constant threat of cyberbullying in these OSNs has become so expansive and pervasive that it has been reported that in America alone, more than fifty percent of teenage OSNs users have been affected by the threat of cyberbullying <ref type="bibr">[4]</ref>. While real-life bullying may involve verbal and/or physical assault, cyberbullying is different in the sense that it occurs under the umbrella of an electronic context that is available 24/7, thereby rendering the victims vulnerable to its threats on a constant and relentless basis. This unique feature of cyberbullying subjects the victims to devastating psychological effects that later cause nervous breakdowns, low self-esteem, self-harm, clinical depression, and in some extreme cases, suicides <ref type="bibr">[10]</ref>, <ref type="bibr">[8]</ref>. Disturbing events of teens committing suicides after being victimized by cyberbullies have also been reported <ref type="bibr">[9]</ref>, <ref type="bibr">[30]</ref>. Moreover, nine suicide cases have already been attributed to cyberbullying in Ask.fm alone <ref type="bibr">[2]</ref>. Cyberbullying has been reported as one of the many potential factors, if not the only factor, for these suicides <ref type="bibr">[3]</ref>. Figure <ref type="figure">1</ref> shows an example of a cyberbullying instance on Instagram. There have been numerous works on cyberaggression and cyberbullying by performing thorough analyses of the labeled media sessions from a social network. Cyberaggression is defined as a type of behavior in an electronic context that is meant to intentionally harm another person <ref type="bibr">[16]</ref>. Cyberbullying is defined in a stronger and more specific way as an aggressive behavior that is carried out repeatedly in OSNs against a person who cannot easily defend himself or herself, creating a power imbalance <ref type="bibr">[16]</ref>, <ref type="bibr">[22]</ref>, <ref type="bibr">[13]</ref>, <ref type="bibr">[16]</ref>, <ref type="bibr">[21]</ref>, <ref type="bibr">[29]</ref>. Developing classifiers for cyberbullying detection for different social networks such as Twitter, Ask.fm, Vine, Instagram, YouTube etc have been proposed as part of most of the previous works <ref type="bibr">[28]</ref>, <ref type="bibr">[11]</ref>, <ref type="bibr">[15]</ref>, <ref type="bibr">[35]</ref>, <ref type="bibr">[6]</ref>, <ref type="bibr">[26]</ref>, <ref type="bibr">[27]</ref>. System-related challenges such as responsiveness and scalability of a potential cyberbullying detection system have also been proposed <ref type="bibr">[25]</ref>. However, there are two key issues that a potential centralized cyberbullying system faces:</p><p>-A centralized cyberbullying detection system still needs to have a lot of computing resources to be able to accommodate all the OSNs in the world. Therefore, a system solution that can accommodate all the OSNs currently avail-able will be faced with a daunting challenge of meeting the computationalresource requirements to maintain an acceptable responsiveness -A potential centralized cyberbullying detection system, such as one presented in <ref type="bibr">[25]</ref>, makes use of a cyberbullying classifier component that is applied generally to all the guardians. We argue that different guardians will have different tolerance levels, which in turn, might be dependent on their personal preferences, demographic information, location, age, gender, and so on. So, we argue that a system solution that allows different levels of cyberbullying alerts to be sent to parents based on their tolerance levels is the most natural solution.</p><p>In this paper, we present the design and implementation of an Android mobile application for guardians: BullyAlert. This mobile application allows the guardians to monitor the online social network activities (currently only supports Instagram) of their kids and get notifications whenever the monitored social network profiles receive a potential cyberbullying instance. The reasons for developing this mobile application are twofold. First, it allows us to delegate classifier computations of cyberbullying detection to the hand-held devices of the guardians, thereby reducing the computational resources needed for a potential centralized cyberbullying detection system. Second, BullyAlert allows the guardians to give the resident classifier feedback about how right or wrong each notification is. The resident classifier then updates itself accordingly to calibrate its tolerance level with that of the guardian using it. This mechanism allows for personalized cyberbullying notification of an individual guardian.</p><p>We make the following contributions in this paper.</p><p>-We propose the design and implementation of an android application, BullyAlert.</p><p>-We present a preliminary user-experience analysis of the guardians who downloaded the mobile application by using the current crop of data -We present a preliminary comparison the behavior of the users who were being monitored by the guardians with the general population of Instagram to derive some initial key insights by leveraging the current collection of data</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Related Works</head><p>As mentioned earlier, the majority of researches on cyberbullying have focused on improving the accuracy of detection classifiers. Collection, mining and analyses of different social media networks such as Twitter <ref type="bibr">[28]</ref>, Ask.fm <ref type="bibr">[18]</ref>, YouTube <ref type="bibr">[5]</ref>,</p><p>Instagram <ref type="bibr">[12]</ref>, <ref type="bibr">[6]</ref>, chat-services <ref type="bibr">[15]</ref>, Vine <ref type="bibr">[26]</ref> have been performed for cyberbullying, cyberaggression <ref type="bibr">[34]</ref>, harassment <ref type="bibr">[35]</ref>, aggression under the hood of anonymity <ref type="bibr">[18]</ref> and predatory behaviors <ref type="bibr">[14]</ref>. As a natural next step, many works have proposed accurate classifiers to detect and predict cyberbullying incidents as well as finding most contributing factors to successful classification of cyberbullying incidents <ref type="bibr">[20][6]</ref>.</p><p>The scalability and responsiveness challenges of a potential cyberbullying detection system have also been put under the microscope recently <ref type="bibr">[25]</ref>. Several systems have been proposed with scalable architectures to detect cyberbullying in social media networks <ref type="bibr">[35]</ref>, <ref type="bibr">[33]</ref>, <ref type="bibr">[7]</ref>. However, none of the previous researches addressed the issue of the individual tolerance level of the guardians when it comes to the detection of cyberbullying, which might be a factor dependent on many variables such as the age of the person being monitored, demographics information, gender, and so on. Moreover, due to many novel social networks being introduced every year <ref type="bibr">[32]</ref> and the enormous amount of data being generated by the existing ones such as Instagram <ref type="bibr">[17]</ref> and YouTube <ref type="bibr">[19]</ref>, we argue, eventually it will be computationally prohibitive to sustain a centralized cyberbullying detection architecture. Some mobile applications are available for guardians, however, they are either mostly outgoing packet network sniffing tools <ref type="bibr">[7]</ref> or profanity detection apps <ref type="bibr">[31]</ref>. None of them incorporates a sophisticated cyberbullying classification or a mechanism to enable dynamic tolerance-level detection.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">System Design and Implementation</head><p>This section presents the design, implementation, and architecture of BullyAlert. We begin by describing the typical user work-flow through a series of use cases, and then present the architecture and implementation of different components of BullyAlert.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Use Cases</head><p>Guardian registers After a guardian downloads the application and opens it, the screen in Table <ref type="table">1a</ref> is presented. The Guardian has to enter unique email id and password to be able to register into our system. Options to divulge additional information such as age group, gender, and ethnicity, are also provided. Each of these information is presented through a drop-down list. The Guardian can also use the option to decline to give this information.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Guardian logins</head><p>The login screen is shown in Table <ref type="table">1b</ref> has two input fields. These fields ask for the email and password used to register into the system. After clicking the log-in button, the guardian is directed to the dashboard of the application.</p><p>Guardian Searches for users to monitor The Guardian can search for public Instagram profiles by going to the user search component shown in Table <ref type="table">1c</ref>. At first, the guardian selects the social network profile from the drop-down list. Right now, we are only supporting public Instagram social network profiles. Then in the text field, the username of the user to be monitored is typed. When the search button is clicked, a list of users matching the username entered is shown along with the associated profile pictures for the guardian to facilitate a better identification. To start monitoring a profile, the guardian has to select the profile and then click the monitor button.  Guardian examines user profile information details The Guardian can see the basic profile details of the users being monitored, as shown in Table <ref type="table">1d</ref>. This page shows guardians the current profile picture, the number of total media shared, and the number of total followers and followings of the user being monitored.</p><p>Guardian gets a list of notifications Table <ref type="table">1e</ref> shows the screen that the guardian sees when a host of notifications are present in the dashboard. The list has three columns, the first column shows the username of the profile where this cyberbullying notification has originated, the second column indicates the social network and finally, the third column outlines the application's classifier's perceived level of severity for this notification. Currently, the application has two levels of severity, namely low and high. The Guardian can see the details of the notification by clicking the individual notification boxes.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Guardian examines notification details and give feedback</head><p>To enable the guardians to see the full context of a particular notification and give feedback as to whether the application right or wrong in terms of the severity level, table 1f is presented. The Guardian has options to click the "see full context" button which will then load not just the latest comments but also the previous comments of that media session. This enables the guardians to get a full picture of the happenings in the media session. The guardian can give feedback through the two buttons, namely right or wrong. This feedback is then used by the application to calibrate its tolerance level according to that of the guardian.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Architecture and Implementation</head><p>This section describes the architecture and implementation of the BullyAlert application's different components. Figure <ref type="figure">2</ref> presents the architecture diagram of the BullyAlert system. The guardian communicates with the BullyAlert application for registering, logging in, and getting notifications for potential cyberbullying instances. The application sends guardian data, notification data, and feedback data to the BullyAlert server. The application also contacts the BullyAlert server for authenticating a user log-in. Moreover, the application implements a polling mechanism by which it periodically collects media session data of the Instagram-users (who are being monitored by the guardians) from the Instagram servers.</p><p>BullyAlert Server BullyAlert server is responsible for the following:</p><p>-During the registration process, it is responsible for checking that the registration information is verified. It first checks if the email that is being used to register is unique in the system and the password is at least six characters. When the registration is successful, the server stores both the username and the password in an encrypted format. In addition to this, the server stores the demographic information provided by the guardian through the registration form, such as age group, gender, and ethnicity. -During login, the server is responsible to check the login credentials of a guardian with the system's stored credentials. It gives an error if the login credentials are not verified which is then in turn shown to the user. -Storing all the notifications of the guardians. Every time a guardian receives a notification, the application sends the notification meta-data to the server. The data consists of a list of comments for which the notification was raised, the user-name in whose profile the notification was raised, and the severity level of the notification (high or low). -Storing the feedback that a guardian gives to the application for a particular notification. Every time the guardian gives the application feedback ( right or wrong), the application sends the relevant data ( list of comments, the application's perceived severity level, the guardian's feedback) to the server. -Storing each guardian's resident classifier information. This is to facilitate continuity of the guardian's classifier so that when the guardian uninstalls the application, the classifier will keep being stored in our server. This means that, if the guardian, at some later time, chooses to re-install the application, the old classifier will become the classifier of the application instead of the general one.</p><p>We have used MongoDB, RESTful API, and node.js for the implementation of this component. The code for this can be found in <ref type="bibr">[24]</ref>.</p><p>Adaptive Classifier An adaptive classifier for each guardian is more suitable for our application than a general classifier for every guardian because of the potentially subjective nature of cyberbullying. We hypothesize that each guardian will have their tolerance level when it comes to cyberbullying, which in turn, can be dependent on several factors, such as gender, age, race, etc.</p><p>The ways we develop this adaptive classifier are as follows. First, we incorporate a feedback mechanism in our application by which, the guardians, upon receiving a potential cyberbullying notification, will be able to give us feedback saying how right or wrong the notification is. We also show the guardians a list of other media sessions which were not deemed as bullying by our application, to make sure we also get feedback for media sessions which were not in the potential bullying notifications page. This is to enable the application to keep track of the false negatives in addition to false positives. Second, we use the logistic regression classifier from <ref type="bibr">[25]</ref> for the implementation of the application's resident cyberbullying detection component. Every time an instance of the classifier gets feedback, the feedback data encapsulate the media session's list of comments for which the alert was raised and the guardian's label (right or wrong). This datum is considered as a labeled training data for the resident classifier. Feature values described for the logistic regression classifier used in <ref type="bibr">[25]</ref> are extracted for each of these feedback sessions. Upon converting the feedback data into training data, we then perform stochastic gradient descent <ref type="bibr">[23]</ref> for the resident classifier. Each parent's classifier then reaches a different local optimum, thereby facilitating the adaptive nature of the classifier.</p><p>For the guardians whose numbers of feedback are not substantial enough to perform an individual adaptation process, we implement the following. We first collect all the feedback given by all the guardians in our server. Then, based on all this feedback, we update our general classifier that was used by the guardians when they first install our application. We call this updated general classifier. This updated general classifier is then propagated to the guardians who don't have enough individual feedback to make sure their classifiers are updated as well. The implementation code can be found in <ref type="bibr">[1]</ref>.</p><p>Polling Mechanism The polling mechanism is responsible for the following:</p><p>-When the guardian searches for a particular user by username, this mechanism fetches the user profiles of which the username-string is a match. -After a monitoring request of a user by a guardian is approved, the polling mechanism starts polling that user profile every hour for any new posts. This is to make sure the app is updated with the latest media postings of the monitored user. -In addition to polling for newly posted media, this component is also responsible for getting the newest comments for all the media posted by the user. Every time a host of new comments is posted for a media session, this mechanism fetches those new comments and sends this newest media session data to the adaptive classifier component for classification.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">User Data Analysis</head><p>(a) age group (b) gender Table <ref type="table">2</ref>. BullyAlert Guardian's Gender and Age Distribution This section presents a preliminary analysis of the data collected until now from BullyAlert. First, it explores the guardian data and then it performs a comparison of social network behaviors between the general Instagram population <ref type="bibr">[18]</ref> and the users who were being monitored by the guardians who downloaded our application.</p><p>When a guardian registers into our system, in addition to the email and password, we also ask them to provide their gender, ethnicity and age information, if they choose to divulge those. Table <ref type="table">2a</ref> and 2b show the distribution of gender and age-group of the 100 guardians who have downloaded BullyAlert until now. From the distributions, it is fairly clear that a substantial portion of the people chose not to provide the demographic information, 31 and 34 percent for age group and gender respectively. In addition to that, the most prominent age group and gender where 36-45 and female respectively.</p><p>The reasons we collected this demographic information are twofold. First, when we start to get the classifier feedback data for all the guardians for their different tolerance levels, we will want to investigate if there are any correlations between different guardians' tolerance levels and their demographic information. Second, if we do find that people with the same demographics tend to have the same tolerance levels, we will then want to be able to build different general classifiers for different clusters where each cluster hosts guardians with similar demographics. While we acknowledge that these 100 guardians' data is an insufficient representation of guardians, we postulate that this preliminary demographics distribution still introduces us to a new systems challenge: what about guardians who do not provide demographic information and thus will not belong to any particular cluster by default?</p><p>Next, we investigate a comparison analysis between the Instagram users who are being monitored by our application and the general Instagram population, a data-set collected from <ref type="bibr">[18]</ref>. First, we compare both sets of users' follow and media-sharing activities. Table <ref type="table">3a</ref> and 3b show the CCDF of both set of users' number of people they follow and number of medias they have shared in their profile. It can be seen that the follows activity of both sets follow the same pattern, which is understandable because a user can only follow so many people. But there is a discernible difference in the media sharing activity. The general population's line tends to fall far slower in the graph than that of the BullyAlertmonitored users, with almost 80 percent of the BullyAlert users having less than 100 shared media in their profile. This means that the users who are monitored by the guardians tend to be not as active as the general population. This particular observation also poses an interesting system perspective. Because most of the people who are likely to be monitored by our application will not be sharing as much media, we can afford to incorporate some sophisticated machine learning classifiers in the application instead of worrying about responsiveness, discussed in <ref type="bibr">[25]</ref>. Again, we like to emphasize here that these are preliminary derivations drawn from our current small set of collected data.  <ref type="table">3</ref>. Comparison between Monitored users of BullyAlert and Instagram population collected in <ref type="bibr">[18]</ref> In continuing the narrative, we also put forth a detailed analysis of activities of other people in the user's profile, for example, the likes and comments received in the shared media sessions. Table <ref type="table">4a</ref> and 4b show the CCDF of the number of likes and comments received for the media sessions for both set of Instagram users. It can be seen that the media sessions shared by the users being monitored  <ref type="table">4</ref>. Comparison between Monitored users of BullyAlert and Instagram population collected in <ref type="bibr">[18]</ref> are far less active in terms of getting likes and comments than their general counterparts. This further solidifies our system perspective that our application's classifier will have fewer data to take care for, thus the classifier does not have to be as lightweight as described in <ref type="bibr">[25]</ref>, based on our current crop of data. We acknowledge that the current crop of data is not enough to make a decision, so we plan to keep collecting the data to solidify this preliminary insight.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Conclusion</head><p>In this paper, we make the following contributions. First, we outline the motivation and design of a mobile application, BullyAlert, that adapts itself according to individual tolerance level for cyberbullying of the guardian. Second, we present a thorough architecture description of the components implemented to develop BullyAlert. Third, we provide a preliminary user analysis of both the guardians and the users being monitored by the application, and in the process, present several potential system issues/ challenges/perspectives using our current crop of data. In the future, we plan to expand our study, collection, and analysis of the guardian data.</p></div></body>
		</text>
</TEI>
