Simple Science

Cutting edge science explained simply

# Computer Science# Computation and Language# Computers and Society# Machine Learning

Addressing Subjectivity in News Reporting

A study on detecting subjective statements in news articles using advanced techniques.

― 5 min read


Tackling Bias in NewsTackling Bias in Newsjournalism.A method to identify bias in
Table of Contents

This paper is protected by copyright, and its use is allowed under the Creative Commons License Attribution 4.0 International.

Introduction

In the world of journalism, it is crucial to identify when a text expresses personal views rather than factual information. This is important because biased news can shape public opinion, even if some parts of an article are based on facts. The ability to automatically determine whether a text is Subjective or Objective could greatly help editors and fact-checkers.

The Problem of Subjectivity in News Articles

News articles often mix facts with opinions. This combination can lead to confusion among readers, as subjective statements can distort the truth. Our task is to classify sentences from news articles as either subjective, meaning they reflect personal viewpoints, or objective, meaning they present factual information without personal bias.

One of the main challenges we face is class imbalance in the dataset. There are often many more objective sentences than subjective ones. This imbalance can result in Models that are poor at identifying subjective sentences. Additionally, the idea of subjectivity can vary between different cultures and contexts. Thus, simple rewriting of sentences may not capture the journalistic aspect of subjectivity.

Our Approach to Data Generation

To address these issues, we created new training data using a method involving GPT-3 models. We based our work on a checklist derived from journalistic standards to ensure the generated texts reflect various subjective styles. This allows us to create training materials that better represent the topic.

We conducted experiments in three languages: English, Turkish, and German. Our results show that employing different subjective styles boosts the performance of models designed to detect subjectivity. This highlights the significance of diverse subjective expressions within each language.

Another key finding is that using style-based Oversampling, which means creating more samples from subjective styles, works better than normal paraphrasing, particularly in Turkish and English. However, we noted that GPT-3 sometimes struggles to produce quality texts in non-English languages.

Creating A Subjectivity Checklist

To effectively generate texts that reflect a journalistic perspective, we developed a checklist. We consulted editors to understand how they assess subjectivity in articles. After gathering this information, we cross-referenced it with existing journalism and linguistic literature. The outcome was a comprehensive checklist that includes distinct styles representing various subjective angles.

Designing Prompts for Text Generation

Our next step involved creating prompts in English, Turkish, and German to instruct the GPT-3 models on how to generate texts based on the identified styles. We initially crafted an English template, but it did not perform well in other languages. Consequently, we adapted the templates for each language separately.

The first two authors of our work, being native Turkish and German speakers, discussed the English prompts and ensured that translations captured the intended meaning. This approach maintained coherence across languages while allowing flexibility for stylistic differences.

Data Generation and Balancing Techniques

To generate our dataset, we first measured the gap between the number of subjective and objective sentences. We then randomly selected samples to create a balanced dataset. By focusing on the differences in the number of samples, we ensured that our models would have enough data to learn from.

We used both under-sampling and over-sampling techniques to handle the class imbalance. Under-sampling means removing objective samples until they match the number of subjective samples, while over-sampling involves adding generated samples to the original dataset to balance the classes.

Training Language-Specific Models

For our subjectivity detection task, we relied on language-specific models: Roberta-base for English, German Bert for German, and BERTurk for Turkish. These models have proven effective for tasks in their respective languages. We limited the length of the input to ensure efficient processing and conducted training over several epochs to refine the models.

Evaluating Our Methods

After we trained the models, we assessed how well the new samples generated with GPT-3 improved the models' accuracy. We compared the performance of our models against three baselines: those trained only on original Datasets, those trained with normal paraphrasing, and those using paraphrased objective texts.

Our results showed that style-based oversampling significantly improved the performance of models for English and Turkish. However, it did not yield the same benefits for German transformers. Among various styles, we found that certain styles like partisan and exaggerated worked well for Turkish, while propaganda and exaggerated styles had a positive effect on English models.

Comparing Different GPT-3 Models

We also wanted to explore how different GPT-3 models performed in generating training samples. For this purpose, we compared text-davinci-003 with gpt-3.5-turbo (ChatGPT). While there were no significant differences in performance overall, some improvements were noted in certain subjective styles using the ChatGPT model.

Qualitative Assessment of Generated Texts

In addition to quantitative evaluations, we conducted a qualitative assessment of the generated texts. We looked at the naturalness, correctness, and relevance of the texts produced by both models. We discovered that the English samples often contained exaggerated phrases and sometimes used offensive language. In the case of Turkish samples, we noticed that first-person references were common, making the texts feel less formal. German samples occasionally contained language that was not suitable for the context.

Conclusion

In summary, our study employed style-based sampling with GPT-3 models, focusing on journalistic styles to tackle the scarcity of data in subjectivity detection. Our experiments highlighted that this approach is more effective than standard paraphrasing. Different styles provided varying benefits depending on the language, reflecting cultural distinctions and potential biases in the data.

Our work is specific to each language and limited by the availability of high-quality data for less commonly used languages. Future research should look into finding better models for these languages and improving the phrasing of prompts to yield more accurate results. Additionally, sample selection plays a critical role in achieving effective style transfer, which we plan to investigate further in upcoming studies.

Similar Articles