INTRODUCTION

There are several approaches to engage the community in research including Community-Based Participatory Research (CBPR), Participatory Action Research (PAR), Community-Based Participatory Action Research, Community Engaged Research, among others (Balls-Berry & Acosta-Pérez, 2017; Vaughn & Jacquez, 2020; Wallerstein et al., 2017). These approaches share core principles of elevating and engaging with the community to enhance the impact and rigor of the research (Holkup et al., 2004; Israel et al., 2020). The ultimate goal is to develop research methodologies where investigators partner with community members (e.g. youth, patients, marginalized communities, consumers) - providing them with ownership, perspective, and expertise in the scientific process. As a research community, we have increasingly recognized the importance of integrating community involvement ideally at each stage of the research: partnership, design, data collection, analysis, dissemination, and subsequent action (Vaughn & Jacquez, 2020). A plethora of research on partnerships have merged across the nation with several integrating careful reflection and assessment of their diverse partnerships (Boursaw et al., 2021; Taffere et al., 2024; Wallerstein & Duran, 2010). Despite the growing interest and engagement, few scholars engage the community in all phases of the research process, particularly with regards to the data analysis stage (Cashman et al., 2008).

Community and participatory data analysis has often been a challenging area for many projects and scholars. Despite the intention to involve the community at every stage of a research initiative, community partners engage in data analysis and interpretation far less frequently than in other phases, often due to varying levels in ability, time, and resources. This results in missed opportunities for community members, regardless of their educational background or research experience, to engage with data analysis beyond being involved in the data collection or recruitment. This paper describes our innovative community-engaged analysis process and offers lessons learned and concrete strategies for other teams to engage the community more holistically and comprehensively in the data analysis stage.

Overview of the Cervical Cancer Case Study for Innovative Participatory Data Analysis

The Cervical Cancer Project’s goal is to uplift the diverse experiences of middle to older age Latina women diagnosed with cervical cancer. Cervical cancer disproportionately impacts Latinas (Austin et al., 2002; Heintzman et al., 2018; Zeno et al., 2022). Our team is comprised of five women of color including two Latina undergraduate students, two master level research assistants, and the PI, a bilingual 1.5. generation Latina. Using the Database of Individual Patient Experiences (DIPEx) Methodology, which is described elsewhere in detail (Herxheimer et al., 2003; Ziebland & McPherson, 2006), we conducted 60-90-minute audio and video recorded patient interviews. We worked with a diverse community advisory board (CAB).

CAB members were either self-selected, invited, and/or were referred for participation. We presented the project to multiple community organizations and recruited from nearly twenty community events inviting folks to be part of this group. The CAB is composed of 15 members, representing a multi-community with intergenerational perspectives across the lifespan. This includes community members, undergraduate students, graduate students in medicine, clinical faculty, researchers, former participants of the project who were interviewed, as well as government sector representatives and staff from various community healthcare centers. Approximately 80% of CAB members speak Spanish, 50% identify as Latina and/or immigrant, and 30% have personal or familial experience with some type of cancer. Given the diverse background in research and education levels among CAB members, we drafted an agenda that fostered connection between the entire community and the project (see Appendix I).

The CAB was invited to participate in a data analysis meeting scheduled for 3 hours, allowing ample time for meaningful engagement. During this meeting, we employed three focused strategies to ensure successful community participation in data analysis. We first built community rapport through creating psychological safety. Second, we established a common language for qualitative data analysis by leveraging common social media language. Third, we utilized a dynamic visualization board to keep the community engaged throughout the process. Our stakeholder community group convened on July 14th, 2023, to participate in a data analysis activity. The community had previously met twice to provide feedback on interview instrument and recruitment strategies. As an established group that had guided the project over the past year, the CAB was well position to contribute to this third gathering where the primary goal was to inform the data analysis by providing direct input on the codebook. About ¼ of the transcripts have been analyzed in NVIVO that informed the development of the codebook presented. During this meeting that was in English, the CAB’s role was to review the codebook presented in both English and Spanish at a third grade reading level to answer what might be missing, what they liked, and what was unclear.

Strategy 1: Building a Psychologically Safe Virtual Space

We informed the CAB from the outset that they would be involved in all aspects of the research process for this project. Each time we met we focused on a specific part= of the research process. The goal was to ensure their voices and perspectives were included, regardless of their individual background or comfort level with research. We are aware there are often negative and strong emotional reactions among students and faculty, let alone among non-academic communities, with regards to “data” “research” or “analysis.” Therefore, to address this real and perceived unequal power dynamics among this diverse group of community advisory board (CAB), we continued to intentionally schedule additional time at the beginning of every meeting, building rapport with an activity that would: a) increase vulnerability b) demonstrate safe space, and c) normalize being on a growth edge (Brown et al., 2020; Wennerstrom et al., n.d.). They were all compensated for their time on the CAB.

At this third gathering, we engaged in an activity called: What Do You Carry with You? Each person shared an item that they carried with them in their purse, pocket, and/or near them to share with the team. We had people share a meaningful painting behind them passed down in their family; we had others share a picture they carried in their wallet of their family members. This activity invited learning about each other and about our family and history as well. This set the tone for our gathering by establishing emotional psychosocial safety.

Strategy 2: Leveraging social media organizational structures

Because data analysis is less familiar to many of the CAB members, we took extra care and effort to ensure everyone was speaking the same language. After setting the tone, we guided the community into data analysis jargon. While our team first provided an overview of qualitative analysis and coding definitions as well as an example of the software used (NVIVO 14), we took the additional step of translating qualitative research analysis jargon to common everyday real-world experiences. We intentionally used the term “hashtag” instead of coding and gave an example of posting a picture on Instagram or Facebook. For example, we communicated with the group, “On social media, to describe a picture, we use hashtags, to categorize and describe that photo. So, in parallel we do the same with codes and the transcript data.” This guided the stakeholder group into reflection and conversation with each other on the coding tree, allowing them to provide feedback and co-create. Our team felt strongly that starting with the codebook was the ideal entry point for data analysis (Ando et al., 2014; DeCuir-Gunby et al., 2011). Codebooks serve as the guiding map for analysis, and by re-purposing the term “hashtag,” we were able to effectively convey our goal for today: reviewing the codebook and providing feedback that informs data analysis.

Their feedback on the codebook could significantly influence the direction of the analysis and chape the themes that emerge. By involving CAB members in the coding tree development process, we facilitated their active participation in data analysis in a less intensive manner such as learning to line by line code. Using the term “hashtag” helped enable their understanding of the codebook, allowing them to provide more effective feedback. The approach helped the CAB conceptualize how the codes would generate relevant information, which would then serve as the foundation for interpertation and the generation of themes.

Strategy 3: Utilizing visualization software for dynamic review of codebook

During our 3-hour meeting we innovatively utilized a platform called Miro board to engage the CAB members. A coding tree or codebook is defined as the categories used to organize and analyze data in a qualitative study. Coding trees are often created using inductive and deductive processes. Inductive consist of reviewing the data to ensure the voices of the participants guide the development of the coding tree. Deductive or a-priori processes consist of reviewing the literature to ensure the already well-established patterns and/or themes that are part of the evidence-based research. The research team reviewed all 22 transcripts, coded ¼ of them, and engaged with the literature over a 3-month period to ensure we came up with a comprehensive list of both deductive and inductive codes. We condensed the list and brainstormed ways to share with the stakeholder community in a manner that was open to co-creation and co-refinement of the code book.

Miro board allows for an interactive and visually appealing presentation of the coding tree (see Figure 1, 2 and 3). For the community data analysis portion, we shared two versions of the coding tree and provided examples of a de-identified transcription section from actual participants in this study to illustrate how the coding tree is applied. CAB members were invited to navigate to the Miro board and review the codebook together. We stayed in a larger group rather than divide up into smaller groups for the sake of time. Although CAB members had the option to add and rearrange items on the Miro board themselves, they chose instead to engage in an open discussion about what they were observing, while my team made the changes and edits as we discussed.

While other researchers have involved community members in the actual analysis of the text (see Jackson, 2008), we focused on including them in the creation of the categories that inform the analysis. Overall, their involvement led to four concrete changes. The first was the addition of three new parent code around family history and family experiences with cancer, cervical cancer knowledge and education, and a code focused on stigma and shame. The second was the addition of fertility/family planning under the impact code and non-formal interpersonal relationship under social support as well as adding money financial under cervical cancer journey factors. The final change was the addition of children and other non-formal interpersonal relationships under social support as important subcode to capture.

Despite having limited time, different technology background, and a diverse linguistic and educational background CAB, the three strategies utilized facilitated an active and fruitful discussion on the codebook informing analysis (See Appendix II).

CONCLUSION

Involving the community in data analysis can seem to be overwhelming, excessive, or impossible. However, we facilitated community engagement by using the codebook and presenting it in everyday language, incorporating hashtag with visuals. We are just beginning to explore how to effectively involve the community into qualitative data analysis (Hess et al., 2022). What is clear is that their insights and modifications to the codebook can significantly influence the direction, perspective, and content that we, as a research team, focus on during the more formal traditional coding process in NVIVO. It is not necessary to train community members in NVIVO or have them line-code the transcripts to capture their voice. By inviting their input through the codebook, we can engage them in a direct and meaningful manner. The next step will be to return to the CAB with the emerging themes to refine the outcomes of our in-depth analysis. This process could also include evaluating their experiences in this collaborative effort.

We hope other community-centered projects can learn and implement more innovative qualitative community data analysis techniques that engage community members. While progress has been made, more work is needed to fully involve community partners in all aspects of the research (Cashman et al., 2008; Jackson et al., 2018). We shared one innovative approach in which community partners can in an important part of qualitative data analysis, rather than being limited to validity and reliability checks. While some studies involve the community in creating the categories, that can be time consuming. Input at the codebook level is also critical: inclusive, community stakeholder-informed codebooks can significantly change the direction and outcomes of data analysis. Importantly, community members do not need to learn NVIVO to be part of data analysis. They can still play a crucial role in guiding and informing the themes that emerge from the research findings by actively participating in decisions about which codes are applied during line-by-line analysis. The diverse roles and skills of the community members - whether students, patients, clinicians, or academics- complement each other. We suggest that including CABs in refining the codebook enhances the data analysis process, ultimately enriching the insights and findings.


Funding

Julie Stott Center for Women’s Health Grant, National Center for Advancing Translational Science by the Oregon Clinical & Translational Research Institute (OCTRI) KL2 award 5 KL2 TR 2370-5 and Building Interdisciplinary Research Careers in Women’s Health (BIRCHW) award K12 HD043488.

Conflict of Interest

None