I never really thought much about “community engagement” or knew anyone closely involved in “community building” activities back in China. I mean, outside of academic settings.
I grabbed coffee with 2 classmates today during the 10-mins class break, and our conversation turned to what our parents do. I said my mom was a pharmacist but became a housewife after my third or fourth year of elementary school. I also shared that what I admire most about her is she’s good at getting along with her own, living, and being compatible with herself. She’s making friends despite not having a career, and having hobbies of focused things to do. I certainly couldn’t do that on my own, and I can hardly imagine what it would be like if I didn’t have a job. I was carefully using the words job or career, in case someone might say housewife is also a job.
Then, I started reflecting more on her life. She’s been deeply engaged in a book club, even as the president, for several years. [I don’t think ‘president’ is the official term; it’s just the person taking on the responsibility of managing some stuff.] This made me wonder about the nature of this group - could it be considered a community? People coming to this book club are mainly non-working, stay-at-home moms. They met weekly and often invited a teacher, which can be lectures in series or one-time talks, sometimes followed by lunch and even extra outings to parks or museums.
My mom doesn’t put all her energy into this book club or its members. She mentioned to me that she doesn’t enjoy socializing very much, that classes and reading are more important to her, that she sometimes doesn’t like too many lunches together, or that she doesn’t like some other members. These are all gossip and trivial matters.
She also has friends from calligraphy, Chinese painting, and oil painting classes, and groups where they learn about and taste tea together. I don’t know much if they met through the book club and expanded into other interests together, or if they met at other activities and then my mom invited them to join the book club - or maybe it was a mix of both. There definitely is a community [are many communities] among these middle-aged women, though I’ve never quite seen it from this perspective before. [Recently, after COVID, I saw that my aunt became the president, and she’s sharing all the club’s information on her social media.]
During the first and second weeks, we also briefly discussed the differences between libraries in China and the United States. In addition to the systematic and institutional differences, my views were very much related to my own perceptions of myself in the different environments, and the sense of where I could position myself in the community I was in. For example, in China (Beijing), my familiarity with the surroundings meant that I never felt the need for public library resources or services, leaving me with little impression of them. But in the U.S., it’s a totally different story. [I love visiting public libraries when I travel, and the DC/Arlington Public Libraries greatly improve my quality of life in the US!] My mom’s activities aren’t associated with the library, but I’m wondering if libraries have similar initiatives and programs. What’s hilarious is that my grandma was a librarian, though an academic librarian.
Of course, I’ve also realized that the mindset with which one approaches daily life is so different from the mindset required for conducting social science research. Personally, pursuing social science research is not a path. That’s incompatible with my mental health. Haha :-D
2024.02.13
I have my new research questions, though I’m not going to do this research yet. pov. Policies of AI in Academia
For non-native English-speaking students, whether using AI tools, such as translators, writing and grammar checker apps, or paraphrasing tools (e.g., ChatGPT, DeepL, Grammarly, QuillBot, etc.) to complete assignments violates academic integrity.
Can native speaker professors discern if an assignment was produced using AI tools? Similarly, can professors who are not native English speakers detect AI-generated work?
There’re more and more articles and materials on the Internet generated by AI tools. If the language material we are learning comes from AI-generated sources, are we learning “English”, or are we learning a different “language”?
Now, you can evaluate how much AI I used in the text above.
2023.12.19
Sharing this post here as this might be a quiet corner, hopefully nobody is watching me, with my baby-ish English expression-
I just wrapped up my IST 691 Deep Learning in Practice course today. Frankly, it’s been the most exciting course throughout my Applied Data Science Master’s program. It's exactly what I've been expecting, especially fortunate, considering it's the first time the course is offered in-person during a regular semester. I’ve finally ventured into the world of techniques and tools for working with images, text, and audio, which are relevant in the arts and cultural sectors, unlike the previous courses focused on programming in R, SQL, Excel, or machine learning with ready-to-use numerical or categorical datasets. I can’t say whether the absence of mathematics in this course is beneficial or detrimental. Consequently, I completed a Deep Learning course without calculus or linear algebra knowledge. You could definitely say that this course is too simplistic for someone pursuing an Applied Data Science degree. However, focusing on the practical application, and becoming proficient in 'using libraries and APIs' is fairly what I need at this point.
For the final group project, we worked on the Flickr8K Image Captioning task using the VGG19 and Transformer models with the TensorFlow and Keras libraries in Python. While the project wasn’t entirely ‘novel’, it gave me hands-on experience in coding deep learning models and integrating various ideas into a cohesive solution. I read the ‘Attention is All You Need’ paper and learned from codes shared on Kaggle and GitHub. We implemented transfer learning using the pre-trained VGG19 model, combined with a Transformer architecture from scratch, and employed BLEU scores for result evaluation. During this, we had to examine the output shape of our VGG19 model and develop a customized model to connect it with the Transformer. Moreover, the most crucial learning experience was managing coding streams and optimizing the training pipelines. I’ve come to understand that deep learning involves a blend of mathematics, theoretical concepts, and coding skills. In this course, we mostly skipped the complex math, grasped the concepts in lectures, and honed our coding ability through practical final projects.
Transformer has always been fascinating to me. However, initially, I was so confused about all the many concepts and terms during the lectures and textbook reading. It was only through working on this project, while debugging and ensuring seamless integration, and adding comments to all the code, that I truly started to understand how a Transformer is constructed from the ground up. This involved everything from Mask, 1D and 2D Positional Encoding, Encoder, Decoder, the Multi-Head Layers, and Model Constructing, etc. Additionally, I learned to set the learning rate schedule and optimizer, set checkpoints, etc. Two weeks ago, during FaceTime with my dad, he asked me to explain what the parameters mean in neural networks and what a Transformer is. I explained it in the way I understood at that point. But I can absolutely explain this much better now.
Actually, when GPT-4 came out with its image captioning features around late September and early October, I was so thrilled. For me, it’s a boon for both my personal and professional work, and beneficial for my friends and coworkers. As a photographer and contributor to Getty Images iStock, and VGC China, I always have to write image descriptions and keywords every time I upload a picture to their database. I hate this part of the job. You see, I chose photography as a way to record my life and capture the world around me, to express myself without needing words. So, it feels ironic, almost frustrating, that I still end up having to weave words for every photo I upload. Writing descriptions fills me with anxiety and hinders me from uploading more photos. So, having ChatGPT-4 generate image descriptions and keywords, whether as a short alt text sentence or a longer descriptive paragraph, is incredibly helpful and makes things much easier.
Regarding alt text, while I was taking the MUS 607 Collections Management course in Spring 2022, we were assigned to create alt texts for some collections in our SU Art Museum. We also learned that the Museum of Contemporary Art Chicago, particularly through their Coyote project, was one of the pioneers in this area. For that assignment, creating alt text was initially a little bit challenging. It’s not easy to describe the colors and strokes of some abstract artworks, or be professional in organizing language in short but informative sentences for a non-native speaker. I searched for AI image description generators and found a helpful beta version online. It could generate 5 sentences for each uploaded picture and usually provided excellent captions. At that time, I really hoped for an officially released tool or application for image captioning, as beta versions might suddenly become unavailable.
What’s more, while I was interned as a digital archivist at the Community Folk Art Center, I digitized archival photos, which sometimes included capturing the handwritten notes on the back of the photos. While I never captioned the content of the photos at that time, it could be possible in the future. Additionally, my close friend Joy worked at the SU Bird Library under Kiley Jolicoeur, the Metadata Strategies Librarian in the Department of Digital Stewardship, during Spring of 2023. What Joy did was exactly caption the digital images for the digitized archives, a task we refer to as ‘enriching descriptive metadata’ in the library context. Her job also involved recording the content and the position of handwritten names, signatures, and notes on each item. She could process about 10 items in half a day. When I told her that GPT-4 announced their new features, we both were extremely excited about the possibilities.
Now, it turns out that I've learned to build a small model for image captioning. Our model’s average BLEU Score on the validation set is 0.11. We tested it with images from the @syracuseu and @ishcoolsu Instagram accounts. It’s not perfect, but it performs pretty well, considering our limited training set and computing resources. I know GPT-4 can do much better than any projects I could ever do, even though they’re not specifically trained on data focusing on museum collections. I understand that for many art historians or museum curators, writing alt text is a pleasurable task, and they might prefer human-written sentences over AI-generated ones. But my focus is on how technology can enhance work efficiency and performance. I’m excited about the possibility of gathering existing museum alt texts along with their images to create a targeted training set. This would enable us to fine-tune a model specifically for this purpose, addressing museum professionals’ concerns about the accuracy of general image captioning products.
Finally, a huge shoutout to Professor Chris Dunham and my group mates Haotian and Lu for their support and teamwork. Couldn’t have done it without them!
Course textbook: Glassner, Andrew. Deep Learning: A Visual Approach. No Starch Press, 2021.
Article on museum accessibility: Watlington, Emily. ‘How Museums Are Making Artworks Accessible to Blind People Online.” Art News, Feb 12, 2020. https://www.artnews.com/art-in-america/columns/the-met-mca-chicago-blind-access-alt-text-park-mcarthur-shannon-finnegan-1202677577/. Accessed Dec 19, 2023.
Participating in a research project on misinformation about immigration on X.com has been a new experience for me. This was my first time learning about the term “misinformation,” as I was previously only familiar with “fake news.” Arbitrarily, I feel that terms like “dis/misinformation” are more commonly understood by communication scholars and perhaps Americans than by the general public, including people like me.
The project involved analyzing more than 10k X posts to determine whether they contained misinformation or exaggeration, as well as assessing their stance (anti-immigration, pro-immigration, or unsure). My role as a research assistant was to complete content coding to meet my advisor’s needs, which I accomplished. Though, while working on the project, I found myself reflecting on some other questions.
What I was thinking about is the nature of information itself. Information can either be a fact or an opinion. Facts can be classified as either misinformation or true information, but as facts, they shouldn’t inherently carry a stance. Opinions, on the other hand, are not typically labeled as misinformation or not; they are expressions of a person’s stance. Therefore, labeling an X post with two attributes, “misinformation” and “stance,” doesn’t fully make sense to me.
Admittedly, the reality is often more complex. Many posts are a mix of fact and opinion. They include factual statements while simultaneously expressing opinions. Furthermore, in some cases, even when a participant shares only factual information, their stance or ideology can still be inferred from their identity or other contextual clues. In such cases, it seems manageable to assign both labels.
Another thought I had was about bias—both my own and that of others. Most of our team members on this project are international students or professors who do not vote in the U.S., yet I feel that we all bring our own biases and stances to the issue of immigration. I also wonder about American scholars who study misinformation, particularly in the context of political communication. Are they truly neutral or unbiased in their work? For example, if someone votes for Democrats or Republicans, how much do they really “hate” or feel compelled to combat exaggerated information that supports their own party? Especially on social media, which I personally see as platforms that inherently give me many entertainment, as well as propaganda and persuasion.
Social media exists and attracts users because it amplifies certain aspects of human nature, particularly emotional responsiveness, often at the expense of rationality and objectivity. Rhetorical devices are prevalent in human language expressions. If I say, ‘I have tons of homework,’ or ‘I have told you a million times!’ and they are viewed as exaggerations instead of true information, it is easy to categorize me as a misinformation spreader, which doesn’t make much sense. At the same time, people may communicate in a passive-aggressive or sarcastic manner, or even adopt opposing viewpoints, a form of trolling, to ridicule or provoke others. Analyze social media content is complicated even with the most advanced language models.
2024.07.22
I’m feeling a mix of emotions that I can’t quite articulate. Last weekend, I visited Annapolis and St. Michael’s, where I went to the Chesapeake Bay Maritime Museum. There, I saw several artworks by Maryland-based artists or those inspired by the Chesapeake Bay, dating back over a hundred and more years. I was truly moved by these works. Though not widely known and far from the household names you’d learn about in a typical art history class, these artists documented the people and places belonging to their community. I really appreciate local history museums like this one.
This weekend, I’m in New York City and visited several galleries in Chelsea, filled with modern and commercial art. This juxtaposition of different art forms and their contexts is really interesting.
2024.06.19
Prompt Engineering, Reference Librarian, Apple Mac 40th anniversary
This year marks the 40th anniversary of the Apple Macintosh. Apple Mac revolutionized computing by introducing graphical user interfaces, building on earlier developments by Xerox. Allowing users to interact with computers not solely through code, this innovation made technology accessible to a broader audience.
Currently, using GenAI effectively requires prompt engineering, where specific prompt formats help the AI give better responses. This necessity means that optimal use of GenAI is not yet widely popular. However, Apple Intelligence has the potential to simplify this process, allowing users to interact with GenAI using more everyday language within the Siri context, thereby enhancing its accessibility and effectiveness for the future.
When I first began using ChatGPT, I had just finished a course on Reference & Information Literacy Services. The course trained us to work as reference librarians, guiding patrons to clearly articulate their information needs through interviews and iterative questioning. “A good librarian knows how to listen to the unspoken assumptions, unconscious biases, or unknown unknowns embedded in a research request and to probe for the ‘question behind the question’” (Mary Ellen Bates). Despite the change in roles, I tried to express my needs as clearly and in as much detail as possible in my daily conversations with GenAI. At that time, I hadn’t yet heard of terms like prompt engineering, chain of thought, or zero-shot.
A 2013 research paper by Andrew J. Nelson and Jennifer Irwin discussed how technology, especially Web 2.0, has transformed librarians’ occupational identities. Though I quit my MLIS degree, I still often connect AI-related topics with librarianship.
I first heard about the concept of ethnography in my museum studies courses. And I learned about anthropology as a discipline during my application process for the master’s degree in museum studies while still in China. This was also when I became familiar with the concept of ecomuseums–being practiced in China’s Yunnan and Guizhou provinces–and research on intangible cultural heritage and oral histories. I hardly recall learning about these subjects as an undergraduate. And I’ve since realized that, despite apparent similarities, art history and museology are quite distinct disciplines.
My interest deepened during my trip to Mexico, where I visited the National Museum of Anthropology. I compared it to museums in China, where most institutions are identified as history and archaeology museums, rarely using the term ‘anthropology.’ The few exceptions include the Museum of Anthropology at Xiamen University, Yunnan University, and The Minzu University of China (which focuses on the study of China’s various ethnic groups).
Reflecting on my visits to the Xinjiang Uyghur Autonomous Region Museum and the Tibet Museum in Lhasa, I remember noticing, even at a young age, a conspicuous absence of the concept of anthropology in their exhibitions. Instead, the sections that stood out were those dedicated to archaeological discoveries. While I reserve judgment on the Western media’s anti-China rhetoric of Xinjiang and Tibet as internal colonies of China, I do recognize that these museums could do more to represent the regions’ multi-ethnic cultures. This made me wonder if the Spanish colonization of Mexico influenced the National Museum of Anthropology’s approach, and whether the oversight in Chinese museums is due to anthropology’s roots in the European colonization of non-European regions, possibly leading to a deliberate avoidance of the term in China. I understand the differences between anthropology and archaeology, such as Malinowski’s functionalism, which emphasizes the present functionality of objects, as opposed to a focus on tracing historical evolution and communication.
Another point of curiosity for me is that from high school onwards, we’re introduced to “essays” as a genre, yet it appears that these aren’t considered anthropological literature, despite sharing significant content and stylistic similarities. Could the authors, being literary scholars, lack specialized training in anthropology? Considering works like “Winter Pasture” by Li Juan, which has been translated into English, it raises the question: How do we differentiate between ethnographic works and essays?
Putting my personal experiences aside, I love non-fiction books and documentaries, though I haven’t read much. In my readings, I haven’t identified many research patterns, often finding myself in confusion, as noted in “The Analytic Lenses of Ethnography,” which states, “this inductive approach offers little practical guidance on how to handle the many analytic choices the ethnographer must make in the course of their site selection and data collection.”
It helped me to sort out such regularities. Instead of being afraid of the research process, I’m very excited about learning some of the systemic methods and styles of ethnography, knowing that the researcher needs to choose the analytical traditions (analytic choices). As the article mentions, “Our identification of these analytic lenses and the way in which we relate them to one another is a distillation of our own scholarship and teaching of how to read, perform, and write ethnography.” It further explains “the basic modes of analysis that typify each ideal type, trace their implications for how one selects units of observation, and demonstrate how these different ethnographic styles illuminate different dimensions of the social world.” “Given this, ethnographers should have a working knowledge of the ideal-typical approaches and understand what the tradeoffs and consequences are for adopting a particular way of seeing. Delineating these approaches also helps us evaluate ethnographic texts because it provides the reader with a heuristic tool for identifying the presumptions and interpretive frames that undergird authors’ analyses.”
Annette N. Markham’s article discusses the necessary methodological adjustments for conducting ethnographic research in social media and online contexts. With the evolving definitions and expanded scope of anthropology and ethnography, it’s essential to focus on the practice level while advocating for a more reflexive approach. (Interestingly, I’ve noticed many articles on ethnographic research authored by Australian academics; I’m curious if there’s an underlying reason for this.) Nowadays, graduates in anthropology (and journalism) are not confined to traditional research paths; many find roles within internet technology companies, engaging in software development or user experience research. This involvement aims to understand human behavior better, thereby aiding in the design and improvement of physical and software products. This shift and expansion of the anthropological discipline suggest that research methods must also be contemporary and adaptable.
I first learned the term “shadowing” in a Data Warehouse (Business Intelligence) course. It described a process where business analysts, data scientists, and engineers gather functional and non-functional requirements in the initial stages of developing data warehouses for organizations. This preparatory work involves conducting interviews, observations, and other field research. Subsequently, I took the opportunity to review the syllabus for class IST621 and explore Professor Ingrid Erickson’s Google Scholar profile to understand her current research interests better. I should be taking this or the Future of Work class; both seem incredibly interesting, and it’s sad to realize this just as I’m about to graduate. The study of entrepreneurship, examining the digital transformation processes within various businesses through qualitative methods, and interviewing Indian women startup CEOs represent a fascinating intersection of informatics and management.
My interest leans more toward applying anthropological methods in the study of product development and business management rather than in folklore studies. (As a side note, I’m starting to explore PhD programs in Information System Technology Management, mostly within Business schools.) This exploration has alleviated my apprehension towards traditional anthropology and ethnography. It’s not that I find traditional anthropology uninteresting; rather, it has always seemed like a field that’s hard to reach for me. Meanwhile, as a photographer and photobook enthusiast, I deeply wish to receive training in anthropology and then apply it to photography, exploring the world through ethnographic and sociological observation and research.
Readings:
Forsythe, D. E. (1999). “It’s Just a Matter of Common Sense”: Ethnography as Invisible Work. Computer Supported Cooperative Work: CSCW: An International Journal, 8(1-2), 127–145. https://doi.org/10.1023/A:1008692231284
Jerolmack, C., & Khan, S. (2017). The Analytic Lenses of Ethnography.
Socius, 3, 1–11. https://doi.org/10.1177/2378023117735256
McGranahan, C. (2018). Ethnography beyond method: The importance of an ethnographic sensibility. Sites a Journal of Social Anthropology and Cultural Studies, 15(1). https://doi.org/10.11157/sites-id373
Markham, A. N. (2013). Fieldwork in social media: What would Malinowski do? Qualitative Communication Research, 2(4), 434–446.
http://dcqr.ucpress.edu/content/2/4/434.abstract
Quinlan, E. (2008). Conspicuous Invisibility: Shadowing as a Data Collection Strategy. Qualitative Inquiry: QI, 14(8), 1480–1499.
https://doi.org/10.1177/1077800408318318