16 Kush R. Varshney
Ramamurthy, Srividya Ramasubramanian, Shubham Singh, Mudhakar Srivatsa, Lauren Thomas Quigley, Lav Varshney,
and Pramod Varshney for providing substantive comments on earlier drafts of this piece.
REFERENCES
[1] 2023. Foundation Models: Opportunities, Risks and Mitigations. Technical Report. IBM AI Ethics Board, Armonk, NY, USA.
[2]
Swapnaja Achintalwar, Adriana Alvarado Garcia, Ateret Anaby-Tavor, Ioana Baldini, Sara E. Berger, Bishwaranjan Bhattacharjee, Djallel Bouneouf,
Subhajit Chaudhury, Pin-Yu Chen, Lamogha Chiazor, Elizabeth M. Daly, Rogério Abreu de Paula, Pierre Dognin, Eitan Farchi, Soumya Ghosh,
Michael Hind, Raya Horesh, George Kour, Ja Young Lee, Erik Miehling, Keerthiram Murugesan, Manish Nagireddy, Inkit Padhi, David Piorkowski,
Ambrish Rawat, Orna Raz, Prasanna Sattigeri, Hendrik Strobelt, Sarathkrishna Swaminathan, Christoph Tillmann, Aashka Trivedi, Kush R.
Varshney, Dennis Wei, Shalisha Witherspooon, and Marcel Zalmanovici. 2024. Detectors for Safe and Reliable LLMs: Implementations, Uses, and
Limitations. arXiv:2403.06009.
[3]
Swapnaja Achintalwar, Ioana Baldini, Djallel Bouneouf, Joan Byamugisha, Maria Chang, Pierre Dognin, Eitan Farchi, Ndivhuwo Makondo,
Aleksandra Mojsilović, Manish Nagireddy, Karthikeyan Natesan Ramamurthy, Inkit Padhi, Orna Raz, Jesus Rios, Prasanna Sattigeri, Moninder
Singh, Siphiwe Thwala, Rosario A. Uceda-Sosa, and Kush R. Varshney. 2024. Alignment Studio: Aligning Large Language Models to Particular
Contextual Regulations. arXiv:2403.09704.
[4] Rachel Adams. 2021. Can Articial Intelligence Be Decolonized? Interdisciplinary Science Reviews 46, 1-2 (March 2021), 176–197.
[5]
Moses Adesola Adebısı. 2014. Knowledge Imperialism and Intellectual Capital Formation: A Critical Analysis of Colonial Policies on Educational
Development in Sub-Saharan Africa. Mediterranean Journal of Social Sciences 5, 4 (March 2014), 567–572.
[6]
Md Sultan Al Nahian, Spencer Frazier, Mark Riedl, and Brent Harrison. 2020. Learning Norms from Stories: A Prior for Value Aligned Agents. In
Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society. 124–130.
[7] Syed Mustafa Ali. 2016. A Brief Introduction to Decolonial Computing. ACM XRDS: Crossroads 22, 4 (Summer 2016), 16–21.
[8]
Adriana Alvarado Garcia, Juan F. Maestre, Manuhuia Barcham, Marilyn Iriarte, Marisol Wong-Villacres, Oscar A. Lemus, Palak Dudani, Pedro
Reynolds-Cuéllar, Ruotong Wang, and Teresa Cerratto Pargman. 2021. Decolonial Pathways: Our Manifesto for a Decolonizing Agenda in HCI
Research and Design. In Extended Abstracts of the CHI Conference on Human Factors in Computing Systems. 10.
[9]
Yuntao Bai, Saurav Kadavath, Sandipan Kundu, Amanda Askell, Jackson Kernion, Andy Jones, Anna Chen, Anna Goldie, Azalia Mirhoseini,
Cameron McKinnon, Carol Chen, Catherine Olsson, Christopher Olah, Danny Hernandez, Dawn Drain, Deep Ganguli, Dustin Li, Eli Tran-Johnson,
Ethan Perez, Jamie Kerr, Jared Mueller, Jerey Ladish, Joshua Landau, Kamal Ndousse, Kamile Lukosuite, Liane Lovitt, Michael Sellitto, Nelson
Elhage, Nicholas Schiefer, Noemi Mercado, Nova DasSarma, Robert Lasenby, Robin Larson, Sam Ringer, Scott Johnston, Shauna Kravec, Sheer
El Showk, Stanislav Fort, Tamera Lanham, Timothy Telleen-Lawton, Tom Conerly, Tom Henighan, Tristan Hume, Samuel R. Bowman, Zac
Hateld-Dodds, Ben Mann, Dario Amodei, Nicholas Joseph, Sam McCandlish, Tom Brown, and Jared Kaplan. 2022. Constitutional AI: Harmlessness
from AI Feedback. arXiv:2212.08073.
[10]
Michiel A. Bakker, Martin J. Chadwick, Hannah R. Sheahan, Michael Hnery Tessler, Lucy Campbell-Gillingham, Jan Balaguer, Nat McAleese,
Amelia Glaese, John Aslanides, Matthew M. Botvinick, and Christopher Summereld. 2022. Fine-Tuning Language Models to Find Agreement
Among Humans with Diverse Preferences. In Advances in Neural Information Processing Systems. 38176–38189.
[11]
Periaswamy Balaswamy. 2013. Histories From Below: The Condemned Ahalya, the Mortied Amba and the Oppressed Ekalavya. SSRN:3175708.
[12]
Emily M. Bender, Timnit Gebru, Angelina McMillan-Major, and Shmargaret Shmitchell. 2021. On the Dangers of Stochastic Parrots: Can Language
Models Be Too Big?. In Proceedings of the ACM Conference on Fairness, Accountability, and Transparency. 610–623.
[13] Ruha Benjamin. 2022. Viral Justice: How We Grow the World We Want. Princeton University Press, Princeton, NJ, USA.
[14]
Noam Benkler, Drisana Mosaphir, Scott Friedman, Andrew Smart, and Sonja Schmer-Galunder. 2023. Assessing LLMs for Moral Value Pluralism.
arXiv:2312.10075.
[15]
Sebastian Benthall and Bruce D. Haynes. 2019. Racial Categories in Machine Learning. In Proceedings of the Conference on Fairness, Accountability,
and Transparency. 289–298.
[16] Abeba Birhane. 2020. Algorithmic Colonization of Africa. SCRIPTed 17, 2 (Aug. 2020), 389–409.
[17] Abeba Birhane. 2021. Algorithmic Injustice: A Relational Ethics Approach. Patterns 2, 2 (Feb. 2021), 100205.
[18]
Abeba Birhane, Elayne Ruane, Thomas Laurent, Matthew S. Brown, Johnathan Flowers, Anthony Ventresque, and Christopher L. Dancy. 2022. The
Forgotten Margins of AI Ethics. In Proceedings of the ACM Conference on Fairness, Accountability, and Transparency. 948–958.
[19]
Rishi Bommasani, Sayash Kapoor, Kevin Klyman, Shayne Longpre, Ashwin Ramaswami, Daniel Zhang, Marietje Schaake, Daniel E. Ho, Arvind
Narayanan, and Percy Liang. 2023. Considerations for Governing Open Foundation Models. Issue Brief. HAI Policy & Society.
[20]
Matt Bornstein, Guido Appenzeller, and Martin Casado. 2023. Who Owns the Generative AI Platform? https://a16z.com/who-owns-the-generative-
ai-platform/.
[21] Djallel Bouneouf. 2023. Multi-Armed Bandit Problem and Application. Independently Published.
[22]
Joy Buolamwini and Timnit Gebru. 2018. Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classication. In Proceedings
of the Conference on Fairness, Accountability and Transparency. 77–91.
Manuscript submitted to ACM