COS 333: Project Ideas from Around Campus

Sun Mar 11 09:23:34 EDT 2018

Newer items are at the front.

Overview

There are many individuals and groups on campus who have really interesting problems that could be profitably attacked by folks in COS 333. Here are some of them. They have come from a variety of friends and colleagues on campus. I've edited their prose a bit, but have tried to leave their ideas intact. You're welcome to approach the people listed directly, and I would be happy to act as an intermediary if you prefer, or to help you solicit more information. The first half dozen or so are new; the others are hold-overs from previous years but still of interest.


Health & Safety systems, Kyle Angjelo, Chemical Safety

  1. A database and interface for Hazardous Waste Manifest tracking and reporting. It would be nice if it included RMW and Universal Waste. Autoscan for manifests would be nice.
  2. A software database that can be used to track incidents and accidents. The main function would be searching and providing stats on different searches (e.g. undergrad incidents, eye incidents, related to missing PPE)
  3. Time tracking software with a nice user interface (hard to break). The tracker should be able to generate reports on how much time is spent on specific functions. It should be able to tally functions across different people.
  4. A DEA Destruction mobile app that allows us to screen capture usage logs and have that information automatically placed into a DEA Form 41. It should also maintain the usage log repository
  5. A Hazardous Waste mobile app to track where hazardous waste is coming from. QR codes could be used to gather user information and generate reports on how much waste and what waste is being generated.
  6. A Hazardous Waste Emergency Response quick calculator mobile app. It could take data from Tigerspace and use room dimensions, ACH, and chemical information to calculate contaminant concentrations.
  7. An interactive interface for users to learn how to adjust their computer workstations to avoid repetitive strain issues.
  8. A specialized app to allow UHS people to access health forms and get results from them (with Duo authorization). This would include setting up a database for all health information maintained by UHS.


Peer support platform for aspiring teachers, Alexandra Zalewski, Tiger Challenge

The TeachOn team from the Tiger Challenge program is using design thinking to address the teacher shortage in New Jersey. There are many people who consider teaching but abandon this path due to lack of community support. We discovered a need for an interactive platform that allows teachers and aspiring teachers to contribute information to a one-stop resource on how to become a teacher (addressing functional and emotional questions). We imagine a setup in which users can make accounts, take career direction quizzes, view stats and information, interact with teachers and view webinars. We would want to start it off with logins through Princeton NetIds and CAS so that strangers could not access it without a login. There is a lot of freedom in what features will be on this platform, as long as they help attract and support aspiring teachers.


Improve Tiger Energy, Caroline Savage, Office of Sustainability

This project would try to improve Tiger Energy by:


Academic Task/Assignment Time Estimator, Nik Voge, McGraw Center

Time is in short supply for Princeton students. This makes scheduling and planning of academic tasks and activities such as completing p-sets, assigned reading, papers, and projects difficult. Because assignments can be quite challenging and time consuming and because they can vary considerably not only from course to course, but also from week to week, it is often difficult for students to accurately predict how much time tasks will require. At the same time, most students, with the encouragement of the university, are involved in extra-curricular, career preparation, and social activities, which results in a relatively small margin for error in planning and scheduling.

In many cases students do not budget adequate time to complete their academic work, leading to unmet grade (and learning) goals and feelings of dissatisfaction. Students often lack sufficient information to effectively plan and schedule their academic work and other aspects of their lives.

One recent innovation is Rice University's Course Workload Estimator. While the Course Workload Estimator has been a useful tool for instructors, it can be improved upon. It can be adapted to Princeton's distinctive academic environment, including its instructional materials and evaluation standards. Another improvement is continuously refining the algorithm by which the estimates are made by collecting input from students in specific courses on the amount of time various tasks demand. Additionally, the corpus of data collected can be analyzed to better understand the academic time demands across campus, an endeavor which has never been undertaken in any systematic manner to my knowledge.


Princeton Analytica, Sam Wang, Neuroscience

Project: Princeton Analytica Cambridge Analytica is a political consulting company that attained fame by microtargeting of voters based on their demographic characteristics. Their services are proprietary. This project is a step toward doing the same, potentially in an open-source manner, and with an application to a current hot topic, gerrymandering.

Both political and legislative operatives need to fill in missing data. For example, this comes up in fights over gerrymandering, in which redistricters have to be able to tell if a plan is unfair ? or conversely, if they want to build a plan that is as fair as possible. However, they don't always have complete information. For example, they might know the vote totals in some precincts but not all of them. However, they do have complete sets of other variables, such as census information or Google search terms. See the following example: http://election.princeton.edu/2016/04/26/google-wide-association-studies/

The first part of the project is to build a classifier that helps people impute complete data, and validate how good that imputation is. The ideal case is to have a front end that can work to a naive user.

This can be expanded to the actual redistricting problem by adding an adjacency matrix that shows which precincts (or cities or counties or whatever) are next to each other, and then let people join them together. A person could get a score of how compact and how partisan their district was. That requires GIS and other data.

Manual district-building is probably relatively easy. Automated is more interesting. For example, an automatic procedure could be biased toward building compact districts: (1) Join two entities and calculate the centroid, then (2) bias the choice of the next entity according to how close it is to the centroid of the existing combination, and (3) repeat until the population target is hit. One would want to do a literature search to find out how similar this is to existing procedures by Chen and Rodden, Mattingly, Fifield and Imai, Tam Cho, and others.


Presenting Cultural-Heritage Data Online, Cliff Wulfman, Library

A very large quantity of the cultural-heritage material that has been digitized is encoded and stored in XML: information about the objects (metadata); information about digital images of the objects (file types; file paths; technical info about the files).

There has been much buzz in the digital cultural-heritage community in recent years about the International Image Interoperability Framework (IIIF). IIIF is a set of specifications for APIs to web services, including an Image API, which deliver images (at various resolutions, orientations, etc.) and a Presentation API, which delivers a structured representation of complex image-based digital objects in JSON-LD (JSON for Linked Data).

Princeton's Digital Library includes a collection of Princeton-area newspapers, including the entire run of The Daily Princetonian from its founding in 1876. The digital representations of these newspapers are encoded in XML – a particular blend of XML schemas called METS/ALTO.

The project would be to create a IIIF-based viewer for The Daily Princetonian historical collection by implementing IIIF APIs:


Data collection and presentation for student outcomes, Jed Marsh, Vice Provost for Institutional Research

There is an increasing interest in student outcomes after the initial placement -- say 10 years post degree. Currently, these data are harvested from a hodge-podge of sources, including scraping sites like LinkedIn. There's a fair amount of staff time spent across campus googling former students, both graduates and undergrads. We need tools that:
(1) improve data collection from the web. Could there be an API from LinkedIn or job search sites? Could one develop an app to systematically search for and harvest CV's & resumes posted by Princeton Alumni?
(2) Categorize unstructured employment data (job code, employer, etc.,) into standardized occupation (SOC) and industry (NACIS) codes.
(3) Store these data in a common repository that could be available for student outcome studies.


Margaret Martonosi, Director, Keller Center

A truly unified regional event calendar. There isn't even a good unified event calendar for items on campus. If you start trying to merge in off-campus events, it gets worse. Try finding a single site that has events from the Music department, the McCarter Theatre, the Garden Theatre, the Princeton Public Library. It makes me sad to find out about cool events the day after they happen.

Something related to sexual harassment would be timely. Could involve education on Title IX issues, or safe rides home, or something else.


Themed historical tours of campus, Abby Klionsky '14, Office of the Executive Vice President

As a breakout group of the Campus Iconography Committee, the Princeton History Working Group is building a series of themed historical tours of Princeton's campus that will highlight lesser-known histories of the university. These will take shape in the form of a mobile app, which will use wayfinding technology to guide users to sites across campus and showcase associated photos, audio, and video to tell these stories. For some of these sites, we'd like to incorporate augmented reality features -- particularly in places where there may no longer be a physical marker or building still standing. The augmented reality component we're envisioning would likely be a statue for "placement" in one of the statue-hold pedestals in East Pyne courtyard or the front of Frist, a moving image to launch over a picture frame or screen that does exist in reality, or overlaying an old image of a campus map/building over what exists today.


Entries from 2017

 

Apps for Freshman Orientation and Families Weekends, Claire Fowler, Associate Dean of the College, cmfowler@princeton.edu

[Updated 2/10] We are seeking an intuitive and simple mobile application that would allow students to access various Orientation resources, most importantly, the calendar of events. The app would have a CAS login, which will allow students to customize their calendar. Through a mobile platform, we will be able to communicate changes to the calendar as well as provide them with reminders of their sessions. Basic features would include an interactive campus tour, a display of both the Orientation calendar and a way for visitors to add events, reminder notifications, and resources on campus. This page shows graphically what Dean Fowler and her colleagues have in mind.


Improving the political process, Sam Wang (Neuroscience, MOL), sswang@princeton.edu

These ideas are in theory symmetric, in the sense that they would be equally useful to people all over the ideological spectrum.

1. Target identification for activism. This app would take a news article as input (here's a test example). Then identify reporters and legislators named in it - in this case Emmarie Huetteman, Yamiche Alcindor, Senator Susan Collins, Senator Lisa Murkowski, Betsy DeVos, and others. Then return the most effective way of reaching those people, which is email for reporters (Huetteman and Alcindor), Washington and home-state phone numbers and physical addresses for Senators, and some other unknown route for others. Note that optimizing the contact path is nontrivial - it requires locating email and Twitter addresses for reporters, and recognizing what category each person falls into. Finally, display the information so that the user can do something with it immediately.

2. Find-A-Town-Hall. A plug-in for the competitive district finder at the Princeton Election Consortium. That application finds competitive Congressional districts near the user. An event finder would locate activist events related to the district, and return it to the user. Note that there are people who are developing Town Hall databases, so there is some information available online. For robustness it would be best to rely on multiple online databases. Also note that the PEC project has been developed into SwingLeft.org, so that is another possible partner.

 


Facilitating political discussions across the right-left divide, Margaret Martonosi (CS), mrm@princeton.edu

In considering how to move past this divisive election year, many people from across the political spectrum feel that increased political dialogs between individual voters may be one of the few ways to reunify the US and decrease the divide between "left" and "right". It can be challenging, however, to find conversational partners from "the other side". Even once a conversational partner is found, it can be hard to conduct conversations across the divide in a civil and constructive manner. In this project, you will create an app to facilitate the pairing of a user with a conversational partner potentially anywhere in the US, and a framework to encourage civil and constructive conversations.

User Interface design would be a key part of the project. One possible approach would be something like Tinder, with pools of users from opposite sides of the political spectrum (either overall, or on a particular issue). Once matched, users could chat with each other. Other UI options might be whether users should be anonymous or not, whether user profile photos are useful, etc.

Once the basic app exists, many additional features are possible. For example, the app could be intelligent in providing prompts of questions or conversation starters. The app could use NLP sentiment analysis to gauge the "stress level" of the conversation and offer guidance on defusing fraught conversations to end civilly. The app could perform text analysis to avoid the use of profanities or offensive words. The app could allow both users to rate their conversational partner afterwards (like uber) and could avoid further pairings of low-rating users. While originally envisioned for bridging the left-right political divide, the framework of the app could be used for a range of polarizing or sensitive conversation topics.


Princeton and Slavery Project, Joseph Yannielli (Council of the Humanities), joseph.yannielli@princeton.edu

The Princeton & Slavery project details the slaveholding practices of Princeton's faculty, trustees, and students, investigates how funds derived from slave labor contributed to the early growth of the University, examines the ways in which students and faculty engaged some of the central intellectual and moral issues of their time, and looks at how the very southern orientation of the University shaped institutional culture. The project has amassed over 100GB of data, including thousands of pages of manuscripts, books, photos, and newspaper articles, as well as biographical information on thousands of students who attended Princeton between 1746 and 1865. We are open to any and all fresh ideas for interpreting or visualizing this data.

We are especially interested in developing a smartphone app with an augmented reality campus tour focused on the legacy and impact of slavery at Princeton. The app would allow users to uncover a "secret history of Princeton" by revealing the hidden connections to slavery behind prominent buildings, monuments, and other locations on campus. Specific locations would be linked to artifacts, videos, and interpretive essays from the Princeton & Slavery project. The app might also be "gamified (à la Pokemon GO), allowing users to collect points by uncovering hidden histories and to apply those points to uncover new artifacts and information.

(With Prof Marni Sandwiess, History)


Improving Accessibility, Shawm Maxam, Assistant Director for Diversity and Inclusion, smaxam@princeton.edu@princeton.edu

Project Idea One: Build a Chrome Extension or web service that turns off a page's CSS and creates a menu of user preferences for various levels of ability. Such as applying a color contrast compliant color palette, font size, removing animations, describing text of images and more. All of this would be controlled via a user interface in the browser and remain persistent throughout the site. This would be a benefit for all users of websites who have diverse abilities. This could be a tool that is widely shared and utilized

Project Idea Two: Build and accessible color palette creation tool for web accessibility design. Currently, there are tools out there that do some functions well, but none of them meet specifications that would be useful for Web Designers. Color contrast works on a standard algorithm so no expertise is required. The user interface can be as slick as the students can come up with. I would supply the requirements and vet them with the accessibility community in NYC. This could be a tool that is widely shared and utilized.

The tool would have a social impact value and really be in the "service of humanity" for an under represented group of people. People with disabilities represent the largest minority group in the world. The push for inclusive and accessible content has a long way to go, so tools like this can provide access where barriers currently exist.

(Contact Damian A. Sian, dsian@princeton.edu)


McGraw Center Projects, Nic Voge, Associate Director, McGraw Center, nvoge@Princeton.EDU

One idea is quite general. Perhaps a team might see a way to enhance the impact of Principedia (http://principedia.princeton.edu/) ,which is a first-of-its-kind student crowd-sourced learning resource, by linking it to ReCal.io or in some other way. Principedia can be quite helpful for advising and course selection.

Another idea would be to create an app that helps students plan and schedule their (regular) use of academic support such as tutoring, writing center appointments, workshops, office hours and advising sessions. Students lament that figuring out which academic support resource to use when poses its own challenges.

A third idea is to create a tool that would help students create thesis, course-specific (or other) support/accountability groups that helped them articulate goals, report on progress, get badges, and in other ways motivate them to learn and achieve their academic goals. It would be something analogous to groups created for dissertation writing, exercise, weight loss, etc.


User Interfaces for Sustainable Energy Research, Johannes M. Dieterich, Carter Lab, MAE, jmd2@princeton.edu

We would like to add graphical user interfaces to our three in-house codes as project ideas (these would very likely be independent projects as the requirements and workflows are rather different). As you know, Professor Carter's group works in the field of sustainable energy research, hence all of the codes are used for this type of research. Let me provide a bit of context:

* TigerCI (https://github.com/EACcodes/TigerCI/) a code for the highly-accurate treatment of small to medium sized molecules. We have recently used it for combustion analysis. A GUI would be responsible for setting up input parameters, visualize molecules and their orbitals, potentially manipulate them, and analyze/visualize results.

* PROFESS (https://github.com/EACcodes/TigerCI/) a code for small to huge (everything from 4 atoms to 1 million) systems of periodic nature (i.e., solids or liquids). Used for light-weight metal alloy design and analysis of plasma-facing components in nuclear fusion reactors. This GUI would need to visualize these systems (even to come up with a good human interface for a few thousand atoms will be challenging!), allow some manipulations, generate inputs for typically performed tasks requiring multiple calls to PROFESS (i.e., replacing some scripts we have on our side), analyze/visualize results.

* Embedding methods these are different in-house codes that link together other, modified from upstream, codes. Used for analyzing novel, light-based catalysis of energy-intensive chemical reactions. Of all the projects, I consider this to be the most challenging as the typical workflows are complex. However, this also means that a GUI simplifying these workflows for us would be of great value. A GUI would need to be able to visualize molecules, molecules on surfaces, mark and separate out a region of atoms, 3D plots of potentials, input generation, output analysis for different program codes

For all of these codes, one of the challenges will be to ensure platform-agnostic GUIs since they would be used on Linux, Mac, and Windows. The students could re-use existing open-source visualization tools and extend them.

(With Prof Emily Carter, Dean of Engineering)

 

============================= OLDER ENTRIES ========================================

Entries from 2016


Entries from 2015 and earlier

Princeton public transportation engine, John Logan (Library), jlogan@princeton.edu

We need a Princeton public transportation engine, whereby you could enter your location and your destination, and then get coherent information about buses, trains, etc.? TigerTransit and other services cover the ground, so to speak, but figuring out how to get there from here reminds me of the old traveling salesman problem. And it might turn out that some of the schedules could and should be modified to make connections quicker than they are now (Friend Center to Frist to PPPL requires, or at least required last time I tried to parse the schedules, quite a wait at Frist).


OIT infrastructure and maintenance reporting systems, Jay Dominick, Vice President for Information Technology, OIT (jdominick@princeton.edu)

1. A system to report maintenance related issues on campus at time of observation. A quick example would be that if you see a lightbulb out, you would snap a photo of it with your phone, add some location information, put the info in a database and then alert a technician. This could also used to report printer malfunctions, cluster problems, etc.

2. A recommender system to help students navigate the information and service infrastructure of campus. This gets at the problem that "There are so many resources here for students, and the organization of the information is so poor, that students don't know what they could be doing." This is a particular problem for OIT in that we offer lots of tools and technologies for students but can't seem to communicate particularly well about them. However, some students do manage to figure it out and their expertise could be very useful to others. How do you capture what the experts know about campus and make it available (in a time-relevant manner) to other students?


College wise calendar, Patrick Caddeau, Dean of Forbes College (caddeau@Princeton.edu)

The idea is to help students to sync up how they spend their time with their academic goals, important deadlines, and milestones in the progress from freshman to senior, and beyond. Many students struggle with how to wisely allocate their time and find management of their time to be a major source of stress. A college wise calendar would provide a map that connects how students spend time with accomplishing major goals. It would have three main features:

(1) automatically populate with all significant university deadlines (add/drop, pdf, mid-terms, dean's date, deadlines for declaring a major, JP, etc). For some of these events, there could be an estimation of how far in advance you need to prepare for the event so you can see a bar indicating when you should begin planning -- for example 72 hours before add/drop so you have time to schedule a meeting with your professor or adviser to get an update on your status in the course, a month prior for JP deadline to make sure you have a working draft, etc.). User could add, sync, or import additional events from other calendars. If a student selects a particular major the calendar could populate with a list of departmental requirements that could be dropped into the calendar in the appropriate term -- using features of ICE perhaps? Courses that have prerequisites would prompt users for those courses when they are dropped into the calendar.

(2) a feature that ranks or tags calendar events with relative importance to you and what type of goal it is connected with -- for example: thesis would be tied to the "graduate from Princeton" goal so it would be ranked high, while attending TH night arch sing could be given relatively low importance and tied to "relaxation".

(3) zoom feature allowing users to see a week, month, term, year, or all four years at Princeton in a single view. Events that are ranked high in importance (for example finish thesis with a 6 month block of time) would be visible event from the highest level while events ranked lower would only be visible when viewed at a higher resolution. This would help students to think about how they spend their time as it relates to their goals by seeing long term goals and deadlines from different perspectives.


De-duping RECAP, Marvin Bielawski, Deputy University Librarian (marvinb@Princeton.edu)

Princeton, the New York public library, and Columbia run a joint off-site storage facility on the Forrestal campus named "RECAP." One of our longterm dreams is to do something called "de-duping," meaning "de-duplicating," meaning storing only 1 copy of a particular volume rather than 2 or 3 (one from Princeton, one from NYPL, and one from Columbia). There are many obstacles to this dream, some of which are legal (e.g., if there's one copy, who owns it?).

But one of the obstacles is technical. To de-dupe, we would have to identify the duplicates (preferably before they entered RECAP). This can be tricky for at least two reasons. One is that, given the purposes of the research libraries, it will matter (at least sometimes) whether the duplicates are exact or not: the 2nd edition and the 4th edition are not perfect substitutes for one another. The second is that Princeton, NYPL, and Columbia all run differently configured online catalogs, so it becomes a clunky, manual process to compare records.

This should be solvable by a Kayak-like program: if one app can search a bunch of airline websites, why not an app that combs multiple library databases?