Hello, my name is Kevin Arvai, posting a review request for Open Impute, which is applying for project approval on Open Humans. This project adds a data source by performing genome imputation on members’ existing genomic data sources.
The genome imputation pipeline runs on a server hosted by Open Humans, but I have access to it. When the pipeline finishes and the new data source has been uploaded, all of the member’s data is deleted from the server programmatically.
Should this project be visible and available for all Open Humans members to join?
Description: The goal of Imputer is to provide users with a more comprehensive picture of their genome. Direct to consumer genetics companies, like 23andMe, only genotype a small fraction of the genome. Researchers are finding new genetic locations associated with traits and diseases at a rapid pace. Users might be interested in knowing their genotype status for these new associations, but the locations may be in regions that direct to consumer tests are not genotyping. Imputer leverages the vast amount of genotype data made available by 1000 genomes project to provide Open Humans users with genotype estimates at additional locations in their genome.
I want to be impartial though and apply the standards I tend to apply to projects… please forgive me in advance!
Information about the project
who manages it? I don’t think @arvkevi’s name is anywhere on the site itself, although it’s on the activity page in openhumans.org
what will it do? The about page does a great job explaining imputation!
what is in the data it has access to? arguably redundant with permissions in Open Humans, but maybe nice to have a brief statement like “will request access to your genetic data in Open Humans (e.g. 23andMe)”.
security, privacy, and data management? – you don’t need to do overkill here, but I think it’s currently not addressed? it’s important to explain some general info about data management & sharing. You might want to create a brief “terms of use” page, I have a brief “terms” page for Genevieve. (Note that a terms page can also protect someone running a site with a “disclaimers” section! )
Summary info on the front page – while an about page is probably appropriate for a complete description, most people don’t bother looking there. I think the website’s front page should make sure to hit highlights of info mandated by project guidelines (but not too wordy) & direct someone to an “about” page to read details. In particular, “who is running it”, and “what it will do”. (e.g. I think Bastian’s Rescuetime project does a nice job with brief info highlights.)
name consistency? In Open Humans, it’s Open Impute. The logo/badge/icon says Imputer. The URL is openimpute. The website says " Genotype Imputation for Open Humans" and “the Open Humans genotype imputation app”. (Note - the last one might lead someone to think Open Humans staff manage the project directly? users could get confused and report bugs to the wrong people, etc.) I like “Open Impute” or “Imputer” – both great! but maybe more consistency, some people are easily confused
Hopefully this is all easy stuff, it’s just wording… feel free to ask for suggestions if you’re having writer’s block