Gene and protein expressions are key determinants of cellular function. Neurons are the building blocks of brain circuits, yet the relationship between their molecular identity and the spatial distribution of their dendritic inputs and axonal outputs remain incompletely understood. The open-source knowledge base Hippocampome.org amasses such transcriptomic data from the scientific literature for morphologically defined neuron types in the rodent hippocampal formation: dentate gyrus (DG), CA3, CA2, CA1, subiculum (SUB), and entorhinal cortex (EC).Positive, negative, or mixed expression reports were initially obtained from published articles directly connecting molecular evidence to neurons with known axonal and dendritic patterns across hippocampal layers. Here, we supplement this information by collating, formalizing, and leveraging relational expression inferences (REIs) that link a gene or protein expression or lack thereof to that of another molecule or to an anatomical location. With these additional interpretations, we freely release online a comprehensive human-and machine-readable molecular profile for the more than 100 neuron types in Hippocampome.org. Analysis of these data ascertains the ability to distinguish unequivocally most neuron types in each of the major subdivisions of the hippocampus based on currently known biochemical markers. Moreover, grouping neuron types by expression similarity reveals eight super-families characterized by a few defining molecules.
Significance StatementThe molecular composition of cells underlies their structure, activity, and function. Neurons are arguably the most diverse cell types with their characteristic tree-like shapes mediating synaptic communication throughout the brain. Biochemical marker data are available online for hundreds of morphologically identified neuron types in the mammalian hippocampus, including expression of calcium-binding proteins, receptors, and enzymes. Here, we augment this evidence by systematically applying logical rules empirically derived from the published literature (e.g."presence of molecule X implies lack of molecule Y"). The resulting substantially expanded expression profiles provide nearly unique molecular identities for most known hippocampal 3 neuron types while revealing previously unrecognized genetic similarities across anatomical regions and morphological phenotypes.