In this paper, we describe a new reranking strategy named word lattice reranking, for the task of joint Chinese word segmentation and part-of-speech (POS) tagging. As a derivation of the forest reranking for parsing (Huang, 2008), this strategy reranks on the pruned word lattice, which potentially contains much more candidates while using less storage, compared with the traditional n-best list reranking. With a perceptron classifier trained with local features as the baseline, word lattice reranking performs reranking with non-local features that can't be easily incorporated into the perceptron baseline. Experimental results show that, this strategy achieves improvement on both segmentation and POS tagging, above the perceptron baseline and the n-best list reranking.
The development of robust electrocatalysts for electrocatalytic hydrogenation (ECH) of guaiacol and related lignin model monomers is necessary for the stabilization or upgrading of bio‐oil. Additionally, the efficiency of biomass conversion to bio‐oil products remains below the minimum requirements for its implementation at scale. Herein, a PtNiB/CMK‐3 catalyst with pronounced ECH performance in the conversion of guaiacol and related model lignin monomers to bio‐oil under optimally mild conditions, through a modulation strategy that modified the electronic structure of PtNi via boron alloying, is prepared. Notably, the optimized PtNiB/CMK‐3 exhibited an inspiring high faradaic efficiency of 86.2%, which is significantly higher (13.7 times) than that of the PtNi/CMK‐3 without B‐doping (6.3%). Experimental results and theoretical calculations showed that the B‐doping optimized the PtNiB alloy surface electron structure, simultaneously promoting substrate and intermediate adsorption and the ECH process. In addition, the uniform dispersion of PtNiB nanoparticles embedded within the mesoporous channels of CMK‐3 ensures an enhanced utilization efficiency, leading to improvements in stability and bio‐oil product generation. The lab‐scale ECH experiment of guaiacol also certified the scale‐up potential. This work opens a promising avenue to the rational design of advanced and highly efficient electrocatalysts for biomass upgrading.
Manually annotated corpora are valuable but scarce resources, yet for many annotation tasks such as treebanking and sequence labeling there exist multiple corpora with different and incompatible annotation guidelines or standards. This seems to be a great waste of human efforts, and it would be nice to automatically adapt one annotation standard to another. We present a simple yet effective strategy that transfers knowledge from a differently annotated corpus to the corpus with desired annotation. We test the efficacy of this method in the context of Chinese word segmentation and part-of-speech tagging, where no segmentation and POS tagging standards are widely accepted due to the lack of morphology in Chinese. Experiments show that adaptation from the much larger People's Daily corpus to the smaller but more popular Penn Chinese Treebank results in significant improvements in both segmentation and tagging accuracies (with error reductions of 30.2% and 14%, respectively), which in turn helps improve Chinese parsing accuracy.
The on-site production of ozone via electrochemical water electrolysis has attracted increasing interest because of its security and efficiency. However, the underlying mechanisms of the facet effect and lattice oxygen...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.