Yunsheng Song scite author profile

The fossil leaves and associated infructescences from Maoming probably belong to the same plant. The occurrence of fossil leaves similar to those of extant species previously considered within Semiliquidambar and Liquidambar with the associated infructescences close to those of Altingia provide paleobotanical evidence that justifies combining the genera Liquidambar, Altingia, and Semiliquidambar into the single genus Liquidambar as recently proposed based on molecular markers.

show abstract

A Large-Scale $k$ -Nearest Neighbor Classification Algorithm Based on Neighbor Relationship Preservation

Song

Kong

Zhang

2022

Wireless Communications and Mobile Computing

View full text Add to dashboard Cite

Owing to the absence of hypotheses of the underlying distributions of the data and the strong generation ability, the k -nearest neighbor (kNN) classification algorithm is widely used to face recognition, text classification, emotional analysis, and other fields. However, kNN needs to compute the similarity between the unlabeled instance and all the training instances during the prediction process; it is difficult to deal with large-scale data. To overcome this difficulty, an increasing number of acceleration algorithms based on data partition are proposed. However, they lack theoretical analysis about the effect of data partition on classification performance. This paper has made a theoretical analysis of the effect using empirical risk minimization and proposed a large-scale k -nearest neighbor classification algorithm based on neighbor relationship preservation. The process of searching the nearest neighbors is converted to a constrained optimization problem. Then, it gives the estimation of the difference on the objective function value under the optimal solution with data partition and without data partition. According to the obtained estimation, minimizing the similarity of the instances in the different divided subsets can largely reduce the effect of data partition. The minibatch k -means clustering algorithm is chosen to perform data partition for its effectiveness and efficiency. Finally, the nearest neighbors of the test instance are continuously searched from the set generated by successively merging the candidate subsets until they do not change anymore, where the candidate subsets are selected based on the similarity between the test instance and cluster centers. Experiment results on public datasets show that the proposed algorithm can largely keep the same nearest neighbors and no significant difference in classification accuracy as the original kNN classification algorithm and better results than two state-of-the-art algorithms.

show abstract

An accelerator for support vector machines based on the local geometrical information and data partition

Song

Liang

Wang

2018

Int. J. Mach. Learn. & Cyber.

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yunsheng Song

An efficient instance selection algorithm for k nearest neighbor regression

Liquidambar maomingensis sp. nov. (Altingiaceae) from the late Eocene of South China

A Large-Scale $k$ -Nearest Neighbor Classification Algorithm Based on Neighbor Relationship Preservation

An accelerator for support vector machines based on the local geometrical information and data partition

Contact Info

Product

Resources

About

Yunsheng Song

An efficient instance selection algorithm for k nearest neighbor regression

Liquidambar maomingensis sp. nov. (Altingiaceae) from the late Eocene of South China

A Large-Scale k -Nearest Neighbor Classification Algorithm Based on Neighbor Relationship Preservation

An accelerator for support vector machines based on the local geometrical information and data partition

Contact Info

Product

Resources

About

A Large-Scale $k$ -Nearest Neighbor Classification Algorithm Based on Neighbor Relationship Preservation