Mobile and embedded platforms are increasingly required to execute computationally demanding DNNs efficiently across heterogeneous processing elements. At runtime, the hardware resources available to a DNN can vary considerably because of other concurrently running applications, and the performance requirements of an application can also change across scenarios. To achieve the desired performance, dynamic DNNs have been proposed, in which the number of channels/layers can be scaled in real time to meet different requirements under varying resource constraints. However, training such dynamic DNNs can be costly, since a platform-aware model must be retrained for each deployment scenario to become dynamic. This paper proposes Dynamic-OFA, a novel dynamic DNN approach for state-of-the-art platform-aware NAS models (i.e., the Once-for-All (OFA) network). Dynamic-OFA pre-samples a family of sub-networks from a static OFA backbone and uses a runtime manager to switch between these sub-networks as the runtime environment changes. As such, Dynamic-OFA does not need the traditional dynamic DNN training pipeline. Compared to the state-of-the-art, our experimental results using ImageNet on a Jetson Xavier NX show that the approach is up to 3.5x (CPU) and 2.4x (GPU) faster at similar Top-1 accuracy, or 3.8% (CPU) and 5.1% (GPU) more accurate at similar latency.
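A minimal sketch of the selection idea described above, assuming each pre-sampled sub-network has already been profiled for latency and accuracy on the target device; the class and method names (SubNetConfig, RuntimeManager, select) are hypothetical illustrations, not the authors' implementation.

```python
# Sketch: a runtime manager picks the most accurate pre-sampled sub-network
# that fits the current latency budget, with no retraining at runtime.
# All names here are illustrative assumptions, not the Dynamic-OFA code.
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class SubNetConfig:
    """One pre-sampled sub-network: architecture encoding plus
    profiled latency (ms) and Top-1 accuracy on the target device."""
    arch: dict          # e.g. kernel sizes, depths, widths per stage
    latency_ms: float   # measured on the deployment platform (CPU or GPU)
    top1_acc: float     # validation accuracy of this sub-network


class RuntimeManager:
    """Chooses among pre-sampled sub-networks as conditions change."""

    def __init__(self, subnets: List[SubNetConfig]):
        # Sort by accuracy so the first feasible candidate is the best one.
        self.subnets = sorted(subnets, key=lambda s: s.top1_acc, reverse=True)

    def select(self, latency_budget_ms: float) -> Optional[SubNetConfig]:
        """Return the most accurate sub-network meeting the budget."""
        for cfg in self.subnets:
            if cfg.latency_ms <= latency_budget_ms:
                return cfg
        return None  # no pre-sampled sub-network satisfies the constraint


# Example: a tighter budget when other applications contend for the GPU.
manager = RuntimeManager([
    SubNetConfig(arch={"depth": 4}, latency_ms=12.0, top1_acc=0.79),
    SubNetConfig(arch={"depth": 3}, latency_ms=8.0, top1_acc=0.76),
    SubNetConfig(arch={"depth": 2}, latency_ms=5.0, top1_acc=0.72),
])
print(manager.select(latency_budget_ms=9.0))  # picks the 8 ms sub-network
```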
A number of approaches have been proposed for optimizing nonconvex objectives (e.g. deep learning models), such as batch gradient descent, stochastic gradient descent and stochastic variance-reduced gradient descent. Theory shows that these methods converge when an unbiased gradient estimator is used. In practice, however, a biased gradient estimator can reach the vicinity of an optimum more efficiently, since unbiased estimation is computationally more expensive. Fast convergence therefore involves two trade-offs: between stochastic and batch estimation, and between biased and unbiased estimation. This paper proposes an integrated approach that controls the stochastic element of the optimizer and balances the estimator between biased and unbiased by means of a single hyperparameter. It is shown theoretically and experimentally that this hyperparameter can be configured to provide an effective balance that improves the convergence rate.
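A hedged sketch of how such a biased/unbiased trade-off can be exposed through one hyperparameter: blending a fresh minibatch gradient (unbiased but noisy) with a stale full-batch gradient (cheap between refreshes, but biased). This is an illustrative toy on a least-squares objective, not the paper's algorithm; all names and constants are assumptions.

```python
# Toy illustration: alpha = 1.0 recovers plain SGD (unbiased); alpha < 1.0
# injects bias toward an occasionally refreshed full-batch gradient but
# reduces variance. Not the paper's method; purely a sketch of the trade-off.
import numpy as np

rng = np.random.default_rng(0)
A = rng.normal(size=(1000, 20))
b = A @ rng.normal(size=20) + 0.1 * rng.normal(size=1000)

def full_grad(x):
    return A.T @ (A @ x - b) / len(b)

def minibatch_grad(x, batch=32):
    idx = rng.choice(len(b), size=batch, replace=False)
    return A[idx].T @ (A[idx] @ x - b[idx]) / batch

def blended_descent(alpha, steps=500, lr=0.05, refresh_every=100):
    x = np.zeros(20)
    stale = full_grad(x)                  # expensive, refreshed only rarely
    for t in range(steps):
        if t % refresh_every == 0:
            stale = full_grad(x)
        g = alpha * minibatch_grad(x) + (1 - alpha) * stale
        x -= lr * g
    return np.linalg.norm(full_grad(x))   # gradient norm at the end

for alpha in (1.0, 0.5, 0.1):
    print(f"alpha={alpha}: final gradient norm {blended_descent(alpha):.4f}")
```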
Recently, there has been growing interest in improving the efficiency and accuracy of Indoor Positioning Systems (IPS). The Received Signal Strength (RSS)-based fingerprinting technique is essential for indoor localization, but estimating indoor position from RSS measurements is challenging in complex indoor environments. This paper evaluates three machine learning approaches and Gaussian Process (GP) regression with three different kernels to identify the best indoor positioning model. Hyperparameter tuning is used to select the optimal parameter set for each model. Experiments are carried out with RSS data from seven access points (APs). Results show that GP regression with a rational quadratic kernel and the eXtreme gradient tree boosting model achieve the best positioning accuracy among the evaluated models; moreover, the eXtreme gradient tree boosting model maintains high positioning accuracy with smaller training sets and fewer access points.
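An illustrative sketch, assuming synthetic RSS fingerprints generated from a log-distance path-loss model, of fitting a GP with a rational quadratic kernel and an XGBoost regressor and comparing their mean positioning error; the data generation, helper names and hyperparameter values are assumptions, not the paper's dataset or tuned settings.

```python
# Sketch: seven simulated access points, RSS fingerprints, and two of the
# evaluated model families (GP with RationalQuadratic kernel, XGBoost).
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RationalQuadratic
from sklearn.model_selection import train_test_split
from sklearn.multioutput import MultiOutputRegressor
from xgboost import XGBRegressor

rng = np.random.default_rng(42)
n_points, n_aps = 600, 7
positions = rng.uniform(0, 20, size=(n_points, 2))   # (x, y) in metres
ap_locs = rng.uniform(0, 20, size=(n_aps, 2))         # AP coordinates

# Simple log-distance path-loss model with noise to mimic RSS readings.
dists = np.linalg.norm(positions[:, None, :] - ap_locs[None, :, :], axis=-1)
rss = -30 - 20 * np.log10(dists + 1.0) + rng.normal(0, 2, size=dists.shape)

X_tr, X_te, y_tr, y_te = train_test_split(rss, positions, random_state=0)

models = {
    "GP (RationalQuadratic)": GaussianProcessRegressor(
        kernel=RationalQuadratic(), normalize_y=True),
    "XGBoost": MultiOutputRegressor(
        XGBRegressor(n_estimators=300, max_depth=4, learning_rate=0.1)),
}

for name, model in models.items():
    model.fit(X_tr, y_tr)
    err = np.linalg.norm(model.predict(X_te) - y_te, axis=1)
    print(f"{name}: mean positioning error {err.mean():.2f} m")
```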