Jing Peng scite author profile

A novel variant of the familiar backpropagation-through-time approach to training recurrent networks is described. This algorithm is intended to be used on arbitrary recurrent networks that run continually without ever being reset to an initial state, and it is specifically designed for computationally efficient computer implementation. This algorithm can be viewed as a cross between epochwise backpropagation through time, which is not appropriate for continually running networks, and the widely used on-line gradient approximation technique of truncated backpropagation through time.

show abstract

Locally adaptive metric nearest-neighbor classification

Domeniconi

Peng

Gunopulos

2002

IEEE Trans. Pattern Anal. Machine Intell.

274

144

View full text Add to dashboard Cite

Nearest neighbor classi cation assumes locally constant class conditional probabilities. This assumption becomes invalid in high dimensions with nite samples due to the curse of dimensionality. Severe bias can be introduced under these conditions when using the nearest neighbor rule. We propose a locally adaptive nearest neighbor classi cation method to try to minimize bias. We use a Chisquared distance analysis to compute a exible metric for producing neighborhoods that are highly adaptive to query locations. Neighborhoods are elongated along less relevant feature dimensions and constricted along most in uential ones. As a result, the class conditional probabilities tend to be smoother in the modi ed neighborhoods, whereby better classi cation performance can be achieved. The e cacy of our method is validated and compared against other techniques using a variety of simulated and real world data.

show abstract

Function Optimization using Connectionist Reinforcement Learning Algorithms

Williams¹,

Peng²

1991

Connection Science

222

127

View full text Add to dashboard Cite

Any non-associative reinforcement learning algorithm can be viewed as a method for performing function optimization through (possibly noise-corrupted) sampling of function values. We describe the results of simulations in which the optima of several deterministic functions studied by Ackley were sought using variants of REINFORCE algorithms. Some of the algorithms used here incorporated additional heuristic features resembling certain aspects of some of the algorithms used in Ackley's studies. Differing levels ofperformance were achieved by the various algorithms investigated, but a number of them performed at a level comparable to the best found in Ackley's studies on a number of the tasks, in spite of their simplicity. One of these variants, called REINFORCE/MENT, represents a novel but principled approach to reinforcement learning in nontrivial networks which incorporates an entropy maximization strategy. This was found to perform especially well on more hierarchically organized tasks.

show abstract

Incremental Multi-Step Q-Learning

1994

View full text Add to dashboard Cite

Probabilistic Feature Relevance Learning for Content-Based Image Retrieval

Peng

Bhanu

Qing

1999

Computer Vision and Image Understanding

117

113

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Jing Peng

An Efficient Gradient-Based Algorithm for On-Line Training of Recurrent Network Trajectories

Locally adaptive metric nearest-neighbor classification

Function Optimization using Connectionist Reinforcement Learning Algorithms

Incremental Multi-Step Q-Learning

Probabilistic Feature Relevance Learning for Content-Based Image Retrieval

Contact Info

Product

Resources

About