We develop a number of data-driven investment strategies that demonstrate how machine learning and data analytics can be used to guide investments in peer-to-peer loans. We detail the process starting with the acquisition of (real) data from a peer-to-peer lending platform all the way to the development and evaluation of investment strategies based on a variety of approaches. We focus heavily on how to apply and evaluate the data science methods, and resulting strategies, in a real-world business setting. The material presented in this article can be used by instructors who teach data science courses, at the undergraduate or graduate levels. Importantly, we go beyond just evaluating predictive performance of models, to assess how well the strategies would actually perform, using real, publicly available data. Our treatment is comprehensive and ranges from qualitative to technical, but is also modular—which gives instructors the flexibility to focus on specific parts of the case, depending on the topics they want to cover. The learning concepts include the following: data cleaning and ingestion, classification/probability estimation modeling, regression modeling, analytical engineering, calibration curves, data leakage, evaluation of model performance, basic portfolio optimization, evaluation of investment strategies, and using Python for data science.
We consider a general two-echelon distribution system consisting of a depot and multiple sales outlets, henceforth referred to as retailers, which face random demands for a given item. The replenishment process consists of two stages: the depot procures the item from an outside supplier, while the retailers' inventories are replenished by shipments from the depot. Both of the replenishment stages are associated with a given facility-specific leadtime. The depot as well as the retailers faces a limited inventory capacity. Inventories are reviewed and orders are placed on a periodic basis. When a retailer runs out of stock, unmet demand is backlogged. We propose a new approach to the above class of dynamic programming models based on Lagrangian relaxation. Every choice of the vector of Lagrange multipliers generates a lower bound via the solution of a single dynamic program (DP) with a one-dimensional state-space. The best such bound is obtained by maximizing over the vector of multipliers. The strategy that is optimal for this (maximal) lower bound DP employs an (s, S) ordering policy (generally, with time-dependent policy parameters). To arrive at an upper bound and an implementable heuristic, this (s, S) policy is paired with one of several possible allocation policies that allocate the system-wide inventory across the different facilities. We report on an extensive numerical study with close to 14 000 instances which evaluates the accuracy of the lower bound and the optimality gap of the various heuristic policies. The study reveals that the lower bound and the heuristic strategy that is constructed on its basis perform exceedingly well, almost across the entire parameter spectrum, including instances where demands are rather volatile or the average cycle time between consecutive orders is relatively large. The exception arises when storage at the depot is as expensive as at the retailer level and the retailers have large storage capacities. KEYWORDSreplenishment process, storage constraints, two-echelon distribution INTRODUCTIONWe consider a general two-echelon distribution system consisting of a depot and multiple sales outlets, henceforth referred to as retailers, which face random demands for a given item. The replenishment process consists of two stages: the depot procures the item from an outside supplier, while the retailers' inventories are replenished by shipments from the depot. Both of the replenishment stages are associated with a given facility-specific leadtime. The depot as well as the retailers faces a limited inventory capacity. Inventories are reviewed and orders are placed on a periodic basis. When a retailer runs out of stock, unmet demand is backlogged.In such systems, the challenge is to find an optimal trade-off among the following four cost components: (1) costs associated with the orders placed with the external supplier, typically reflecting economies of scale, (2) shipment costs for transfers from the depot to the retailers, (3) carrying costs associated with the depot's and retailers...
No abstract
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.