诺库维普

博文共计182篇

基本信息

文献作者	Weike Pan, Evan W. Xiang, Nathan N. Liu and Qiang Yang
文献发表年限		2010	创建时间	2018-07-12
文献关键字	cross domain; learn and transfer the latent vectors; principled matrix; heterogeneous； intrinsic preference structure; Y_{u:}行,Y_{:i}列，y_{ui}实体; 将ratings 处理成implicit feedback； coordinate systems
摘要描述	Data sparsity is a major problem for collaborative filtering (CF) techniques in recommender systems, especially for new users and items. We observe that, while our target data are sparse for CF systems, related and relatively dense auxiliary data may already exist in some other more mature application domains. In this paper, we address the data sparsity problem in a target domain by transferring knowledge about both users and items from auxiliary data sources. We observe that in different domains the user feedbacks are often heterogeneous such as ratings vs. clicks. Our solution is to integrate both user and item knowledge in auxiliary data sources through a principled matrix-based transfer learning framework that takes into account the data heterogeneity. In particular, we discover the principle coordinates of both users and items in the auxiliary data matrices, and transfer them to the target domain in order to reduce the effect of data sparsity. We describe our method, which is known as coordinate system transfer or CST, and demonstrate its effectiveness in alleviating the data sparsity problem in collaborative filtering. We show that our proposed method can significantly outperform several state-of-the-art solutions for this problem.

2. Semi-Supervised Nonnegative Matrix Factorization

3颗星

查看详情

基本信息

文献作者	Hyekyoung Lee, Jiho Yoo, and Seungjin Choi
文献发表年限		2010	创建时间	2018-07-11
文献关键字	semi-supervised NMF；分解方法求解NMF
摘要描述	Nonnegative matrix factorization (NMF) is a popular method for low-rank approximation of nonnegative matrix, pro- viding a useful tool for representation learning that is valuable for clustering and classification. When a portion of data are la- beled, the performance of clustering or classification is improved if the information on class labels is incorporated into NMF. To this end, we present semi-supervised NMF (SSNMF), where we jointly incorporate the data matrix and the (partial) class label ma- trix into NMF. We develop multiplicative updates for SSNMF to minimize a sum of weighted residuals, each of which involves the nonnegative 2-factor decomposition of the data matrix or the label matrix, sharing a common factor matrix. Experiments on docu- ment datasets and EEG datasets in BCI competition confirm that our method improves clustering as well as classification perfor- mance, compared to the standard NMF, stressing that semi-super- vised NMF yields semi-supervised feature extraction.

3. Matching Users and Items Across Domains to Improve the Recommendation Quality

4颗星

查看详情

基本信息

文献作者	Chung-Yi Li
文献发表年限		2014	创建时间	2018-07-11
文献关键字	Collaborative Filtering; Transfer Learning; Matrix Factorization; cross domain； constraint;奇异值分解；CBT；RMGM
摘要描述	Given two homogeneous rating matrices with some overlapped users/items whose mappings are unknown, this paper aims at an- swering two questions. First, can we identify the unknown map- ping between the users and/or items? Second, can we further utilize the identified mappings to improve the quality of recommendation in either domain? Our solution integrates a latent space match- ing procedure and a refining process based on the optimization of prediction to identify the matching. Then, we further design a transfer-based method to improve the recommendation perfor- mance. Using both synthetic and real data, we have done extensive experiments given different real life scenarios to verify the effec- tiveness of our models. The code and other materials are available at http://www.csie.ntu.edu.tw/~r00922051/matching/

4. Can Movies and Books Collaborate? Cross-Domain Collaborative Filtering for Sparsity Reduction

4颗星

查看详情

基本信息

文献作者	Bin Li；Qiang Yang; Xiangyang Xue
文献发表年限		2009	创建时间	2018-07-11
文献关键字	cross-domain; transfer；codebook；CBT
摘要描述	The sparsity problem in collaborative filtering (CF) is a major bottleneck for most CF methods. In this paper, we consider a novel approach for alleviat- ing the sparsity problem in CF by transferring user- item rating patterns from a dense auxiliary rating matrix in other domains (e.g., a popular movie rat- ing website) to a sparse rating matrix in a target domain (e.g., a new book rating website). We do not require that the users and items in the two do- mains be identical or even overlap. Based on the limited ratings in the target matrix, we establish a bridge between the two rating matrices at a cluster- level of user-item rating patterns in order to transfer more useful knowledge from the auxiliary task do- main. We first compress the ratings in the auxiliary rating matrix into an informative and yet compact cluster-level rating pattern representation referred to as a codebook. Then, we propose an efficient al- gorithm for reconstructing the target rating matrix by expanding the codebook. We perform extensive empirical tests to show that our method is effective in addressing the data sparsity problem by transfer- ring the useful knowledge from the auxiliary tasks, as compared to many state-of-the-art CF methods.

5. Little Is Much: Bridging Cross-Platform Behaviors through Overlapped Crowds

4颗星

查看详情

基本信息

文献作者	Meng Jiang, Peng Cui
文献发表年限		2016	创建时间	2018-07-11
文献关键字	cross-domain; overlapped; transfer；semi- supervised NMF
摘要描述	People often use multiple platforms to fulfill their different information needs. With the ultimate goal of serving people intelligently, a fundamental way is to get comprehensive understanding about user needs. How to organically integrate and bridge cross-platform information in a human-centric way is important. Existing transfer learning assumes either fully- overlapped or non-overlapped among the users. However, the real case is the users of different platforms are partially over- lapped. The number of overlapped users is often small and the explicitly known overlapped users is even less due to the lacking of unified ID for a user across different platforms. In this paper, we propose a novel semi-supervised transfer learning method to address the problem of cross-platform behavior prediction, called XPT RANS . To alleviate the sparsity issue, it fully exploits the small number of overlapped crowds to optimally bridge a user’s behaviors in different platforms. Extensive experiments across two real social networks show that XPT RANS significantly outperforms the state-of-the-art. We demonstrate that by fully exploiting 26% overlapped users, XPT RANS can predict the behaviors of non-overlapped users with the same accuracy as overlapped users, which means the small overlapped crowds can successfully bridge the information across different platforms.

6. Projected Gradient Methods for Nonnegative Matrix Factorization

4颗星

查看详情

基本信息

文献作者	Chih-Jen Lin
文献发表年限		2007	创建时间	2018-07-04
文献关键字	非负矩阵求法；NMF解释; VQ (vector quantization->K-means)
摘要描述	Nonnegative matrix factorization (NMF) can be formulated as a mini- mization problem with bound constraints. Although bound-constrained optimization has been studied extensively in both theory and practice, so far no study has formally applied its techniques to NMF. In this letter, we propose two projected gradient methods for NMF, both of which exhibit strong optimization properties. We discuss efficient implementations and demonstrate that one of the proposed methods converges faster than the popular multiplicative update approach. A simple Matlab code is also provided.

7. Orthogonal Nonnegative Matrix Tri-factorizations for Clustering

4颗星

查看详情

基本信息

文献作者	Chris Ding
文献发表年限		2006	创建时间	2018-06-30
文献关键字	3-factor NMF; semi-NMF; 带有正交约束的NMF求解方法；证明NMF等同于k-means； Nonnegative, non-negative; trace, 矩阵的迹trace；矩阵分解自由度； the degree of freedom
摘要描述	Currently, most research on nonnegative matrix factorization (NMF) focus on 2-factor X = FG T factorization. We provide a systematic analysis of 3-factor X = FSG T NMF. While unconstrained 3-factor NMF is equivalent to unconstrained 2-factor NMF, constrained 3- factor NMF brings new features to constrained 2-factor NMF. We study the orthogonality constraint because it leads to rigorous clus- tering interpretation. We provide new rules for updating F, S, G and prove the convergence of these algorithms. Experiments on 5 datasets and a real world case study are performed to show the capability of bi-orthogonal 3-factor NMF on simultaneously clus- tering rows and columns of the input data matrix. We provide a new approach of evaluating the quality of clustering on words us- ing class aggregate distribution and multi-peak distribution. We also provide an overview of various NMF extensions and examine their relationships.

8. Non-negative Matrix Factorization with Sparseness Constraints

5颗星

查看详情

基本信息

文献作者	Patrik O. Hoyer
文献发表年限		2004	创建时间	2018-06-28
文献关键字	non-negative matrix factorization, sparseness, data-adaptive representations；稀疏矩阵分解；
摘要描述	Non-negative matrix factorization (NMF) is a recently developed technique for finding parts-based, linear representations of non-negative data. Although it has successfully been applied in several applications, it does not always result in parts-based representations. In this paper, we show how explicitly incorporating the notion of ‘sparseness’ improves the found decompositions. Additionally, we provide complete MATLAB code both for standard NMF and for our extension. Our hope is that this will further the application of these methods to solving novel data-analysis problems.

9. Transferable Contextual Bandit for Cross-Domain Recommendation

4颗星

查看详情

基本信息

文献作者	Bo Liu, Ying Wei, Yu Zhang, Zhixian Yan, Qiang Yang
文献发表年限		2018	创建时间	2018-06-27
文献关键字	bandit policy; 线性；linear；exploitation; exploration
摘要描述	Traditional recommendation systems (RecSys) suffer from two problems: the exploitation-exploration dilemma and the cold-start problem. One solution to solving the exploitation- exploration dilemma is the contextual bandit policy, which adaptively exploits and explores user interests. As a result, the contextual bandit policy achieves increased rewards in the long run. The contextual bandit policy, however, may cause the system to explore more than needed in the cold-start situations, which can lead to worse short-term rewards. Cross-domain RecSys methods adopt transfer learning to leverage prior knowledge in a source RecSys domain to jump start the cold-start target RecSys. To solve the two problems together, in this paper, we propose the first applicable transferable contextual bandit (TCB) policy for the cross-domain recommendation. TCB not only benefits the exploitation but also accelerates the exploration in the target RecSys. TCB’s exploration, in turn, helps to learn how to transfer between different domains. TCB is a general algorithm for both homogeneous and heterogeneous domains. We perform both theoretical regret analysis and empirical experiments. The empirical results show that TCB outperforms the state-of-the-art algorithms over time.

10. Controlling Popularity Bias in Learning-to-Rank Recommendation

2颗星

查看详情

基本信息

文献作者	Himan Abdollahpouri; Robin Burke; Bamshad Mobasher
文献发表年限		2017	创建时间	2018-06-19
文献关键字	Recommender systems; long-tail; Recommendation evaluation; Coverage; Learning to rank; 长尾；diversity；多样性
摘要描述	Many recommendation algorithms suffer from popularity bias in their output: popular items are recommended frequently and less popular ones rarely, if at all. However, less popular, long-tail items are precisely those that are often desirable recommendations. In this paper, we introduce a flexible regularization-based framework to enhance the long-tail coverage of recommendation lists in a learning-to-rank algorithm. We show that regularization provides a tunable mechanism for controlling the trade-off between accuracy and coverage. Moreover, the experimental results using two data sets show that it is possible to improve coverage of long tail items without substantial loss of ranking performance.

博主信息

Nengjun 内部成员

搜索该博主文章

博文 共计182篇

基本信息

基本信息

基本信息

基本信息

基本信息

基本信息

基本信息

基本信息

基本信息

基本信息

博文共计182篇