日拱一卒

置顶 | 发表于 2021-06-26 | 分类于 PaperNote

平时的阅读列表，一般会附上下载链接和关键词。日拱一卒，与诸君共勉。 2025 M6 [123] M5 [Advertising, Bidding, Promoation, Generative, Alibaba, WWW2026] GAM: A Generative Auto-Marketing Framework in Online E-commerce Platforms M4 [ ...

阅读全文 »

2026年AI发展随笔

发表于 2026-05-05 | 分类于 Daily

最近几周密集关注学习了一下最近几年的LLM发展脉络，有些闲言碎语般的感悟在这里mark一下：谈谈算法工程师。技术门槛已经被磨平了。这也是最近很长一段时间没有写读paper博客的主要原因，如果你足够“智慧”，就不应该再从类似博客的地方来汲取知识，大模型可以给你everything。在个人很多场景下，通过跟高质量的模型多轮交互，都可以获得比较好的实践方案，这件事对于我还是非常震撼的，过去若干 ...

阅读全文 »

年度反思之2024我在淘宝做内容

发表于 2025-02-22 | 分类于 Dairy

本文写于2025年2月，一方面是最近读到了若干篇Google在短视频推荐方向的文章，另一方面是年前和某前同事聊新工作在做的事情，感觉思路更开阔了，顺便把以前做的事情再拿出来“鞭尸”一下。全文中的”内容“仅指各类短视频内容，暂不讨论直播。本文主要内容如下：近期Google的几篇短视频相关文章，都讲了什么，有哪些令我印象深刻的巧思之前在淘宝做内容的时候，做过哪些类似的事情，和Google这 ...

阅读全文 »

LiRank: Industrial Large Scale Ranking Models at LinkedIn

发表于 2024-09-08 | 分类于 PaperNote

TL;DR 本文是LinkedIn的模型团队模型迭代“年终汇报”，包含LinkedIn团队对于各类模型涨点的技巧的实践。摘要 We present LiRank, a large-scale ranking framework at LinkedIn that brings to production state-of-the-art modeling architectures and op ...

阅读全文 »

3 Years in Hangzhou

发表于 2024-05-25 | 分类于 Dairy

一转眼三年，零零散散还是要写一点，大部分摘自2023的复盘。关于「工作」入职那天阿里巴巴的股价首先，感谢在阿里遇到的所有人，重点感谢师兄子璟，感谢这些人让我从一个技术人成长为一个相对合格的职场人。工作只是”工作“ 职场中，应当抛开”自我“的属性，只有职场人的属性，大家都是来完成工作的，并没有针对个人的或好或坏的情感。举个简单例子，以前我在上层老板比较多的会议上讲事情会非常紧张，担 ...

阅读全文 »

An Empirical Study of Selection Bias in Pinterest Ads Retrieval

发表于 2023-08-20 | 分类于 PaperNote

摘要 Data selection bias has been a long-lasting challenge in the machine learning domain, especially in multi-stage recommendation systems, where the distribution of labeled items for model training i ...

阅读全文 »

Streaming CTR Prediction: Rethinking Recommendation Task for Real-World Streaming Data

发表于 2023-08-06 | 分类于 PaperNote

摘要 The Click-Through Rate (CTR) prediction task is critical in industrial recom- mender systems, where models are usually deployed on dynamic streaming data in practical applications. Such streaming ...

阅读全文 »

Fresh Content Needs More Attention- Multi-funnel Fresh Content Recommendation

发表于 2023-07-02 | 分类于 PaperNote

摘要 Recommendation system serves as a conduit connecting users to an incredibly large, diverse and ever growing collection of contents. In practice, missing information on fresh (and tail) contents ne ...

阅读全文 »

Attention is all you need

发表于 2023-07-01 | 分类于 PaperNote

摘要 The dominant sequence transduction models are based on complex recurrent or convolutional neural networks that include an encoder and a decoder. The best performing models also connect the encoder ...

阅读全文 »

On the Factory Floor: ML Engineering for Industrial-Scale Ads Recommendation Models

发表于 2022-10-07 | 分类于 PaperNote

写在最前面，本来是想简单摘抄一下这篇文章中的精华，写到一半觉得这篇文章不应如此，本文应该是一篇可以比肩Wide&Deep的文章。如果说Wide&Deep告诉业界推荐就是要搞Embedding，E2E，那么本文可能就是告诉大家CTR模型就是要搞Online Learning，搞ODL，以及围绕这些技术Google广告团队他们都在思考什么样问题。 N年前写本科毕设时，我有幸选中了Wi ...

阅读全文 »