佛山智唯网站建设/免费推广引流软件

佛山智唯网站建设,免费推广引流软件,网站关键词搜索如何排名靠前,建设行业网上报名官网网址在上一篇文章中,我们深入探讨了 Soft Actor-Critic (SAC) 算法及其在平衡探索与利用方面的优势。本文将介绍强化学习领域的重要里程碑——Asynchronous Advantage Actor-Critic (A3C) 算法,并展示如何利用 PyTorch 实现并行化训练来加速学习过程。 一、A…

在上一篇文章中,我们深入探讨了 Soft Actor-Critic (SAC) 算法及其在平衡探索与利用方面的优势。本文将介绍强化学习领域的重要里程碑——Asynchronous Advantage Actor-Critic (A3C) 算法,并展示如何利用 PyTorch 实现并行化训练来加速学习过程。


一、A3C 算法原理

A3C 算法由 DeepMind 于 2016 年提出,通过异步并行的多个智能体(Worker)与环境交互,显著提升了训练效率。其核心思想可概括为:

  1. 并行架构 多个 Worker 线程同时与各自的环境副本交互,收集经验并计算梯度。

  2. 参数共享 所有 Worker 共享全局网络参数,定期将本地梯度同步到全局网络。

  3. 优势函数(Advantage) 使用 A(s,a)=Q(s,a)−V(s)A(s,a)=Q(s,a)−V(s) 衡量动作的相对优势,降低方差。

算法优势
  • 高效数据收集:并行交互突破数据相关性限制

  • 稳定训练:异步更新缓解梯度冲突

  • 资源利用率高:充分利用多核 CPU 资源


二、A3C 实现步骤

我们将使用 PyTorch 实现 A3C 算法解决 CartPole 平衡问题:

  1. 定义全局网络结构 包含 Actor 和 Critic 两个分支

  2. 实现异步 Worker 每个 Worker 独立与环境交互并计算梯度

  3. 设计并行更新机制 异步更新全局网络参数

  4. 训练与评估


三、代码实现

import torch
import torch.nn as nn
import torch.optim as optim
import torch.multiprocessing as mp
import gym
​
# 定义全局网络
class GlobalNetwork(nn.Module):def __init__(self, state_dim, action_dim):super().__init__()self.base = nn.Sequential(nn.Linear(state_dim, 128),nn.ReLU())self.actor = nn.Linear(128, action_dim)self.critic = nn.Linear(128, 1)
​def forward(self, x):x = self.base(x)return torch.softmax(self.actor(x), dim=-1), self.critic(x)
​
# Worker 进程定义
class Worker(mp.Process):def __init__(self, global_net, optimizer, global_ep, name):super().__init__()self.env = gym.make('CartPole-v1')self.local_net = GlobalNetwork(4, 2)  # 本地网络self.global_net = global_netself.optimizer = optimizerself.global_ep = global_epself.name = name
​def run(self):while self.global_ep.value < 300:  # 训练300个全局episode# 同步全局参数到本地self.local_net.load_state_dict(self.global_net.state_dict())# 收集轨迹数据states, actions, rewards = [], [], []state, _ = self.env.reset()  # 正确解包observationep_reward = 0while True:state_tensor = torch.FloatTensor(state).unsqueeze(0)  # 添加批次维度 [1,4]prob, _ = self.local_net(state_tensor)action = torch.multinomial(prob, 1).item()next_state, reward, done, _, _ = self.env.step(action)  # 解包所有返回值ep_reward += rewardstates.append(state)actions.append(action)rewards.append(reward)if done:# 计算优势函数returns, advantages = self.cal_advantages(states, rewards)# 异步更新全局网络self.update_global(states, actions, advantages, returns)with self.global_ep.get_lock():self.global_ep.value += 1print(f"Episode {self.global_ep.value} finished by {self.name} with reward {ep_reward}")breakstate = next_state
​def cal_advantages(self, states, rewards):# 计算回报returns = []discounted_reward = 0for r in reversed(rewards):discounted_reward = r + 0.99 * discounted_rewardreturns.insert(0, discounted_reward)# 转换为张量并标准化returns = torch.FloatTensor(returns)returns = (returns - returns.mean()) / (returns.std() + 1e-8)# 计算价值估计states_tensor = torch.FloatTensor(states)  # 形状 [n_steps, 4]_, values = self.local_net(states_tensor)values = values.squeeze().detach()# 计算优势函数advantages = returns - valuesreturn returns, advantages
​def update_global(self, states, actions, advantages, returns):states_tensor = torch.FloatTensor(states)  # 形状 [n_steps, 4]actions_tensor = torch.LongTensor(actions)advantages = advantages.detach()returns = returns.detach()# 计算损失probs, values = self.local_net(states_tensor)policy_loss = -torch.log(probs.gather(1, actions_tensor.unsqueeze(1))) * advantages.unsqueeze(1)policy_loss = policy_loss.mean()value_loss = nn.MSELoss()(values.squeeze(), returns)entropy = -torch.sum(probs * torch.log(probs), dim=1).mean()total_loss = policy_loss + 0.5 * value_loss - 0.01 * entropy# 反向传播并更新全局网络self.optimizer.zero_grad()total_loss.backward()# 将本地梯度上传到全局网络for local_param, global_param in zip(self.local_net.parameters(), self.global_net.parameters()):if global_param.grad is not None:global_param.grad = local_param.grad.clone()self.optimizer.step()
​
if __name__ == '__main__':# 初始化全局网络global_net = GlobalNetwork(4, 2)global_net.share_memory()optimizer = optim.Adam(global_net.parameters(), lr=0.0002)global_ep = mp.Value('i', 0)# 启动4个Worker进程workers = [Worker(global_net, optimizer, global_ep, f"Worker-{i}") for i in range(4)][w.start() for w in workers][w.join() for w in workers]

四、关键代码解析

  1. 网络结构设计

    • 共享特征提取层(base

    • Actor 输出动作概率分布

    • Critic 评估状态价值

  2. 并行训练机制

    • 使用 torch.multiprocessing 实现多进程

    • share_memory() 实现参数共享

    • 原子计数器 global_ep 控制训练进度

  3. 优势函数计算 使用 N-step 回报计算优势:


五、训练结果

运行代码将观察到:

  1. 多个 Worker 并行训练,控制台输出各进程的即时奖励

  2. 全局训练 episode 达到 300 时自动终止

    Episode 1 finished by Worker-1 with reward 34.0
    Episode 2 finished by Worker-2 with reward 60.0
    Episode 3 finished by Worker-0 with reward 38.0
    Episode 4 finished by Worker-3 with reward 46.0
    Episode 5 finished by Worker-2 with reward 17.0
    Episode 6 finished by Worker-1 with reward 40.0
    Episode 7 finished by Worker-3 with reward 56.0
    Episode 8 finished by Worker-0 with reward 54.0
    Episode 9 finished by Worker-1 with reward 18.0
    Episode 10 finished by Worker-2 with reward 59.0
    Episode 11 finished by Worker-3 with reward 19.0
    Episode 12 finished by Worker-2 with reward 17.0
    Episode 13 finished by Worker-0 with reward 32.0
    Episode 14 finished by Worker-3 with reward 18.0
    Episode 15 finished by Worker-1 with reward 59.0
    Episode 16 finished by Worker-1 with reward 19.0
    Episode 17 finished by Worker-3 with reward 15.0
    Episode 18 finished by Worker-2 with reward 39.0
    Episode 19 finished by Worker-1 with reward 12.0
    Episode 20 finished by Worker-0 with reward 29.0
    Episode 21 finished by Worker-3 with reward 25.0
    Episode 22 finished by Worker-2 with reward 14.0
    Episode 23 finished by Worker-1 with reward 17.0
    Episode 24 finished by Worker-0 with reward 31.0
    Episode 25 finished by Worker-3 with reward 17.0
    Episode 26 finished by Worker-2 with reward 22.0
    Episode 27 finished by Worker-1 with reward 10.0
    Episode 28 finished by Worker-0 with reward 12.0
    Episode 29 finished by Worker-2 with reward 38.0
    Episode 30 finished by Worker-3 with reward 17.0
    Episode 31 finished by Worker-0 with reward 16.0
    Episode 32 finished by Worker-1 with reward 13.0
    Episode 33 finished by Worker-3 with reward 11.0
    Episode 34 finished by Worker-2 with reward 36.0
    Episode 35 finished by Worker-0 with reward 14.0
    Episode 36 finished by Worker-3 with reward 16.0
    Episode 37 finished by Worker-1 with reward 12.0
    Episode 38 finished by Worker-0 with reward 36.0
    Episode 39 finished by Worker-3 with reward 26.0
    Episode 40 finished by Worker-1 with reward 35.0
    Episode 41 finished by Worker-0 with reward 15.0
    Episode 42 finished by Worker-2 with reward 77.0
    Episode 43 finished by Worker-2 with reward 15.0
    Episode 44 finished by Worker-1 with reward 40.0
    Episode 45 finished by Worker-3 with reward 41.0
    Episode 46 finished by Worker-0 with reward 61.0
    Episode 47 finished by Worker-2 with reward 26.0
    Episode 48 finished by Worker-1 with reward 44.0
    Episode 49 finished by Worker-3 with reward 43.0
    Episode 50 finished by Worker-0 with reward 26.0
    Episode 51 finished by Worker-2 with reward 13.0
    Episode 52 finished by Worker-1 with reward 42.0
    Episode 53 finished by Worker-3 with reward 16.0
    Episode 54 finished by Worker-2 with reward 11.0
    Episode 55 finished by Worker-3 with reward 38.0
    Episode 56 finished by Worker-1 with reward 18.0
    Episode 57 finished by Worker-0 with reward 62.0
    Episode 58 finished by Worker-1 with reward 12.0
    Episode 59 finished by Worker-2 with reward 13.0
    Episode 60 finished by Worker-3 with reward 9.0
    Episode 61 finished by Worker-0 with reward 23.0
    Episode 62 finished by Worker-2 with reward 12.0
    Episode 63 finished by Worker-0 with reward 12.0
    Episode 64 finished by Worker-3 with reward 11.0
    Episode 65 finished by Worker-1 with reward 33.0
    Episode 66 finished by Worker-0 with reward 10.0
    Episode 67 finished by Worker-2 with reward 14.0
    Episode 68 finished by Worker-1 with reward 13.0
    Episode 69 finished by Worker-3 with reward 25.0
    Episode 70 finished by Worker-0 with reward 21.0
    Episode 71 finished by Worker-1 with reward 19.0
    Episode 72 finished by Worker-3 with reward 11.0
    Episode 73 finished by Worker-2 with reward 49.0
    Episode 74 finished by Worker-0 with reward 23.0
    Episode 75 finished by Worker-3 with reward 11.0
    Episode 76 finished by Worker-2 with reward 18.0
    Episode 77 finished by Worker-1 with reward 44.0
    Episode 78 finished by Worker-0 with reward 12.0
    Episode 79 finished by Worker-3 with reward 30.0
    Episode 80 finished by Worker-1 with reward 20.0
    Episode 81 finished by Worker-0 with reward 23.0
    Episode 82 finished by Worker-2 with reward 14.0
    Episode 83 finished by Worker-1 with reward 11.0
    Episode 84 finished by Worker-0 with reward 12.0
    Episode 85 finished by Worker-3 with reward 30.0
    Episode 86 finished by Worker-2 with reward 83.0
    Episode 87 finished by Worker-1 with reward 26.0
    Episode 88 finished by Worker-0 with reward 17.0
    Episode 89 finished by Worker-3 with reward 17.0
    Episode 90 finished by Worker-2 with reward 31.0
    Episode 91 finished by Worker-1 with reward 28.0
    Episode 92 finished by Worker-0 with reward 12.0
    Episode 93 finished by Worker-3 with reward 18.0
    Episode 94 finished by Worker-2 with reward 12.0
    Episode 95 finished by Worker-3 with reward 17.0
    Episode 96 finished by Worker-0 with reward 21.0
    Episode 97 finished by Worker-1 with reward 39.0
    Episode 98 finished by Worker-2 with reward 44.0
    Episode 99 finished by Worker-0 with reward 23.0
    Episode 100 finished by Worker-3 with reward 12.0
    Episode 101 finished by Worker-1 with reward 14.0
    Episode 102 finished by Worker-2 with reward 21.0
    Episode 103 finished by Worker-0 with reward 12.0
    Episode 104 finished by Worker-3 with reward 18.0
    Episode 105 finished by Worker-1 with reward 12.0
    Episode 106 finished by Worker-0 with reward 30.0
    Episode 107 finished by Worker-2 with reward 21.0
    Episode 108 finished by Worker-3 with reward 23.0
    Episode 109 finished by Worker-1 with reward 34.0
    Episode 110 finished by Worker-2 with reward 13.0
    Episode 111 finished by Worker-0 with reward 12.0
    Episode 112 finished by Worker-3 with reward 14.0
    Episode 113 finished by Worker-1 with reward 21.0
    Episode 114 finished by Worker-0 with reward 25.0
    Episode 115 finished by Worker-2 with reward 18.0
    Episode 116 finished by Worker-3 with reward 23.0
    Episode 117 finished by Worker-0 with reward 17.0
    Episode 118 finished by Worker-2 with reward 16.0
    Episode 119 finished by Worker-1 with reward 24.0
    Episode 120 finished by Worker-3 with reward 13.0
    Episode 121 finished by Worker-0 with reward 25.0
    Episode 122 finished by Worker-1 with reward 14.0
    Episode 124 finished by Worker-3 with reward 26.0
    Episode 123 finished by Worker-2 with reward 13.0
    Episode 125 finished by Worker-0 with reward 20.0
    Episode 126 finished by Worker-2 with reward 12.0
    Episode 127 finished by Worker-1 with reward 15.0
    Episode 128 finished by Worker-3 with reward 9.0
    Episode 129 finished by Worker-1 with reward 14.0
    Episode 130 finished by Worker-0 with reward 15.0
    Episode 131 finished by Worker-2 with reward 14.0
    Episode 132 finished by Worker-3 with reward 18.0
    Episode 133 finished by Worker-2 with reward 31.0
    Episode 134 finished by Worker-3 with reward 30.0
    Episode 135 finished by Worker-0 with reward 14.0
    Episode 136 finished by Worker-1 with reward 16.0
    Episode 137 finished by Worker-3 with reward 14.0
    Episode 138 finished by Worker-0 with reward 15.0
    Episode 140 finished by Worker-2 with reward 25.0
    Episode 139 finished by Worker-1 with reward 20.0
    Episode 141 finished by Worker-3 with reward 15.0
    Episode 142 finished by Worker-0 with reward 15.0
    Episode 143 finished by Worker-2 with reward 11.0
    Episode 144 finished by Worker-1 with reward 14.0
    Episode 145 finished by Worker-3 with reward 17.0
    Episode 146 finished by Worker-2 with reward 18.0
    Episode 147 finished by Worker-3 with reward 11.0
    Episode 148 finished by Worker-1 with reward 26.0
    Episode 149 finished by Worker-0 with reward 20.0
    Episode 150 finished by Worker-3 with reward 14.0
    Episode 151 finished by Worker-2 with reward 20.0
    Episode 152 finished by Worker-1 with reward 19.0
    Episode 153 finished by Worker-0 with reward 22.0
    Episode 154 finished by Worker-2 with reward 15.0
    Episode 155 finished by Worker-3 with reward 15.0
    Episode 156 finished by Worker-1 with reward 31.0
    Episode 157 finished by Worker-0 with reward 18.0
    Episode 158 finished by Worker-2 with reward 66.0
    Episode 159 finished by Worker-3 with reward 24.0
    Episode 160 finished by Worker-1 with reward 25.0
    Episode 161 finished by Worker-0 with reward 17.0
    Episode 162 finished by Worker-2 with reward 30.0
    Episode 163 finished by Worker-3 with reward 20.0
    Episode 164 finished by Worker-1 with reward 10.0
    Episode 165 finished by Worker-0 with reward 16.0
    Episode 166 finished by Worker-2 with reward 13.0
    Episode 167 finished by Worker-3 with reward 27.0
    Episode 168 finished by Worker-0 with reward 15.0
    Episode 169 finished by Worker-1 with reward 11.0
    Episode 170 finished by Worker-2 with reward 17.0
    Episode 171 finished by Worker-0 with reward 21.0
    Episode 172 finished by Worker-3 with reward 45.0
    Episode 173 finished by Worker-1 with reward 34.0
    Episode 174 finished by Worker-3 with reward 33.0
    Episode 175 finished by Worker-2 with reward 22.0
    Episode 176 finished by Worker-0 with reward 15.0
    Episode 177 finished by Worker-1 with reward 14.0
    Episode 178 finished by Worker-2 with reward 15.0
    Episode 179 finished by Worker-3 with reward 30.0
    Episode 180 finished by Worker-1 with reward 12.0
    Episode 181 finished by Worker-2 with reward 17.0
    Episode 182 finished by Worker-3 with reward 9.0
    Episode 183 finished by Worker-0 with reward 57.0
    Episode 184 finished by Worker-0 with reward 11.0
    Episode 185 finished by Worker-1 with reward 15.0
    Episode 186 finished by Worker-3 with reward 25.0
    Episode 187 finished by Worker-0 with reward 23.0
    Episode 188 finished by Worker-2 with reward 22.0
    Episode 189 finished by Worker-1 with reward 35.0
    Episode 190 finished by Worker-3 with reward 20.0
    Episode 191 finished by Worker-0 with reward 15.0
    Episode 192 finished by Worker-2 with reward 32.0
    Episode 193 finished by Worker-1 with reward 11.0
    Episode 194 finished by Worker-3 with reward 11.0
    Episode 195 finished by Worker-0 with reward 19.0
    Episode 196 finished by Worker-2 with reward 23.0
    Episode 197 finished by Worker-1 with reward 14.0
    Episode 198 finished by Worker-3 with reward 13.0
    Episode 199 finished by Worker-0 with reward 23.0
    Episode 200 finished by Worker-2 with reward 38.0
    Episode 201 finished by Worker-1 with reward 23.0
    Episode 202 finished by Worker-3 with reward 20.0
    Episode 203 finished by Worker-0 with reward 17.0
    Episode 204 finished by Worker-1 with reward 15.0
    Episode 205 finished by Worker-2 with reward 47.0
    Episode 206 finished by Worker-0 with reward 17.0
    Episode 207 finished by Worker-3 with reward 16.0
    Episode 208 finished by Worker-1 with reward 28.0
    Episode 209 finished by Worker-2 with reward 18.0
    Episode 210 finished by Worker-0 with reward 17.0
    Episode 211 finished by Worker-3 with reward 16.0
    Episode 212 finished by Worker-2 with reward 29.0
    Episode 213 finished by Worker-1 with reward 20.0
    Episode 214 finished by Worker-0 with reward 47.0
    Episode 215 finished by Worker-3 with reward 18.0
    Episode 216 finished by Worker-1 with reward 28.0
    Episode 217 finished by Worker-2 with reward 17.0
    Episode 218 finished by Worker-0 with reward 12.0
    Episode 219 finished by Worker-3 with reward 47.0
    Episode 220 finished by Worker-2 with reward 18.0
    Episode 221 finished by Worker-1 with reward 16.0
    Episode 222 finished by Worker-0 with reward 10.0
    Episode 223 finished by Worker-3 with reward 11.0
    Episode 224 finished by Worker-2 with reward 15.0
    Episode 225 finished by Worker-1 with reward 21.0
    Episode 226 finished by Worker-3 with reward 30.0
    Episode 227 finished by Worker-0 with reward 22.0
    Episode 228 finished by Worker-2 with reward 39.0
    Episode 229 finished by Worker-1 with reward 65.0
    Episode 230 finished by Worker-3 with reward 29.0
    Episode 231 finished by Worker-0 with reward 38.0
    Episode 232 finished by Worker-2 with reward 46.0
    Episode 233 finished by Worker-3 with reward 14.0
    Episode 234 finished by Worker-1 with reward 16.0
    Episode 235 finished by Worker-0 with reward 10.0
    Episode 236 finished by Worker-2 with reward 13.0
    Episode 237 finished by Worker-3 with reward 9.0
    Episode 238 finished by Worker-1 with reward 10.0
    Episode 239 finished by Worker-0 with reward 25.0
    Episode 240 finished by Worker-3 with reward 16.0
    Episode 241 finished by Worker-2 with reward 39.0
    Episode 242 finished by Worker-1 with reward 22.0
    Episode 243 finished by Worker-2 with reward 11.0
    Episode 244 finished by Worker-0 with reward 28.0
    Episode 245 finished by Worker-3 with reward 10.0
    Episode 246 finished by Worker-1 with reward 26.0
    Episode 247 finished by Worker-0 with reward 13.0
    Episode 248 finished by Worker-2 with reward 20.0
    Episode 249 finished by Worker-3 with reward 31.0
    Episode 250 finished by Worker-1 with reward 11.0
    Episode 251 finished by Worker-3 with reward 18.0
    Episode 252 finished by Worker-2 with reward 28.0
    Episode 253 finished by Worker-0 with reward 75.0
    Episode 254 finished by Worker-1 with reward 79.0
    Episode 255 finished by Worker-3 with reward 12.0
    Episode 256 finished by Worker-2 with reward 57.0
    Episode 257 finished by Worker-0 with reward 80.0
    Episode 258 finished by Worker-1 with reward 61.0
    Episode 259 finished by Worker-3 with reward 17.0
    Episode 260 finished by Worker-0 with reward 9.0
    Episode 261 finished by Worker-2 with reward 15.0
    Episode 262 finished by Worker-1 with reward 15.0
    Episode 263 finished by Worker-3 with reward 26.0
    Episode 264 finished by Worker-0 with reward 17.0
    Episode 265 finished by Worker-1 with reward 20.0
    Episode 266 finished by Worker-2 with reward 11.0
    Episode 267 finished by Worker-3 with reward 28.0
    Episode 268 finished by Worker-0 with reward 12.0
    Episode 269 finished by Worker-2 with reward 13.0
    Episode 270 finished by Worker-3 with reward 37.0
    Episode 271 finished by Worker-1 with reward 40.0
    Episode 272 finished by Worker-0 with reward 11.0
    Episode 273 finished by Worker-2 with reward 21.0
    Episode 274 finished by Worker-3 with reward 19.0
    Episode 275 finished by Worker-1 with reward 42.0
    Episode 276 finished by Worker-2 with reward 18.0
    Episode 277 finished by Worker-3 with reward 11.0
    Episode 278 finished by Worker-0 with reward 97.0
    Episode 279 finished by Worker-1 with reward 10.0
    Episode 280 finished by Worker-2 with reward 11.0
    Episode 281 finished by Worker-0 with reward 13.0
    Episode 282 finished by Worker-3 with reward 15.0
    Episode 283 finished by Worker-1 with reward 15.0
    Episode 284 finished by Worker-2 with reward 36.0
    Episode 285 finished by Worker-3 with reward 14.0
    Episode 286 finished by Worker-0 with reward 14.0
    Episode 287 finished by Worker-1 with reward 14.0
    Episode 288 finished by Worker-2 with reward 11.0
    Episode 289 finished by Worker-1 with reward 21.0
    Episode 290 finished by Worker-0 with reward 41.0
    Episode 291 finished by Worker-3 with reward 13.0
    Episode 292 finished by Worker-2 with reward 12.0
    Episode 293 finished by Worker-1 with reward 9.0
    Episode 294 finished by Worker-2 with reward 13.0
    Episode 295 finished by Worker-0 with reward 12.0
    Episode 296 finished by Worker-3 with reward 50.0
    Episode 297 finished by Worker-1 with reward 65.0
    Episode 298 finished by Worker-2 with reward 56.0
    Episode 299 finished by Worker-0 with reward 25.0
    Episode 300 finished by Worker-3 with reward 18.0
    Episode 301 finished by Worker-1 with reward 17.0
    Episode 302 finished by Worker-2 with reward 15.0
    Episode 303 finished by Worker-0 with reward 30.0


六、总结与扩展

本文实现了 A3C 算法的核心逻辑,展示了并行化训练在强化学习中的优势。读者可以尝试以下扩展:

  1. 在 Atari 游戏等复杂环境中测试算法

  2. 调整 Worker 数量观察训练效率变化

  3. 实现梯度裁剪等稳定化技巧

在下一篇文章中,我们将探讨分布式强化学习的进阶技术——IMPALA 算法,敬请期待!


注意事项

  1. 本代码需在支持多进程的环境下运行(如 Linux)

  2. 可添加 torch.nn.utils.clip_grad_norm_ 防止梯度爆炸

  3. 使用 gym.wrappers.Monitor 可录制智能体行为

希望本文能帮助您理解 A3C 算法的核心思想与实践方法!欢迎在评论区交流讨论。

本文来自互联网用户投稿,该文观点仅代表作者本人,不代表本站立场。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如若转载,请注明出处:http://www.mzph.cn/diannao/73853.shtml

如若内容造成侵权/违法违规/事实不符,请联系多彩编程网进行投诉反馈email:809451989@qq.com,一经查实,立即删除!

相关文章

【深度学习】多目标融合算法(五):定制门控网络CGC(Customized Gate Control)

目录 一、引言 二、CGC&#xff08;Customized Gate Control&#xff0c;定制门控网络&#xff09; 2.1 技术原理 2.2 技术优缺点 2.3 业务代码实践 2.3.1 业务场景与建模 2.3.2 模型代码实现 2.3.3 模型训练与推理测试 2.3.4 打印模型结构 三、总结 一、引言 上一…

在线pdf处理网站合集

1、PDF24 Tools&#xff1a;https://tools.pdf24.org/zh/ 2、PDF派&#xff1a;https://www.pdfpai.com/ 3、ALL TO ALL&#xff1a;https://www.alltoall.net/ 4、CleverPDF&#xff1a;https://www.cleverpdf.com/cn 5、Doc Small&#xff1a;https://docsmall.com/ 6、Aconv…

网络编程-实现客户端通信

#include <stdio.h> #include <stdlib.h> #include <string.h> #include <unistd.h> #include <sys/socket.h> #include <netinet/in.h> #include <sys/select.h>#define MAX_CLIENTS 2 // 最大客户端连接数 #define BUFFER_SI…

力扣100二刷——图论、回溯

第二次刷题不在idea写代码&#xff0c;而是直接在leetcode网站上写&#xff0c;“逼”自己掌握常用的函数。 标志掌握程度解释办法⭐Fully 完全掌握看到题目就有思路&#xff0c;编程也很流利⭐⭐Basically 基本掌握需要稍作思考&#xff0c;或者看到提示方法后能解答⭐⭐⭐Sl…

【大模型实战篇】多模态推理模型Skywork-R1V

1. 背景介绍 近期昆仑万维开源的Skywork R1V模型&#xff0c;是基于InternViT-6B-448px-V2_5以及deepseek-ai/DeepSeek-R1-Distill-Qwen-32B 通过强化学习得到。当然语言模型也可以切换成QwQ-32B。因此该模型最终的参数量大小为38B。 该模型具备多模态推理能力&#xf…

识别并脱敏上传到deepseek/chatgpt的文本文件中的护照信息

本文将介绍一种简单高效的方法解决用户在上传文件到DeepSeek、ChatGPT&#xff0c;文心一言&#xff0c;AI等大语言模型平台过程中的护照号识别和脱敏问题。 DeepSeek、ChatGPT&#xff0c;Qwen&#xff0c;Claude等AI平台工具快速的被接受和使用&#xff0c;用户每天上传的文…

数据驱动进化:AI Agent如何重构手机交互范式?

如果说AIGC拉开了内容生成的序幕&#xff0c;那么AI Agent则标志着AI从“工具”向“助手”的跨越式进化。它不再是简单的问答机器&#xff0c;而是一个能够感知环境、规划任务并自主执行的智能体&#xff0c;更像是虚拟世界中的“全能员工”。 正如行业所热议的&#xff1a;“大…

【AI News | 20250319】每日AI进展

AI Repos 1、XianyuAutoAgent 实现了 24 小时自动化值守的 AI 智能客服系统&#xff0c;支持多专家协同决策、智能议价和上下文感知对话&#xff0c;让我们店铺管理更轻松。主要功能&#xff1a; 智能对话引擎&#xff0c;支持上下文感知和专家路由阶梯降价策略&#xff0c;自…

C语言自定义类型【结构体】详解,【结构体内存怎么计算】 详解 【热门考点】:结构体内存对齐

引言 详细讲解什么是结构体&#xff0c;结构体的运用&#xff0c; 详细介绍了结构体在内存中占几个字节的计算。 【热门考点】&#xff1a;结构体内存对齐 介绍了&#xff1a;结构体传参 一、什么是结构体&#xff1f; 结构是⼀些值的集合&#xff0c;这些值称为成员变量。结构…

Apache DolphinScheduler:一个可视化大数据工作流调度平台

Apache DolphinScheduler&#xff08;海豚调度&#xff09;是一个分布式易扩展的可视化工作流任务调度开源系统&#xff0c;适用于企业级场景&#xff0c;提供了一个可视化操作任务、工作流和全生命周期数据处理过程的解决方案。 Apache DolphinScheduler 旨在解决复杂的大数据…

[蓝桥杯 2023 省 B] 飞机降落

[蓝桥杯 2023 省 B] 飞机降落 题目描述 N N N 架飞机准备降落到某个只有一条跑道的机场。其中第 i i i 架飞机在 T i T_{i} Ti​ 时刻到达机场上空&#xff0c;到达时它的剩余油料还可以继续盘旋 D i D_{i} Di​ 个单位时间&#xff0c;即它最早可以于 T i T_{i} Ti​ 时刻…

使用Trae 生成的React版的贪吃蛇

使用Trae 生成的React版的贪吃蛇 首先你想用这个贪吃蛇&#xff0c;你需要先安装Trae Trae 官方地址 他有两种模式 chat builder 我使用的是builder模式,虽然是Alpha.还是可以用。 接下来就是按着需求傻瓜式的操作生成代码 他生成的代码不完全正确&#xff0c;比如没有引入…

rust学习笔记17-异常处理

今天聊聊rust中异常错误处理 1. 基础类型&#xff1a;Result 和 Option&#xff0c;之前判断空指针就用到过 Option<T> 用途&#xff1a;表示值可能存在&#xff08;Some(T)&#xff09;或不存在&#xff08;None&#xff09;&#xff0c;适用于无需错误信息的场景。 f…

Python:单继承方法的重写

继承&#xff1a;让类和类之间转变为父子关系&#xff0c;子类默认继承父类的属性和方法 单继承&#xff1a; class Person:def eat(self):print("eat")def sing(self):print("sing") class Girl(Person):pass#占位符&#xff0c;代码里面类下面不写任何东…

[免费]直接整篇翻译pdf工具-支持多种语言

<闲来没事写篇博客填补中文知识库漏洞> 如题&#xff0c;[免费][本地]工具基于开源仓库&#xff1a; 工具 是python&#xff01;太好了&#xff0c;所以各个平台都可以&#xff0c;我这里基于windows. 1. 先把github代码下载下来&#xff1a; git clone https://githu…

UI设计中的用户反馈机制:提升交互体验的关键

hello宝子们...我们是艾斯视觉擅长ui设计和前端数字孪生、大数据、三维建模、三维动画10年经验!希望我的分享能帮助到您!如需帮助可以评论关注私信我们一起探讨!致敬感谢感恩! 在数字化产品泛滥的今天&#xff0c;用户与界面的每一次交互都在无形中塑造着他们对产品的认知。一个…

Hessian 矩阵是什么

Hessian 矩阵是什么 目录 Hessian 矩阵是什么Hessian 矩阵的性质及举例说明**1. 对称性****2. 正定性决定极值类型****特征值为 2(正),因此原点 ( 0 , 0 ) (0, 0) (0,0) 是极小值点。****3. 牛顿法中的应用****4. 特征值与曲率方向****5. 机器学习中的实际意义**一、定义与…

GPU视频编解码:Jetson VPI+multimedia_api视频编解码入门(一)

目录 一.Pipline与工具栈 二.硬件设备概况 三.GPU视频编解码框架 四.VPI编译使用实例 五. jetson_multimedia_api编译使用实例 一.Pipline与工具栈 二.硬件设备概况 三.GPU视频编解码框架 jetson设备目前不支持VPF框架&#xff0c;关于VPF的使用我在下节PC段使用X86进行安…

《深入理解AOP编程:从基础概念到Spring实现》

AOP编程 AOP(Aspect Oriented Programing) 面向切面编程 Spring动态代理开发 以切面为基本单位的程序开发&#xff0c;通过切脉你间的彼此协同&#xff0c;相互调用&#xff0c;完成程序构建 切面切入点额外功能 OOP(Object Oriented Programing)面向对象编程 java 以对象为基本…

Docker 实践与应用举例

一、引言 在当今快速发展的信息技术领域&#xff0c;容器化技术已成为推动软件开发、部署和运维变革的关键力量。Docker 作为容器化技术的领军者&#xff0c;以其高效、便捷、可移植等特性&#xff0c;被广泛应用于各种场景。它能够将应用程序及其依赖项打包成一个独立的容器&…