

Calculation steps
  1. Sampled x Values (Leaky ReLU Activation Function Calculator)

    Each point i (from 0) uses x = startX + i*stepX for the given number of points.


Results

Leaky ReLU at x = 3
3
f(x) = x if x > 0, else α·x
Number of points 101
α (leak slope) 0.01
First f(x) (at x = -4) -0.04
Last f(x) (at x = 1) 1
x f(x)
-4 -0.04
-3.95 -0.0395
-3.9 -0.039
-3.85 -0.0385
-3.8 -0.038
-3.75 -0.0375
-3.7 -0.037
-3.65 -0.0365
-3.6 -0.036
-3.55 -0.0355
-3.5 -0.035
-3.45 -0.0345
-3.4 -0.034
-3.35 -0.0335
-3.3 -0.033
-3.25 -0.0325
-3.2 -0.032
-3.15 -0.0315
-3.1 -0.031
-3.05 -0.0305
-3 -0.03
-2.95 -0.0295
-2.9 -0.029
-2.85 -0.0285
-2.8 -0.028
-2.75 -0.0275
-2.7 -0.027
-2.65 -0.0265
-2.6 -0.026
-2.55 -0.0255
-2.5 -0.025
-2.45 -0.0245
-2.4 -0.024
-2.35 -0.0235
-2.3 -0.023
-2.25 -0.0225
-2.2 -0.022
-2.15 -0.0215
-2.1 -0.021
-2.05 -0.0205
-2 -0.02
-1.95 -0.0195
-1.9 -0.019
-1.85 -0.0185
-1.8 -0.018
-1.75 -0.0175
-1.7 -0.017
-1.65 -0.0165
-1.6 -0.016
-1.55 -0.0155
-1.5 -0.015
-1.45 -0.0145
-1.4 -0.014
-1.35 -0.0135
-1.3 -0.013
-1.25 -0.0125
-1.2 -0.012
-1.15 -0.0115
-1.1 -0.011
-1.05 -0.0105
-1 -0.01
-0.95 -0.0095
-0.9 -0.009
-0.85 -0.0085
-0.8 -0.008
-0.75 -0.0075
-0.7 -0.007
-0.65 -0.0065
-0.6 -0.006
-0.55 -0.0055
-0.5 -0.005
-0.45 -0.0045
-0.4 -0.004
-0.35 -0.0035
-0.3 -0.003
-0.25 -0.0025
-0.2 -0.002
-0.15 -0.0015
-0.1 -0.001
-0.05 -0.0005
0 0
0.05 0.05
0.1 0.1
0.15 0.15
0.2 0.2
0.25 0.25
0.3 0.3
0.35 0.35
0.4 0.4
0.45 0.45
0.5 0.5
0.55 0.55
0.6 0.6
0.65 0.65
0.7 0.7
0.75 0.75
0.8 0.8
0.85 0.85
0.9 0.9
0.95 0.95
1 1

What is the Leaky ReLU activation function?

Leaky ReLU (Leaky Rectified Linear Unit) is an activation function widely used in deep neural networks. Like standard ReLU, it passes positive inputs through unchanged, but instead of clamping negative inputs to 0 it applies a small slope alpha to them. As a result, a small gradient keeps flowing in the negative region, which helps mitigate the so-called "dying ReLU" problem, where a neuron outputs only 0 and stops learning.

Figure: the Leaky ReLU curve, a piecewise-linear graph through the origin with a small slope for negative inputs and slope 1 for positive inputs.

Formula

For an input \(x\) and leak slope \(\alpha\), the output is:

$$f(x) = \begin{cases} x & \text{if } x > 0 \\[0.5em] \alpha \cdot x & \text{if } x \le 0 \end{cases}$$

The default leak value is \(\alpha = 0.01\). Two special cases are worth knowing: with \(\alpha = 0\) the function is identical to standard ReLU (\(\max(0, x)\)), and with \(\alpha = 1\) it becomes the identity line \(f(x) = x\).
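A minimal Python sketch of the piecewise definition, including the two special cases. The function name `leaky_relu` and its default argument are illustrative choices, not taken from the calculator's own implementation.

```python
def leaky_relu(x: float, alpha: float = 0.01) -> float:
    """Return x for positive inputs and alpha * x otherwise."""
    return x if x > 0 else alpha * x


# Default leak slope: negative inputs are scaled by 0.01.
print(leaky_relu(-4.0))             # -0.04
print(leaky_relu(3.0))              # 3.0

# alpha = 0 matches standard ReLU, max(0, x), for this input.
print(leaky_relu(-4.0, alpha=0.0) == max(0.0, -4.0))  # True

# alpha = 1 reduces to the identity f(x) = x.
print(leaky_relu(-4.0, alpha=1.0))  # -4.0
```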

How to use the calculator

Enter the starting \(x\) value, the spacing between points (step), the number of points to generate, and the leak slope \(\alpha\). The calculator builds the following sequence for \(i = 0\) through \(\text{count}-1\):

$$x_i = \text{startX} + i \cdot \text{stepX}$$

๊ฐ ์ ์—์„œ \(f\)๋ฅผ ๊ณ„์‚ฐํ•ด \((x, f(x))\) ์Œ์˜ ๋ชฉ๋ก๊ณผ ๊ณก์„  ๊ทธ๋ž˜ํ”„๋ฅผ ํ•จ๊ป˜ ๋ณด์—ฌ ์ค๋‹ˆ๋‹ค. \(x\) ๊ฐ’ ํ•˜๋‚˜๋งŒ ์ž…๋ ฅํ•ด \(f(x)\)๋ฅผ ๋ฐ”๋กœ ํ•œ ๋ฒˆ ๊ณ„์‚ฐํ•ด ๋ณผ ์ˆ˜๋„ ์žˆ์Šต๋‹ˆ๋‹ค.

Worked example

With \(\alpha = 0.01\): \(x = -4\) is less than or equal to 0, so \(f = 0.01 \times (-4) = -0.04\). At \(x = 0\), \(f = 0\), and \(x = 3\) is positive, so \(f = 3\). With the defaults (\(\text{startX} = -4\), \(\text{stepX} = 0.05\), \(\text{count} = 101\)), the range starts at \(x = -4\) (\(f = -0.04\)) and runs to \(x = +1.0\) (\(f = 1.0\)), passing through 0 at the 81st point (\(i = 80\)).
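The arithmetic in this example can be checked exactly. The sketch below uses Python's `fractions.Fraction` to avoid floating-point rounding; the variable names are illustrative only.

```python
from fractions import Fraction

# Default inputs written as exact fractions: 0.05 = 1/20, 0.01 = 1/100.
start_x, step_x, count = Fraction(-4), Fraction(1, 20), 101
alpha = Fraction(1, 100)

xs = [start_x + i * step_x for i in range(count)]
fs = [x if x > 0 else alpha * x for x in xs]

zero_index = xs.index(0)
print(zero_index)      # 80, i.e. the 81st point
print(xs[0], fs[0])    # -4 -1/25  (-1/25 = -0.04)
print(xs[-1], fs[-1])  # 1 1
```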

์ž์ฃผ ๋ฌป๋Š” ์งˆ๋ฌธ

How does Leaky ReLU differ from ReLU? ReLU outputs exactly 0 for every negative input, whereas Leaky ReLU outputs the small negative value \(\alpha \cdot x\), which preserves the gradient.
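To make the gradient point concrete, the sketch below compares the derivatives of the two functions. The helper names are hypothetical. Neither function is differentiable at exactly \(x = 0\); the value returned there is a convention chosen to match the \(x \le 0\) branch of the formula.

```python
def relu_grad(x: float) -> float:
    """Derivative of max(0, x); 0 is used at x = 0 by convention."""
    return 1.0 if x > 0 else 0.0


def leaky_relu_grad(x: float, alpha: float = 0.01) -> float:
    """Derivative of Leaky ReLU; alpha is used at x = 0 by convention."""
    return 1.0 if x > 0 else alpha


for x in (-2.0, 0.5):
    print(x, relu_grad(x), leaky_relu_grad(x))
# -2.0 0.0 0.01
# 0.5 1.0 1.0
```

For the negative input, ReLU's derivative is 0, so no gradient reaches the weights feeding that neuron, while Leaky ReLU still passes a derivative of \(\alpha\).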

What is a good value for alpha? 0.01 is a commonly used default. In variants such as Parametric ReLU, \(\alpha\) is itself learned during training.

Can alpha be negative? Mathematically yes, but it is uncommon and not recommended for typical neural networks.
