1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
1001
1002
1003
1004
1005
1006
1007
1008
1009
1010
1011
1012
1013
1014
1015
1016
1017
1018
1019
1020
1021
1022
1023
1024
1025
1026
1027
1028
1029
1030
1031
1032
1033
1034
1035
1036
1037
1038
1039
1040
1041
1042
1043
1044
1045
1046
1047
1048
1049
1050
1051
1052
1053
1054
1055
1056
1057
1058
1059
1060
1061
1062
1063
1064
1065
1066
1067
1068
1069
1070
1071
1072
1073
1074
1075
1076
1077
1078
1079
1080
1081
1082
1083
1084
1085
1086
1087
1088
1089
1090
1091
1092
1093
1094
1095
1096
1097
1098
1099
1100
1101
1102
1103
1104
1105
1106
1107
1108
1109
1110
1111
1112
1113
1114
1115
1116
1117
1118
1119
1120
1121
1122
1123
1124
1125
1126
1127
1128
1129
1130
1131
1132
1133
1134
1135
1136
1137
1138
1139
1140
1141
1142
1143
1144
1145
1146
1147
1148
1149
1150
1151
1152
1153
1154
1155
1156
1157
1158
1159
1160
1161
1162
1163
1164
1165
1166
1167
1168
1169
1170
1171
1172
1173
1174
1175
1176
1177
1178
1179
1180
1181
1182
1183
1184
1185
1186
1187
1188
1189
1190
1191
1192
1193
1194
1195
1196
1197
1198
1199
1200
1201
1202
1203
1204
1205
1206
1207
1208
1209
1210
1211
1212
1213
1214
1215
1216
1217
1218
1219
1220
1221
1222
1223
1224
1225
1226
1227
1228
1229
1230
1231
1232
1233
1234
1235
1236
1237
1238
1239
1240
1241
1242
1243
1244
1245
1246
1247
1248
1249
1250
1251
1252
1253
1254
1255
1256
1257
1258
1259
1260
1261
1262
1263
1264
1265
1266
1267
1268
1269
1270
1271
1272
1273
1274
1275
1276
1277
1278
1279
1280
1281
1282
1283
1284
1285
1286
1287
1288
1289
1290
1291
1292
1293
1294
1295
1296
1297
1298
1299
1300
1301
1302
1303
1304
1305
1306
1307
1308
1309
1310
1311
1312
1313
1314
1315
1316
1317
1318
1319
1320
1321
1322
1323
1324
1325
1326
1327
1328
1329
1330
1331
1332
1333
1334
1335
1336
1337
1338
1339
1340
1341
1342
1343
1344
1345
1346
1347
1348
1349
1350
1351
1352
1353
1354
1355
1356
1357
1358
1359
1360
1361
1362
1363
1364
1365
1366
1367
1368
1369
1370
1371
1372
1373
1374
1375
1376
1377
1378
1379
1380
1381
1382
1383
1384
1385
1386
1387
1388
1389
1390
1391
1392
1393
1394
1395
1396
1397
1398
1399
1400
1401
1402
1403
1404
1405
1406
1407
1408
1409
1410
1411
1412
1413
1414
1415
1416
1417
1418
1419
1420
1421
1422
1423
1424
1425
1426
1427
1428
1429
1430
1431
1432
1433
1434
1435
1436
1437
1438
1439
1440
1441
1442
1443
1444
1445
1446
1447
1448
1449
1450
1451
1452
1453
1454
1455
1456
1457
1458
1459
1460
1461
1462
1463
1464
1465
1466
1467
1468
1469
1470
1471
1472
1473
1474
1475
1476
1477
1478
1479
1480
1481
1482
1483
1484
1485
1486
1487
1488
1489
1490
1491
1492
1493
1494
1495
1496
1497
1498
1499
1500
1501
1502
1503
1504
1505
1506
1507
1508
1509
1510
1511
1512
1513
1514
1515
1516
1517
1518
1519
1520
1521
1522
1523
1524
1525
1526
1527
1528
1529
1530
1531
1532
1533
1534
1535
1536
1537
1538
1539
1540
1541
1542
1543
1544
1545
1546
1547
1548
1549
1550
1551
1552
1553
1554
1555
1556
1557
1558
1559
1560
1561
1562
1563
1564
1565
1566
1567
1568
1569
1570
1571
1572
1573
1574
1575
1576
1577
1578
1579
1580
1581
1582
1583
1584
1585
1586
1587
1588
1589
1590
1591
1592
1593
1594
1595
1596
1597
1598
1599
1600
1601
1602
1603
1604
1605
1606
1607
1608
1609
1610
1611
1612
1613
1614
1615
1616
1617
1618
1619
1620
1621
1622
1623
1624
1625
1626
1627
1628
1629
1630
1631
1632
1633
1634
1635
1636
1637
1638
1639
1640
1641
1642
1643
1644
1645
1646
1647
1648
1649
1650
1651
1652
1653
1654
1655
1656
1657
1658
1659
1660
1661
1662
1663
1664
1665
1666
1667
1668
1669
1670
1671
1672
1673
1674
1675
1676
1677
1678
1679
1680
1681
1682
1683
1684
1685
1686
1687
1688
1689
1690
1691
1692
1693
1694
1695
1696
1697
1698
1699
1700
1701
1702
1703
1704
1705
1706
1707
1708
1709
1710
1711
1712
1713
1714
1715
1716
1717
1718
1719
1720
1721
1722
1723
1724
1725
1726
1727
1728
1729
1730
1731
1732
1733
1734
1735
1736
1737
1738
1739
1740
1741
1742
1743
1744
1745
1746
1747
1748
1749
1750
1751
1752
1753
1754
1755
1756
1757
1758
1759
1760
1761
1762
1763
1764
1765
1766
1767
1768
1769
1770
1771
1772
1773
1774
1775
1776
1777
1778
1779
1780
1781
1782
1783
1784
1785
1786
1787
1788
1789
1790
1791
1792
1793
1794
1795
1796
1797
1798
1799
1800
1801
1802
1803
1804
1805
1806
1807
1808
1809
1810
1811
1812
1813
1814
1815
1816
1817
1818
1819
1820
1821
1822
1823
1824
1825
1826
1827
1828
1829
1830
1831
1832
1833
1834
1835
1836
1837
1838
1839
1840
1841
1842
1843
1844
1845
1846
1847
1848
1849
1850
1851
1852
1853
1854
1855
1856
1857
1858
1859
1860
1861
1862
1863
1864
1865
1866
1867
1868
1869
1870
1871
1872
1873
1874
1875
1876
1877
1878
1879
1880
1881
1882
1883
1884
1885
1886
1887
1888
1889
1890
1891
1892
1893
1894
1895
1896
1897
1898
1899
1900
1901
1902
1903
1904
1905
1906
1907
1908
1909
1910
1911
1912
1913
1914
1915
1916
1917
1918
1919
1920
1921
1922
1923
1924
1925
1926
1927
1928
1929
1930
1931
1932
1933
1934
1935
1936
1937
1938
1939
1940
1941
1942
1943
1944
1945
1946
1947
1948
1949
1950
1951
1952
1953
1954
1955
1956
1957
1958
1959
1960
1961
1962
1963
1964
1965
1966
1967
1968
1969
1970
1971
1972
1973
1974
1975
1976
1977
1978
1979
1980
1981
1982
1983
1984
1985
1986
1987
1988
1989
1990
1991
1992
1993
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
2026
2027
2028
2029
2030
2031
2032
2033
2034
2035
2036
2037
2038
2039
2040
2041
2042
2043
2044
2045
2046
2047
2048
2049
2050
2051
2052
2053
2054
2055
2056
2057
2058
2059
2060
2061
2062
2063
2064
2065
2066
2067
2068
2069
2070
2071
2072
2073
2074
2075
2076
2077
2078
2079
2080
2081
2082
2083
2084
2085
2086
2087
2088
2089
2090
2091
2092
2093
2094
2095
2096
2097
2098
2099
2100
2101
2102
2103
2104
2105
2106
2107
2108
2109
2110
2111
2112
2113
2114
2115
2116
2117
2118
2119
2120
2121
2122
2123
2124
2125
2126
2127
2128
2129
2130
2131
2132
2133
2134
2135
2136
2137
2138
2139
2140
2141
2142
2143
2144
2145
2146
2147
2148
2149
2150
2151
2152
2153
2154
2155
2156
2157
2158
2159
2160
2161
2162
2163
2164
2165
2166
2167
2168
2169
2170
2171
2172
2173
2174
2175
2176
2177
2178
2179
2180
2181
2182
2183
2184
2185
2186
2187
2188
2189
2190
2191
2192
2193
2194
2195
2196
2197
2198
2199
2200
2201
2202
2203
2204
2205
2206
2207
2208
2209
2210
2211
2212
2213
2214
2215
2216
2217
2218
2219
2220
2221
2222
2223
2224
2225
2226
2227
2228
2229
2230
2231
2232
2233
2234
2235
2236
2237
2238
2239
2240
2241
2242
2243
2244
2245
2246
2247
2248
2249
2250
2251
2252
2253
2254
2255
2256
2257
2258
2259
2260
2261
2262
2263
2264
2265
2266
2267
2268
2269
2270
2271
2272
2273
2274
2275
2276
2277
2278
2279
2280
2281
2282
2283
2284
2285
2286
2287
2288
2289
2290
2291
2292
2293
2294
2295
2296
2297
2298
2299
2300
2301
2302
2303
2304
2305
2306
2307
2308
2309
2310
2311
2312
2313
2314
2315
2316
2317
2318
2319
2320
2321
2322
2323
2324
2325
2326
2327
2328
2329
2330
2331
2332
2333
2334
2335
2336
2337
2338
2339
2340
2341
2342
2343
2344
2345
2346
2347
2348
2349
2350
2351
2352
2353
2354
2355
2356
2357
2358
2359
2360
2361
2362
2363
2364
2365
2366
2367
2368
2369
2370
2371
2372
2373
2374
2375
2376
2377
2378
2379
2380
2381
2382
2383
2384
2385
2386
2387
2388
2389
2390
2391
2392
2393
2394
2395
2396
2397
2398
2399
2400
2401
2402
2403
2404
2405
2406
2407
2408
2409
2410
2411
2412
2413
2414
2415
2416
2417
2418
2419
2420
2421
2422
2423
2424
2425
2426
2427
2428
2429
2430
2431
2432
2433
2434
2435
2436
2437
2438
2439
2440
2441
2442
2443
2444
2445
2446
2447
2448
2449
2450
2451
2452
2453
2454
2455
2456
2457
2458
2459
2460
2461
2462
2463
2464
2465
2466
2467
2468
2469
2470
2471
2472
2473
2474
2475
2476
2477
2478
2479
2480
2481
2482
2483
2484
2485
2486
2487
2488
2489
2490
2491
2492
2493
2494
2495
2496
2497
2498
2499
2500
2501
2502
2503
2504
2505
2506
2507
2508
2509
2510
2511
2512
2513
2514
2515
2516
2517
2518
2519
2520
2521
2522
2523
2524
2525
2526
2527
2528
2529
2530
2531
2532
2533
2534
2535
2536
2537
2538
2539
2540
2541
2542
2543
2544
2545
2546
2547
2548
2549
2550
2551
2552
2553
2554
2555
2556
2557
2558
2559
2560
2561
2562
2563
2564
2565
2566
2567
2568
2569
2570
2571
2572
2573
2574
2575
2576
2577
2578
2579
2580
2581
2582
2583
2584
2585
2586
2587
2588
2589
2590
2591
2592
2593
2594
2595
2596
2597
2598
2599
2600
2601
2602
2603
2604
2605
2606
2607
2608
2609
2610
2611
2612
2613
2614
2615
2616
2617
2618
2619
2620
2621
2622
2623
2624
2625
2626
2627
2628
2629
2630
2631
2632
2633
2634
2635
2636
2637
2638
2639
2640
2641
2642
2643
2644
2645
2646
2647
2648
2649
2650
2651
2652
2653
2654
2655
2656
2657
2658
2659
2660
2661
2662
2663
2664
2665
2666
2667
2668
2669
2670
2671
2672
2673
2674
2675
2676
2677
2678
2679
2680
2681
2682
2683
2684
2685
2686
2687
2688
2689
2690
2691
2692
2693
2694
2695
2696
2697
2698
2699
2700
2701
2702
2703
2704
2705
2706
2707
2708
2709
2710
2711
2712
2713
2714
2715
2716
2717
2718
2719
2720
2721
2722
2723
2724
2725
2726
2727
2728
2729
2730
2731
2732
2733
2734
2735
2736
2737
2738
2739
2740
2741
2742
2743
2744
2745
2746
2747
2748
2749
2750
2751
2752
2753
2754
2755
2756
2757
2758
2759
2760
2761
2762
2763
2764
2765
2766
2767
2768
2769
2770
2771
2772
2773
2774
2775
2776
2777
2778
2779
2780
2781
2782
2783
2784
2785
2786
2787
2788
2789
2790
2791
2792
2793
2794
2795
2796
2797
2798
2799
2800
2801
2802
2803
2804
2805
2806
2807
2808
2809
2810
2811
2812
2813
2814
2815
2816
2817
2818
2819
2820
2821
2822
2823
2824
2825
2826
2827
2828
2829
2830
2831
2832
2833
2834
2835
2836
2837
2838
2839
2840
2841
2842
2843
2844
2845
2846
2847
2848
2849
2850
2851
2852
2853
2854
2855
2856
2857
2858
2859
2860
2861
2862
2863
2864
2865
2866
2867
2868
2869
2870
2871
2872
2873
2874
2875
2876
2877
2878
2879
2880
2881
2882
2883
2884
2885
2886
2887
2888
2889
2890
2891
2892
2893
2894
2895
2896
2897
2898
2899
2900
2901
2902
2903
2904
2905
2906
2907
2908
2909
2910
2911
2912
2913
2914
2915
2916
2917
2918
2919
2920
2921
2922
2923
2924
2925
2926
2927
2928
2929
2930
2931
2932
2933
2934
2935
2936
2937
2938
2939
2940
2941
2942
2943
2944
2945
2946
2947
2948
2949
2950
2951
2952
2953
2954
2955
2956
2957
2958
2959
2960
2961
2962
2963
2964
2965
2966
2967
2968
2969
2970
2971
2972
2973
2974
2975
2976
2977
2978
2979
2980
2981
2982
2983
2984
2985
2986
2987
2988
2989
2990
2991
2992
2993
2994
2995
2996
2997
2998
2999
3000
3001
3002
3003
3004
3005
3006
3007
3008
3009
3010
3011
3012
3013
3014
3015
3016
3017
3018
3019
3020
3021
3022
3023
3024
3025
3026
3027
3028
3029
3030
3031
3032
3033
3034
3035
3036
3037
3038
3039
3040
3041
3042
3043
3044
3045
3046
3047
3048
3049
3050
3051
3052
3053
3054
3055
3056
3057
3058
3059
3060
3061
3062
3063
3064
3065
3066
3067
3068
3069
3070
3071
3072
3073
3074
3075
3076
3077
3078
3079
3080
3081
3082
3083
3084
3085
3086
3087
3088
3089
3090
3091
3092
3093
3094
3095
3096
3097
3098
3099
3100
3101
3102
3103
3104
3105
3106
3107
3108
3109
3110
3111
3112
3113
3114
3115
3116
3117
3118
3119
3120
3121
3122
3123
3124
3125
3126
3127
3128
3129
3130
3131
3132
3133
3134
3135
3136
3137
3138
3139
3140
3141
3142
3143
3144
3145
3146
3147
3148
3149
3150
3151
3152
3153
3154
3155
3156
3157
3158
3159
3160
3161
3162
3163
3164
3165
3166
3167
3168
3169
3170
3171
3172
3173
3174
3175
3176
3177
3178
3179
3180
3181
3182
3183
3184
3185
3186
3187
3188
3189
3190
3191
3192
3193
3194
3195
3196
3197
3198
3199
3200
3201
3202
3203
3204
3205
3206
3207
3208
3209
3210
3211
3212
3213
3214
3215
3216
3217
3218
3219
3220
3221
3222
3223
3224
3225
3226
3227
3228
3229
3230
3231
3232
3233
3234
3235
3236
3237
3238
3239
3240
3241
3242
3243
3244
3245
3246
3247
3248
3249
3250
3251
3252
3253
3254
3255
3256
3257
3258
3259
3260
3261
3262
3263
3264
3265
3266
3267
3268
3269
3270
3271
3272
3273
3274
3275
3276
3277
3278
3279
3280
3281
3282
3283
3284
3285
3286
3287
3288
3289
3290
3291
3292
3293
3294
3295
3296
3297
3298
3299
3300
3301
3302
3303
3304
3305
3306
3307
3308
3309
3310
3311
3312
3313
3314
3315
3316
3317
3318
3319
3320
3321
3322
3323
3324
3325
3326
3327
3328
3329
3330
3331
3332
3333
3334
3335
3336
3337
3338
3339
3340
3341
3342
3343
3344
3345
3346
3347
3348
3349
3350
3351
3352
3353
3354
3355
3356
3357
3358
3359
3360
3361
3362
3363
3364
3365
3366
3367
3368
3369
3370
3371
3372
3373
3374
3375
3376
3377
3378
3379
3380
3381
3382
3383
3384
3385
3386
3387
3388
3389
3390
3391
3392
3393
3394
3395
3396
3397
3398
3399
3400
3401
3402
3403
3404
3405
3406
3407
3408
3409
3410
3411
3412
3413
3414
3415
3416
3417
3418
3419
3420
3421
3422
3423
3424
3425
3426
3427
3428
3429
3430
3431
3432
3433
3434
3435
3436
3437
3438
3439
3440
3441
3442
3443
3444
3445
3446
3447
3448
3449
3450
3451
3452
3453
3454
3455
3456
3457
3458
3459
3460
3461
3462
3463
3464
3465
3466
3467
3468
3469
3470
3471
3472
3473
3474
3475
3476
3477
3478
3479
3480
3481
3482
3483
3484
3485
3486
3487
3488
3489
3490
3491
3492
3493
3494
3495
3496
3497
3498
3499
3500
3501
3502
3503
3504
3505
3506
3507
3508
3509
3510
3511
3512
3513
3514
3515
3516
3517
3518
3519
3520
3521
3522
3523
3524
3525
3526
3527
3528
3529
3530
3531
3532
3533
3534
3535
3536
3537
3538
3539
3540
3541
3542
3543
3544
3545
3546
3547
3548
3549
3550
3551
3552
3553
3554
3555
3556
3557
3558
3559
3560
3561
3562
3563
3564
3565
3566
3567
|
Mesa 20.0.0 Release Notes / 2020-02-19
======================================
Mesa 20.0.0 is a new development release. People who are concerned with
stability and reliability should stick with a previous release or wait
for Mesa 20.0.1.
Mesa 20.0.0 implements the OpenGL 4.6 API, but the version reported by
glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being
used. Some drivers don't support all the features required in OpenGL
4.6. OpenGL 4.6 is **only** available if requested at context creation.
Compatibility contexts may report a lower version depending on each
driver.
Mesa 20.0.0 implements the Vulkan 1.2 API, but the version reported by
the apiVersion property of the VkPhysicalDeviceProperties struct depends
on the particular driver being used.
SHA256 checksum
---------------
::
bb6db3e54b608d2536d4000b3de7dd3ae115fc114e8acbb5afff4b3bbed04b34 mesa-20.0.0.tar.xz
New features
------------
- OpenGL 4.6 on radeonsi.
- GL_ARB_gl_spirv on radeonsi.
- GL_ARB_spirv_extensions on radeonsi.
- GL_EXT_direct_state_access for compatibility profile.
- VK_AMD_device_coherent_memory on RADV.
- VK_AMD_mixed_attachment_samples on RADV.
- VK_AMD_shader_explicit_vertex_parameter on RADV.
- VK_AMD_shader_image_load_store_lod on RADV.
- VK_AMD_shader_fragment_mask on RADV.
- VK_EXT_subgroup_size_control on RADV/LLVM.
- VK_KHR_separate_depth_stencil_layouts on Intel, RADV.
- VK_KHR_shader_subgroup_extended_types on RADV.
- VK_KHR_swapchain_mutable_format on RADV.
- VK_KHR_shader_float_controls on RADV/ACO.
- GFX6 (Southern Islands) and GFX7 (Sea Islands) support on RADV/ACO.
- Wave32 support for GFX10 (Navi) on RADV/ACO.
- Compilation of Geometry Shaders on RADV/ACO.
- Vulkan 1.2 on Intel, RADV.
- GL_INTEL_shader_integer_functions2 and
VK_INTEL_shader_integer_functions2 on Intel.
Bug fixes
---------
- drisw crashes on calling NULL putImage on EGL surfaceless platform
(pbuffer EGLSurface)
- [radeonsi][vaapi][bisected] invalid VASurfaceID when playing
interlaced DVB stream in Kodi
- [RADV] GPU hangs while the cutscene plays in the game Assassin's
Creed Origins
- ACO: The Elder Scrolls Online crashes on startup (Navi)
- Broken rendering of glxgears on S/390 architecture (64bit, BigEndian)
- aco: sun flickering with Assassins Creeds Origins
- !1896 broke ext_image_dma_buf_import piglit tests with radeonsi
- aco: wrong geometry with Assassins Creed Origins on GFX6
- valgrind errors since commit a8ec4082a41
- OSMesa osmesa_choose_format returns a format not supported by
st_new_renderbuffer_fb
- Build error with VS on WIN
- Using EGL_KHR_surfaceless_context causes spurious "libEGL warning:
FIXME: egl/x11 doesn't support front buffer rendering."
- !3460 broke texsubimage test with piglit on zink+anv
- The screen is black when using ACO
- [Regression] JavaFX unbounded VRAM+RAM usage
- radv: implement VK_AMD_shader_explicit_vertex_parameter
- Civilization VI crashes when loading game (AMD Vega Mobile)
- [radeonsi] X-Server crashes when trying to start Guild Wars 2 with
the commits from !3421
- aco: implement GFX6 support
- Add support for VK_KHR_swapchain_mutable_format
- radv: The Surge 2 crashes in ac_get_elem_bits()
- [Regression] JavaFX unbounded VRAM+RAM usage
- Use the OpenCL dispatch defnitions from OpenCL_Headers
- [regression][ilk,g965,g45] various dEQP-GLES2.functional.shaders.\*
failures
- aco: Dead Rising 4 crashes in lower_to_hw_instr() on GFX6-GFX7
- libvulkan_radeon.so crash with \`free(): double free detected in
tcache 2\`
- Commit be08e6a causes crash in com.android.launcher3 (Launcher)
- anv: Regression causing issues for radv when there are no Intel
devices
- Mesa no longer compiles with GCC 10
- [Navi/aco] Guild Wars 2 - ring gfx timeout with commit 3bca0af2
- [radv/aco] Regression is causing a soft crash in The Witcher 3
- [bisected] [radeonsi] GPU hangs/resets while playing interlaced
content on Kodi with VAAPI
- [radeonsi] MSAA image not copied properly after image store through
texture view
- T-Rex and Manhattan onscreen performance issue on Android
- VkSamplerCreateInfo compareEnable not respected
- VkSamplerCreateInfo compareEnable not respected
- Freedreno drm softpin driver implementation leaks memory
- [POLARIS10] VRAM leak involving glTexImage2D with non-NULL data
argument
- [regression][bisected][ivb/byt] crucible test
func.push-constants.basic.q0 causes gpu hang
- MR 3096 broke lots of piglit ext_framebuffer_object tests on Raven
- Rise of the Tomb Raider benchmark crash on Dell XPS 7390 2-in-1 w/
Iris Plus Graphics (Ice Lake 8x8 GT2)
- Raven Ridge (2400G): Resident Evil 2 crashes my machine
- Common practice of glGetActiveUniform leads to O(N²) behavior in Mesa
- Rocket League ingame artifacts
- [radv] SteamVR direct mode no longer works
- [ANV] unused create parameters not properly ignored
- [Bisected] Mesa fails to start alacritty with the wayland backend
(AMD Vega).
- [iris] piglit test clip-distance-vs-gs-out fails due to VUE map
mismatch between VS <-> GS stages
- [radv] SteamVR direct mode no longer works
- Blocky corruption in The Surge 2
- radeonsi: Floating point exception on R9 270 gpu for a set of traces
- [RADV] [Navi] LOD artifacting in Halo - The Master Chief Collection
(Halo Reach)
- [CTS]
dEQP-VK.api.image_clearing.core.clear_color_image.2d.linear.single_layer.r32g32b32\_\*
fail on GFX6-GFX8
- Vulkan: Please consider adding another sample count to
sampledImageIntegerSampleCounts
- Navi10: Bitrate based encoding with VAAPI/RadeonSI unusable
- [RADV] create parameters not properly ignored
- [regression][bdw,gen9,hsw,icl][iris] gltcs failures on
mesa=8172b1fa03f
- Bugs in RadeonSI VAAPI implementation
- [GFX10] Glitch rendering Custom Avatars in Beat Saber
- intel/fs: Check for 16-bit immediates in
fs_visitor::lower_mul_dword_inst is too strict
- i965/iris: assert when destroy GL context with active query
- Visuals without alpha bits are not sRGB-capable
- swapchain throttling: wait for fence has 1ns timeout
- radeonsi: OpenGL app always produces page fault in gfxhub on Navi 10
- [regression]
KHR-GLES31.core.geometry_shader.api.program_pipeline_vs_gs_capture
fails for various drivers
- [CTS]
dEQP-VK.spirv_assembly.instruction.spirv1p4.entrypoint.tess_con_pc_entry_point
hangs on GFX10
- [RADV] SPIR-V warning when compiling shader using storage
multisampled image array
- [RADV] The Dead Rising 4 is causing a GPU hang with LLVM backend
- macOS u_thread.h:156:4: error: implicit declaration of function
'pthread_getcpuclockid'
- [Wine / Vulkan] Doom 2016 Hangs on Main Menu
- NULL resource when playing VP9 video through VDPAU on RX 570
- radeonsi: mpv --vo=vaapi incorrect rendering on gfx9+
- [BSW/BDW] skia lcdblendmode & lcdoverlap test failure
- Create a way to prefer iris vs i965 via driconf
- [Bisected] i965: CS:GO crashes in emit_deref_copy_load_store with
debug Mesa
- radv/aco Jedi Fallen Order hair rendering buggy
- Inaccurate information on https://docs.mesa3d.org/repository.html
about how to get git write access.
- [RADV] VK_KHR_timeline_semaphore balloons in runtime
- Shadow of Mordor has randomly dancing black shadows on Talion's face
- gen7 crucible failures func.push-constants.basic.q0 and
func.shader-subgroup-vote.basic.q0
- GL_EXT_disjoint_timer_query failing with GL_INVALID_ENUM
- Unreal 4 Elemental and MatineeFightScene demos misrender
- gputest gimark has unwanted black liquorice flakes
- triangle strip clipping with GL_FIRST_VERTEX_CONVENTION causes wrong
vertex's attribute to be broadcasted for flat interpolation
- [bisected][regression][g45,g965,ilk] piglit arb_fragment_program kil
failures
- glcts crashes since the enablement of ARB_shading_language_include
- Android build broken
- ld.lld: error: duplicate symbol (mesa-19.3.0-rc1)
- Divinity: Original Sin Enhanced Edition(Native) crash on start
- HSW. Tropico 6 and SuperTuxKart have shadows flickering
- GL_EXT_disjoint_timer_query failing with GL_INVALID_ENUM
- glxgears segfaults on POWER / Xvnc
- [regression][bdw,gen9,icl][iris] piglit failures on mesa
f9fd04aca15fd00889caa666ba38007268e67f5c
- Redundant builds of libmesa_classic and libmesa_gallium
- [IVB,BYT] [Regression] [Bisected] Core dump at launching
arb_compute_shader/linker/bug-93840.shader_test
- Vulkan drivers need access to format utils of gallium
- Disabling lower_fragdata_array causes shader-db to crash for some
drivers
- GL_EXT_disjoint_timer_query failing with GL_INVALID_ENUM
- Android build broken by commit 9020f51 "util/u_endian: Add error
checks"
- radv secure compile feature breaks compilation of RADV on armhf EABI
(19.3-rc1)
- radv_debug.c warnings when compiling on 32 bits : cast to pointer
from integer of different size
- Meson: Mesa3D build failure with standalone Mingw-w64 multilib
- [regression][bisected] KHR46 VertexArrayAttribFormat has unexpectedly
generated GL_INVALID_OPERATION
- textureSize(samplerExternalOES, int) missing in desktop mesa 19.1.7
implementation
- zink: implicly casting integers to pointers, warnings on 32-bit
compile
- Objects leaving trails in Firefox with antialias and
preserveDrawingBuffer in three.js WebGLRednerer with mesa 19.2
Changes
-------
Aaron Watry (1):
- clover/llvm: fix build after llvm 10 commit 1dfede3122ee
Adam Jackson (1):
- drisw: Cache the depth of the X drawable
Afonso Bordado (4):
- pan/midgard: Optimize comparisions with similar operations
- pan/midgard: Move midgard_is_branch_unit to helpers
- pan/midgard: Optimize branches with inverted arguments
- pan/midgard: Fix midgard_compile.h includes
Alan Coopersmith (1):
- intel/perf: adapt to platforms like Solaris without d_type in struct
dirent
Alejandro Piñeiro (4):
- v3d: adds an extra MOV for any sig.ld\*
- mesa/main/util: moving gallium u_mm to util, remove main/mm
- nir/opt_peephole_select: remove unused variables
- turnip: remove unused descriptor state dirty
Alexander van der Grinten (1):
- egl: Fix \_eglPointerIsDereferencable w/o mincore()
Alexander von Gluck IV (1):
- haiku/hgl: Fix build via header reordering
Alyssa Rosenzweig (223):
- pipe-loader: Build kmsro loader for with all kmsro targets
- pan/midgard: Remove OP_IS_STORE_VARY
- pan/midgard: Add a dummy source for loads
- pan/midgard: Refactor swizzles
- pan/midgard: Eliminate blank_alu_src
- pan/midgard: Use fp32 blend shaders
- pan/midgard: Validate tags when branching
- pan/midgard: Fix quadword_count handling
- pan/midgard: Compute bundle interference
- pan/midgard: Add bizarre corner case
- pan/midgard: offset_swizzle doesn't need dstsize
- pan/midgard: Extend offset_swizzle to non-32-bit
- pan/midgard: Extend swizzle packing for vec4/16-bit
- pan/midgard: Extend default_phys_reg to !32-bit
- panfrost/ci: Update T760 expectations
- pan/midgard: Fix printing of half-registers in texture ops
- pan/midgard: Disassemble half-steps correctly
- pan/midgard: Pass shader stage to disassembler
- pan/midgard: Switch base for vertex texturing on T720
- nir: Add load_output_u8_as_fp16_pan intrinsic
- pan/midgard: Identify ld_color_buffer_u8_as_fp16\*
- pan/midgard: Implement nir_intrinsic_load_output_u8_as_fp16_pan
- pan/midgard: Pack load/store masks
- panfrost: Select format-specific blending intrinsics
- pan/midgard: Add blend shader selection bits for MRT
- pan/midgard: Implement linearly-constrained register allocation
- pan/midgard: Integrate LCRA
- pan/midgard: Remove util/ra support
- pan/midgard: Compute spill costs
- pan/lcra: Use Chaitin's spilling heuristic
- pan/midgard: Copypropagate vector creation
- pan/midgard: Fix copypropagation for textures
- pan/midgard: Generalize texture registers across GPUs
- pan/midgard: Fix vertex texturing on early Midgard
- pan/midgard: Use texture, not textureLod, on early Midgard
- pan/midgard: Disassemble with old pipeline always on T720
- pan/midgard: Prioritize texture registers
- pan/midgard: Expand 64-bit writemasks
- pan/midgard: Implement i2i64 and u2u64
- pan/midgard: Fix mir_round_bytemask_down for !32b
- pan/midgard: Pack 64-bit swizzles
- pan/midgard: Use generic constant packing for 8/64-bit
- pan/midgard: Implement non-aligned UBOs
- pan/midgard: Expose more typesize helpers
- pan/midgard: Fix masks/alignment for 64-bit loads
- pan/midgard: Represent ld/st offset unpacked
- pan/midgard: Use shader stage in mir_op_computes_derivative
- panfrost: Stub out clover callbacks
- panfrost: Pass kernel inputs as uniforms
- panfrost: Disable tiling for GLOBAL resources
- panfrost: Set PIPE_COMPUTE_CAP_ADDRESS_BITS to 64
- pan/midgard: Introduce quirks checks
- panfrost: Add the lod_bias field
- nir: Add load_sampler_lod_paramaters_pan intrinsic
- pan/midgard: Implement load_sampler_lod_paramaters_pan
- pan/midgard: Add LOD bias/clamp lowering
- pan/midgard: Describe quirk MIDGARD_BROKEN_LOD
- pan/midgard: Enable LOD lowering only on buggy chips
- panfrost: Add lcra.c to Android.mk
- pan/midgard: Use lower_tex_without_implicit_lod
- panfrost: Add information about T720 tiling
- panfrost: Implement pan_tiler for non-hierarchy GPUs
- panfrost: Simplify draw_flags
- pan/midgard: Splatter on fragment out
- gitlab-ci: Remove non-default skips from Panfrost
- panfrost: Remove blend shader hack
- panfrost: Update SET_VALUE with information from igt
- panfrost: Rename SET_VALUE to WRITE_VALUE
- gallium/util: Support POLYGON in u_stream_outputs_for_vertices
- pan/midgard: Move spilling code out of scheduler
- pan/midgard: Split spill node selection/spilling
- pan/midgard: Simplify spillability test
- pan/midgard: Remove spill cost heuristic
- pan/midgard: Move bounds checking into LCRA
- pan/midgard: Remove consecutive_skip code
- pan/midgard: Remove code marked "TODO: remove me"
- pan/midgard: Dynamically allocate r26/27 for spills
- pan/midgard: Use no_spill bitmask
- pan/midgard: Don't use no_spill for memory spill src
- pan/midgard: Force alignment for csel_v
- pan/midgard: Don't try to free NULL in LCRA
- pan/midgard: Simplify and fix vector copyprop
- pan/midgard: Fix shift for TLS access
- panfrost: Describe thread local storage sizing rules
- panfrost: Rename unknown_address_0 -> scratchpad
- panfrost: Split stack_shift nibble from unk0
- panfrost: Add routines to calculate stack size/shift
- panfrost: Factor out panfrost_query_raw
- panfrost: Query core count and thread tls alloc
- panfrost: Route stack_size from compiler
- panfrost: Emit SFBD/MFBD after a batch, instead of before
- panfrost: Handle minor cppcheck issues
- pan/midgard: Remove unused ld/st packing hepers
- pan/midgard: Handle misc. cppcheck warnings
- panfrost: Calculate maximum stack_size per batch
- panfrost: Pass size to panfrost_batch_get_scratchpad
- pandecode: Add cast
- panfrost: Move nir_undef_to_zero to Midgard compiler
- panfrost: Move property queries to \_encoder
- panfrost: Add panfrost_model_name helper
- panfrost: Report GPU name in es2_info
- ci: Remove T760/T860 from CI temporarily
- panfrost: Pass blend RT number through
- pan/midgard: Add schedule barrier after fragment writeout
- pan/midgard: Writeout per render target
- pan/midgard: Fix liveness analysis with multiple epilogues
- pan/midgard: Set r1.w magic
- panfrost: Fix FBD issue
- ci: Reinstate Panfrost CI
- panfrost: Remove fbd_type enum
- panfrost: Pack invocation_shifts manually instead of a bit field
- panfrost: Remove asserts in panfrost_pack_work_groups_compute
- panfrost: Simplify sampler upload condition
- panfrost: Don't double-create scratchpad
- panfrost: Add PAN_MESA_DEBUG=precompile for shader-db
- panfrost: Let precompile imply shaderdb
- panfrost: Handle empty shaders
- pan/midgard: Use a reg temporary for mutiple writes
- pan/midgard: Hoist temporary coordinate for cubemaps
- pan/midgard: Set .shadow for shadow samplers
- pan/midgard: Set Z to shadow comparator for 2D
- pan/midgard: Add uniform/work heuristic
- pan/midgard: Implement textureOffset for 2D textures
- pan/midgard: Fix crash with txs
- pan/midgard: Lower txd with lower_tex
- panfrost: Decode shader types in pantrace shader-db
- pan/decode: Skip COMPUTE in blobber-db
- pan/decode: Prefix blobberdb with MESA_SHADER\_\*
- pan/decode: Append 0:0 spills:fills to blobber-db
- pan/midgard: Fix disassembler cycle/quadword counting
- pan/midgard: Bounds check lcra_restrict_range
- pan/midgard: Extend IS_VEC4_ONLY to arguments
- pan/midgard: Clamp LOD register swizzle
- pan/midgard: Expand swizzle for texelFetch
- pan/midgard: Fix fallthrough from offset to comparator
- pan/midgard: Do witchcraft on texture offsets
- pan/midgard: Generalize temp coordinate to non-2D
- pan/midgard: Implement shadow cubemaps
- pan/midgard: Enable lower_(un)pack\_\* lowering
- pan/midgard: Support loads from R11G11B10 in a blend shader
- pan/midgard: Add mir_upper_override helper
- pan/midgard: Compute destination override
- panfrost: Rename pan_instancing.c -> pan_attributes.c
- panfrost: Factor batch/resource out of instancing routines
- panfrost: Move instancing routines to encoder/
- panfrost: Factor out panfrost_compute_magic_divisor
- panfrost: Fix off-by-one in pan_invocation.c
- pan/decode: Fix reference computation for invocations
- panfrost: Slight cleanup of Gallium's pan_attribute.c
- panfrost: Remove pan_shift_odd
- pan/decode: Handle gl_VertexID/gl_InstanceID
- panfrost: Unset vertex_id_zero_based
- pan/midgard: Factor out emit_attr_read
- pan/midgard: Lower gl_VertexID/gl_InstanceID to attributes
- panfrost: Extend attribute_count for vertex builtins
- panfrost: Route gl_VertexID through cmdstream
- pan/midgard: Fix minor typo
- panfrost: Remove MALI_SPECIAL_ATTRIBUTE_BASE defines
- panfrost: Update information on fixed attributes/varyings
- panfrost: Remove MALI_ATTR_INTERNAL
- panfrost: Inline away MALI_NEGATIVE
- panfrost: Implement remaining texture wrap modes
- panfrost: Add pan_attributes.c to Android.mk
- panfrost: Add missing #include in common header
- panfrost: Remove mali_alt_func
- panfrost; Update comment about work/uniform_count
- panfrost: Remove 32-bit next_job path
- glsl: Set .flat for gl_FrontFacing
- pan/midgard: Promote tilebuffer reads to 32-bit
- pan/midgard: Use type-appropriate st_vary
- pan/midgard: Implement flat shading
- panfrost: Identify glProvokingVertex flag
- panfrost: Disable some CAPs we want lowered
- panfrost: Implement integer varyings
- panfrost: Remove MRT indirection in blend shaders
- panfrost: Respect glPointSize()
- pan/midgard: Convert fragment writeout to proper branches
- pan/midgard: Remove prepacked_branch
- panfrost: Handle RGB16F colour clear
- panfrost: Pack MRT blend shaders into a single BO
- pan/midgard: Fix memory corruption in constant combining
- pan/midgard: Use better heuristic for shader termination
- pan/midgard: Generalize IS_ALU and quadword_size
- pan/midgard: Generate MRT writeout loops
- pan/midgard: Remove old comment
- pan/midgard: Identity ld_color_buffer as 32-bit
- pan/midgard: Use upper ALU tags for MFBD writeout
- panfrost: Texture from Z32F_S8 as R32F
- panfrost: Support rendering to non-zero Z/S layers
- panfrost: Implement sRGB blend shaders
- panfrost: Cleanup tiling selection logic
- panfrost: Report MSAA 4x supported for dEQP
- panfrost: Handle PIPE_FORMAT_R10G10B10A2_USCALED
- panfrost: Respect constant buffer_offset
- panfrost: Adjust for mismatch between hardware/Gallium in arrays/cube
- pan/midgard: Account for z/w flip in texelFetch
- panfrost: Don't double-flip Z/W for 2D arrays
- pan/midgard: Support indirect UBO offsets
- panfrost: Fix linear depth textures
- pan/midgard: Bytemasks should round up, not round down
- panfrost: Identify un/pack colour opcodes
- pan/midgard: Fix recursive csel scheduling
- panfrost: Expose some functionality with dEQP flag
- panfrost: Compile tiling routines with -O3
- panfrost,lima: De-Galliumize tiling routines
- panfrost: Rework linear<--->tiled conversions
- panfrost: Add pandecode entries for ASTC/ETC formats
- panfrost: Fix crash in compute variant allocation
- panfrost: Drop mysterious zero=0xFFFF field
- panfrost: Don't use implicit mali_exception_status enum
- pan/decode: Remove last_size
- pan/midgard: Remove pack_color define
- pan/decode: Remove SHORT_SLIDE indirection
- panfrost: Fix 32-bit warning for \`indices\`
- pan/decode: Drop MFBD compute shader stuff
- pan/midgard: Record TEXTURE_OP_BARRIER
- pan/midgard: Disassemble barrier instructions
- pan/midgard: Validate barriers use a barrier tag
- pan/midgard: Handle tag 0x4 as texture
- pan/midgard: Remove float_bitcast
- pan/midgard: Fix missing prefixes
- pan/midgard: Don't crash with constants on unknown ops
- pan/midgard: Use fprintf instead of printf for constants
Andreas Baierl (14):
- lima: Beautify stream dumps
- lima: Parse VS and PLBU command stream while making a dump
- lima/streamparser: Fix typo in vs semaphore parser
- lima/streamparser: Add findings introduced with gl_PointSize
- lima/parser: Some fixes and cleanups
- lima/parser: Add RSW parsing
- lima/parser: Add texture descriptor parser
- lima: Rotate dump files after each finished pp frame
- lima: Fix dump file creation
- lima/parser: Fix rsw parser
- lima/parser: Fix VS cmd stream parser
- lima/parser: Make rsw alpha blend parsing more readable
- lima: Add stencil support
- lima: Fix alpha blending
Andres Rodriguez (1):
- vulkan/wsi: disable the hardware cursor
Andrii Simiklit (5):
- main: fix several 'may be used uninitialized' warnings
- glsl: fix an incorrect max_array_access after optimization of
ssbo/ubo
- glsl: fix a binding points assignment for ssbo/ubo arrays
- glsl/nir: do not change an element index to have correct block name
- mesa/st: fix a memory leak in get_version
Anthony Pesch (5):
- util: import xxhash
- util: move fnv1a hash implementation into its own header
- util/hash_table: replace \_mesa_hash_data's fnv1a hash function with
xxhash
- util/hash_table: added hash functions for integer types
- util/hash_table: update users to use new optimal integer hash
functions
Anuj Phogat (2):
- intel: Add device info for 1x4x6 Jasper Lake
- intel: Add pci-ids for Jasper Lake
Arno Messiaen (5):
- lima: fix stride in texture descriptor
- lima: add layer_stride field to lima_resource struct
- lima: introduce ppir_op_load_coords_reg to differentiate between
loading texture coordinates straight from a varying vs loading them
from a register
- lima: add cubemap support
- lima/ppir: add lod-bias support
Bas Nieuwenhuizen (33):
- radv: Fix timeout handling in syncobj wait.
- radv: Remove \_mesa_locale_init/fini calls.
- turnip: Remove \_mesa_locale_init/fini calls.
- anv: Remove \_mesa_locale_init/fini calls.
- radv: Fix disk_cache_get size argument.
- radv: Close all unnecessary fds in secure compile.
- radv: Do not change scratch settings while shaders are active.
- radv: Allocate cmdbuffer space for buffer marker write.
- radv: Enable VK_KHR_buffer_device_address.
- amd/llvm: Refactor ac_build_scan.
- radv: Unify max_descriptor_set_size.
- radv: Fix timeline semaphore refcounting.
- radv: Fix RGBX Android<->Vulkan format correspondence.
- amd/common: Fix tcCompatible degradation on Stoney.
- amd/common: Always use addrlib for HTILE tc-compat.
- radv: Limit workgroup size to 1024.
- radv: Expose all sample counts for integer formats as well.
- amd/common: Handle alignment of 96-bit formats.
- nir: Add clone/hash/serialize support for non-uniform tex
instructions.
- nir: print non-uniform tex fields.
- amd/common: Always initialize gfx9 mipmap offset/pitch.
- turnip: Use VK_NULL_HANDLE instead of NULL.
- meson: Enable -Werror=int-conversion.
- Revert "amd/common: Always initialize gfx9 mipmap offset/pitch."
- radv: Only use the gfx mipmap level offset/pitch for linear textures.
- spirv: Fix glsl type assert in spir2nir.
- radv: Emit a BATCH_BREAK when changing pixel shaders or
CB_TARGET_MASK.
- radv: Use new scanout gfx9 metadata flag.
- radv: Disable VK_EXT_sample_locations on GFX10.
- radv: Remove syncobj_handle variable in header.
- radv: Expose VK_KHR_swapchain_mutable_format.
- radv: Allow DCC & TC-compat HTILE with
VK_IMAGE_CREATE_EXTENDED_USAGE_BIT.
- radv: Do not set SX DISABLE bits for RB+ with unused surfaces.
Ben Crocker (1):
- llvmpipe: use ppc64le/ppc64 Large code model for JIT-compiled shaders
Bernd Kuhls (1):
- util/os_socket: Include unistd.h to fix build error
Boris Brezillon (21):
- panfrost: MALI_DEPTH_TEST is actually MALI_DEPTH_WRITEMASK
- panfrost: Destroy the upload manager allocated in
panfrost_create_context()
- panfrost: Release the ctx->pipe_framebuffer ref
- panfrost: Move BO cache related fields to a sub-struct
- panfrost: Try to evict unused BOs from the cache
- gallium: Fix the ->set_damage_region() implementation
- panfrost: Make sure we reset the damage region of RTs at flush time
- panfrost: Remove unneeded phi nodes
- panfrost/midgard: Fix swizzle for store instructions
- panfrost/midgard: Print the actual source register for store
operations
- panfrost/midgard: Use a union to manipulate embedded constants
- panfrost/midgard: Rework mir_adjust_constants() to make it type/size
agnostic
- panfrost/midgard: Make sure promote_fmov() only promotes 32-bit imovs
- panfrost/midgard: Factorize f2f and u2u handling
- panfrost/midgard: Add f2f64 support
- panfrost/midgard: Fix mir_print_instruction() for branch instructions
- panfrost/midgard: Add 64 bits float <-> int converters
- panfrost/midgard: Add missing lowering passes for type/size
conversion ops
- panfrost/midgard: Add a condense_writemask() helper
- panfrost/midgard: Prettify embedded constant prints
- panfrost: Fix the damage box clamping logic
Brian Ho (14):
- turnip: Update tu_query_pool with turnip-specific fields
- turnip: Implement vkCreateQueryPool for occlusion queries
- turnip: Implement vkCmdBeginQuery for occlusion queries
- turnip: Implement vkCmdEndQuery for occlusion queries
- turnip: Update query availability on render pass end
- turnip: Implement vkGetQueryPoolResults for occlusion queries
- turnip: Implement vkCmdResetQueryPool
- turnip: Implement vkCmdCopyQueryPoolResults for occlusion queries
- anv: Properly fetch partial results in vkGetQueryPoolResults
- anv: Handle unavailable queries in vkCmdCopyQueryPoolResults
- turnip: Enable occlusionQueryPrecise
- turnip: Free event->bo on vkDestroyEvent
- turnip: Fix vkGetQueryPoolResults with available flag
- turnip: Fix vkCmdCopyQueryPoolResults with available flag
Brian Paul (4):
- s/APIENTRY/GLAPIENTRY/ in teximage.c
- nir: fix a couple signed/unsigned comparison warnings in
nir_builder.h
- Call shmget() with permission 0600 instead of 0777
- nir: no-op C99 \_Pragma() with MSVC
C Stout (1):
- util/vector: Fix u_vector_foreach when head rolls over
Caio Marcelo de Oliveira Filho (24):
- spirv: Don't leak GS initialization to other stages
- glsl: Check earlier for MaxShaderStorageBlocks and MaxUniformBlocks
- glsl: Check earlier for MaxTextureImageUnits and MaxImageUniforms
- anv: Initialize depth_bounds_test_enable when not explicitly set
- spirv: Consider the sampled_image case in wa_glslang_179 workaround
- intel/fs: Lower 64-bit MOVs after lower_load_payload()
- intel/fs: Fix lowering of dword multiplication by 16-bit constant
- intel/vec4: Fix lowering of multiplication by 16-bit constant
- anv/gen12: Temporarily disable VK_KHR_buffer_device_address (and EXT)
- spirv: Implement SPV_KHR_non_semantic_info
- panfrost: Fix Makefile.sources
- anv: Drop unused function parameter
- anv: Ignore some CreateInfo structs when rasterization is disabled
- intel/fs: Only use SLM fence in compute shaders
- spirv: Drop EXT for PhysicalStorageBuffer symbols
- spirv: Handle PhysicalStorageBuffer in memory barriers
- nir: Add missing nir_var_mem_global to various passes
- intel/fs: Add FS_OPCODE_SCHEDULING_FENCE
- intel/fs: Add workgroup_size() helper
- intel/fs: Don't emit fence for shared memory if only one thread is
used
- intel/fs: Don't emit control barrier if only one thread is used
- anv: Always initialize target_stencil_layout
- intel/compiler: Add names for SHADER_OPCODE_[IU]SUB_SAT
- nir: Make nir_deref_path_init skip trivial casts
Chris Wilson (1):
- egl: Mention if swrast is being forced
Christian Gmeiner (24):
- drm-shim: fix EOF case
- etnaviv: rs: upsampling is not supported
- etnaviv: add drm-shim
- etnaviv: drop not used config_out function param
- etnaviv: use a more self-explanatory param name
- etnaviv: handle 8 byte block in tiling
- etnaviv: add support for extended pe formats
- etnaviv: fix integer vertex formats
- etnaviv: use NORMALIZE_SIGN_EXTEND
- etnaviv: fix R10G10B10A2 vertex format entries
- etnaviv: handle integer case for GENERIC_ATTRIB_SCALE
- etnaviv: remove dead code
- etnaviv: remove not used etna_bits_ones(..)
- etnaviv: drop compiled_rs_state forward declaration
- etnaviv: update resource status after flushing
- gallium: add PIPE_CAP_MAX_VERTEX_BUFFERS
- etnaviv: check if MSAA is supported
- etnaviv: gc400 does not support any vertex sampler
- etnaviv: use a better name for FE_VERTEX_STREAM_UNK14680
- etnaviv: move state based texture structs
- etnaviv: move descriptor based texture structs
- etnaviv: add deqp debug option
- etnaviv: drop default state for PE_STENCIL_CONFIG_EXT2
- etnaviv: drm-shim: add GC400
Connor Abbott (19):
- nir: Fix non-determinism in lower_global_vars_to_local
- radv: Rename ac_arg_regfile
- ac: Add a shared interface between radv, radeonsi, LLVM and ACO
- ac/nir, radv, radeonsi: Switch to using ac_shader_args
- radv: Move argument declaration out of nir_to_llvm
- aco: Constify radv_nir_compiler_options in isel
- aco: Use radv_shader_args in aco_compile_shader()
- aco: Split vector arguments at the beginning
- aco: Make num_workgroups and local_invocation_ids one argument each
- radv: Replace supports_spill with explict_scratch_args
- aco: Use common argument handling
- aco: Make unused workgroup id's 0
- nir: Maintain the algebraic automaton's state as we work.
- a6xx: Add more CP packets
- freedreno: Use new macros for CP_WAIT_REG_MEM and CP_WAIT_MEM_GTE
- freedreno: Fix CP_MEM_TO_REG flag definitions
- freedreno: Document CP_COND_REG_EXEC more
- freedreno: Document CP_UNK_A6XX_55
- freedreno: Document CP_INDIRECT_BUFFER_CHAIN
Daniel Ogorchock (2):
- panfrost: Fix panfrost_bo_access memory leak
- panfrost: Fix headers and gpu_headers memory leak
Daniel Schürmann (58):
- aco: fix immediate offset for spills if scratch is used
- aco: only use single-dword loads/stores for spilling
- aco: fix accidential reordering of instructions when scheduling
- aco: workaround Tonga/Iceland hardware bug
- aco: fix invalid access on Pseudo_instructions
- aco: preserve kill flag on moved operands during RA
- aco: rematerialize s_movk instructions
- aco: check if SALU instructions are predeceeded by exec when
calculating WQM needs
- aco: value number instructions using the execution mask
- aco: use s_and_b64 exec to reduce uniform booleans to one bit
- amd/llvm: Add Subgroup Scan functions for SI
- radv: Enable Subgroup Arithmetic and Clustered for SI
- aco: don't value-number instructions from within a loop with ones
after the loop.
- aco: don't split live-ranges of linear VGPRs
- aco: fix a couple of value numbering issues
- aco: refactor visit_store_fs_output() to use the Builder
- aco: Initial GFX7 Support
- aco: SI/CI - fix sampler aniso
- aco: fix SMEM offsets for SI/CI
- aco: implement nir_op_fquantize2f16 for SI/CI
- aco: only use scalar loads for readonly buffers on SI/CI
- aco: implement nir_op_isign on SI/CI
- aco: move buffer_store data to VGPR if needed
- aco: implement quad swizzles for SI/CI
- aco: recognize SI/CI SMRD hazards
- aco: fix disassembly of writelane instructions.
- aco: split read/writelane opcode into VOP2/VOP3 version for SI/CI
- aco: implement 64bit VGPR shifts for SI/CI
- aco: make 1/2*PI a literal constant on SI/CI
- aco: implement 64bit i2b for SI /CI
- aco: implement 64bit ine/ieq for SI/CI
- aco: disable disassembly for SI/CI due to lack of support by LLVM
- radv: only flush scalar cache for SSBO writes with ACO on GFX8+
- aco: flush denorms after fmin/fmax on pre-GFX9
- aco: don't use a scalar temporary for reductions on GFX10
- aco: implement (clustered) reductions for SI/CI
- aco: implement inclusive_scan for SI/CI
- aco: implement exclusive scan for SI/CI
- radv: disable Youngblood app profile if ACO is used
- aco: return to loop_active mask at continue_or_break blocks
- radv: Enable ACO on GFX7 (Sea Islands)
- aco: use soffset for MUBUF instructions on SI/CI
- aco: improve readfirstlane after uniform ssbo loads on GFX7
- aco: propagate temporaries into expanded vectors
- nir: fix printing of var_decl with more than 4 components.
- aco: compact various Instruction classes
- aco: compact aco::span<T> to use uint16_t offset and size instead of
pointer and size_t.
- aco: fix unconditional demote_to_helper
- aco: rework lower_to_cssa()
- aco: handle phi affinities transitively through parallelcopies
- aco: ignore parallelcopies to the same register on jump threading
- aco: fix combine_salu_not_bitwise() when SCC is used
- aco: reorder VMEM operands in ACO IR
- aco: fix register allocation with multiple live-range splits
- aco: simplify adjust_sample_index_using_fmask() & get_image_coords()
- aco: simplify gathering of MIMG address components
- docs: add new features for RADV/ACO.
- aco: fix image_atomic_cmp_swap
Daniel Stone (2):
- Revert "st/dri: do FLUSH_VERTICES before calling flush_resource"
- Revert "gallium: add st_context_iface::flush_resource to call
FLUSH_VERTICES"
Danylo Piliaiev (12):
- intel/blorp: Fix usage of uninitialized memory in key hashing
- i965/program_cache: Lift restriction on shader key size
- intel/blorp: Fix usage of uninitialized memory in key hashing
- intel/fs: Do not lower large local arrays to scratch on gen7
- i965: Unify CC_STATE and BLEND_STATE atoms on Haswell as a workaround
- glsl: Add varyings to "zero-init of uninitialized vars" workaround
- drirc: Add glsl_zero_init workaround for GpuTest
- iris/query: Implement PIPE_QUERY_GPU_FINISHED
- iris: Fix value of out-of-bounds accesses for vertex attributes
- i965: Do not set front_buffer_dirty if there is no front buffer
- st/mesa: Handle the rest renderbuffer formats from OSMesa
- st/nir: Unify inputs_read/outputs_written before serializing NIR
Dave Airlie (74):
- nir/serialize: pack function has name and entry point into flags.
- nir/serialize: fix serializing functions with no implementations.
- spirv: don't store 0 to cs.ptr_size for non kernel stages.
- spirv: get the correct type for function returns.
- spirv/nir/opencl: handle some multiply instructions.
- nir: add 64-bit ufind_msb lowering support. (v2)
- nouveau: request ufind_msb64 lowering in the frontend.
- vtn/opencl: add clz support
- nir: fix deref offset builder
- llvmpipe: initial query buffer object support. (v2)
- docs: add llvmpipe to ARB_query_buffer_object.
- gallivm: split out the flow control ir to a common file.
- gallivm: nir->tgsi info convertor (v2)
- gallivm: add popcount intrinsic wrapper
- gallivm: add cttz wrapper
- gallivm: add selection for non-32 bit types
- gallivm: add nir->llvm translation (v2)
- draw: add nir info gathering and building support
- gallium: add nir lowering passes for the draw pipe stages. (v2)
- gallivm: add swizzle support where one channel isn't defined.
- llvmpipe: add initial nir support
- nir/samplers: don't zero samplers_used/txf.
- llvmpipe/images: handle undefined atomic without crashing
- gallivm/llvmpipe: add support for front facing in sysval.
- llvmpipe: enable texcoord semantics
- gallium/scons: fix graw-xlib build on OSX.
- llvmpipe: add queries disabled flag
- llvmpipe: disable occlusion queries when requested by state tracker
- draw: add support for collecting primitives generated outside
streamout
- llvmpipe: enable support for primitives generated outside streamout
- aco: handle gfx7 int8/10 clamping on exports
- gallivm: add bitfield reverse and ufind_msb
- llvmpipe/nir: handle texcoord requirements
- gallivm: fix transpose for when first channel isn't created
- gallivm: fix perspective enable if usage_mask doesn't have 0 bit set
- gallivm/nir: cleanup code and call cmp wrapper
- gallivm/nir: copy compare ordering code from tgsi
- gallivm: add base instance sysval support
- gallivm/draw: add support for draw_id system value.
- gallivm: fixup base_vertex support
- llvmpipe: enable ARB_shader_draw_parameters.
- vtn: convert vload/store to single value loops
- vtn/opencl: add shuffle/shuffle support
- gallivm/nir: wrap idiv to avoid divide by 0 (v2)
- llvmpipe: switch to NIR by default
- nir: sanitize work group intrinsics to always be 32-bit.
- gallivm: add 64-bit const int creator.
- llvmpipe/gallivm: add kernel inputs
- gallivm: add support for 8-bit/16-bit integer builders
- gallivm: pick integer builders for alu instructions.
- gallivm/nir: allow 8/16-bit conversion and comparison.
- tgsi/mesa: handle KERNEL case
- gallivm/llvmpipe: add support for work dimension intrinsic.
- gallivm/llvmpipe: add support for block size intrinsic
- gallivm/llvmpipe: add support for global operations.
- llvmpipe: handle serialized nir as a shader type.
- llvmpipe: add support for compute shader params
- llvmpipe/nir: use nir_max_vec_components in more places
- gallivm: handle non-32 bit undefined
- llvmpipe: lower hadd/add_sat
- gallivm/nir: lower packing
- gallivm/nir: add vec8/16 support
- llvmpipe: add debug option to enable OpenCL support.
- gallivm: fixup const int64 builder.
- llvmpipe: enable ARB_shader_group_vote.
- gallium/util: add multi_draw_indirect to util_draw_indirect.
- llvmpipe: enable driver side multi draw indirect
- llvmpipe: add support for ARB_indirect_parameters.
- llvmpipe: add ARB_derivative_control support
- gallivm: fix gather component handling.
- llvmpipe: fix some integer instruction lowering.
- galllivm: fix gather offset casting
- gallivm: fix find lsb
- gallivm/nir: add missing break for isub.
David Heidelberg (1):
- .mailmap: use correct email address
David Stevens (1):
- virgl: support emulating planar image sampling
Denis Pauk (2):
- gallium/swr: Enable support bptc format.
- docs/features: mark GL_ARB_texture_compression_bptc as done for
llvmpipe, softpipe, swr
Dongwon Kim (3):
- gallium: enable INTEL_PERFORMANCE_QUERY
- iris: INTEL performance query implementation
- gallium: check all planes' pipe formats in case of multi-samplers
Drew Davenport (1):
- radeonsi: Clear uninitialized variable
Drew DeVault (1):
- st_get_external_sampler_key: improve error message
Duncan Hopkins (1):
- zink: make sure src image is transfer-src-optimal
Dylan Baker (69):
- Bump VERSION to 20.0.0-devel
- docs/new_features: Empty the feature list for the 20.0 cycle
- nir: correct use of identity check in python
- r200: use preprocessor for big vs little endian checks
- r100: Use preprocessor to select big vs little endian paths
- dri/osmesa: use preprocessor for selecting endian code paths
- util/u_endian: Use \_WIN32 instead of \_MSC_VER
- util/u_endian: set PIPE_ARCH_*_ENDIAN to 1
- mesa/main: replace uses of \_mesa_little_endian with preprocessor
- mesa/swrast: replace instances of \_mesa_little_endian with
preprocessor
- mesa/main: delete now unused \_mesa_little_endian
- gallium/osmesa: Use PIPE_ARCH_*_ENDIAN instead of little_endian
function
- util: rename PIPE_ARCH_*_ENDIAN to UTIL_ARCH_*_ENDIAN
- util/u_endian: Add error checks
- meson: Add dep_glvnd to egl deps when building with glvnd
- docs: add release notes for 19.2.3
- docs: add sha256 sum to 19.2.3 release notes
- docs: update calendar, add news item and link release notes for
19.2.2
- meson: gtest needs pthreads
- gallium/osmesa: Convert osmesa test to gtest
- osmesa/tests: Extend render test to cover other working cases
- util: Use ZSTD for shader cache if possible
- docs: Add release notes for 19.2.4
- docs: Add SHA256 sum for for 19.2.4
- docs: update calendar, add news item and link release notes for
19.2.4
- docs: Add relnotes for 19.2.5
- docs/relnotes/19.2.5: Add SHA256 sum
- docs: update calendar, add news item and link release notes for
19.2.5
- docs/release-calendar: Update for extended 19.3 rc period
- docs: Add release notes for 19.2.6
- docs: Add SHA256 sum for 19.2.6
- docs: update calendar, add news item and link release notes for
19.2.6
- gallium/auxiliary: Fix uses of gnu struct = {} extension
- meson: Add -Werror=gnu-empty-initializer to MSVC compat args
- docs: Add release notes for 19.2.7
- docs: Add SHA256 sums for 19.2.7
- docs: update calendar, add news item and link release notes for
19.2.7
- docs: Update mesa 19.3 release calendar
- meson/broadcom: libbroadcom_cle needs expat headers
- meson/broadcom: libbroadcom_cle also needs zlib
- docs: add release notes for 19.3.0
- docs/19.3.0: Add SHA256 sums
- docs: Update release notes, index, and calendar for 19.3.0
- dcos: add releanse notes for 19.3.1
- docs: Add release notes, update calendar, and add news for 19.3.1
- docs: add relnotes for 19.2.8
- docs/relnotes/19.2.8: Add SHA256 sum
- docs: Add release notes, news, and update calendar for 19.2.8
- docs: Add release notes for 19.3.2
- docs: add SHA256 sums for 19.3.2
- docs: Add release notes for 19.3.2, update calendar and home page
- docs: Update release calendar for 20.0
- docs: Add relnotes for 19.3.3 release
- docs: Add SHA 256 sums for 19.3.3
- docs: update news, calendar, and link release notes for 19.3.3
- VERSION: bump to 20.0.0-rc1
- bin/pick-ui: Add a new maintainer script for picking patches
- .pick_status.json: Update to 0d14f41625fa00187f690f283c1eb6a22e354a71
- .pick_status.json: Update to b550b7ef3b8d12f533b67b1a03159a127a3ff34a
- .pick_status.json: Update to 9afdcd64f2c96f3fcc1a28912987f2e8066aa995
- .pick_status.json: Update to 7eaf21cb6f67adbe0e79b80b4feb8c816a98a720
- VERSION: bump to 20.0-rc2
- .pick_status.json: Update to d8bae10bfe0f487dcaec721743cd51441bcc12f5
- .pick_status.json: Update to 689817c9dfde9a0852f2b2489cb0fa93ffbcb215
- .pick_status.json: Update to 23037627359e739c42b194dec54875aefbb9d00b
- VERSION: bump for 20.0.0-rc3
- .pick_status.json: Update to 2a98cf3b2ecea43cea148df7f77d2abadfd1c9db
- .pick_status.json: Update to 946eacbafb47c8b94d47e7c9d2a8b02fff5a22fa
- .pick_status.json: Update to bee5c9b0dc13dbae0ccf124124eaccebf7f2a435
Eduardo Lima Mitev (2):
- turnip: Remove failed command buffer from pool
- turnip: Fix issues in tu_compute_pipeline_create() that may lead to
crash
Elie Tournier (4):
- Docs: remove duplicate meson docs for windows
- docs: fix ascii html representation
- nir/algebraic: i2f(f2i()) -> trunc()
- nir/algebraic: sqrt(x)*sqrt(x) -> fabs(x)
Emmanuel Gil Peyrot (1):
- intel/compiler: Return early if read() failed
Eric Anholt (102):
- ci: Make lava inherit the ccache setup of the .build script.
- ci: Switch over to an autoscaling GKE cluster for builds.
- Revert "ci: Switch over to an autoscaling GKE cluster for builds."
- mesa/st: Add mapping of MESA_FORMAT_RGB_SNORM16 to gallium.
- gallium: Add defines for FXT1 texture compression.
- gallium: Add some more channel orderings of packed formats.
- gallium: Add an equivalent of MESA_FORMAT_BGR_UNORM8.
- gallium: Add equivalents of packed MESA_FORMAT_*UINT formats.
- mesa: Stop defining a full separate format for RGBA_UINT8.
- mesa/st: Test round-tripping of all compressed formats.
- mesa: Prepare for the MESA_FORMAT\_\* enum to be sparse.
- mesa: Redefine MESA_FORMAT\_\* in terms of PIPE_FORMAT_*.
- mesa/st: Gut most of st_mesa_format_to_pipe_format().
- mesa/st: Make st_pipe_format_to_mesa_format an effective no-op.
- u_format: Fix swizzle of A1R5G5B5.
- ci: Use several debian buster packages instead of hand-building.
- ci: Make the skip list regexes match the full test name.
- ci: Use cts_runner for our dEQP runs.
- ci: Enable all of GLES3/3.1 testing for softpipe.
- ci: Remove old commented copy of freedreno artifacts.
- ci: Disable flappy blit tests on a630.
- ci: Expand the freedreno blit skip regex to cover more cases.
- util: Move gallium's PIPE_FORMAT utils to /util/format/
- mesa: Move compile of common Mesa core files to a static lib.
- mesa/st: Simplify st_choose_matching_format().
- mesa: Don't put sRGB formats in the array format table.
- mesa/st: Reuse st_choose_matching_format from st_choose_format().
- util: Add a mapping from VkFormat to PIPE_FORMAT.
- turnip: Drop the copy of the formats table.
- ci: Move freedreno's parallelism to the runner instead of gitlab-ci
jobs.
- ci: Use a tag from the parallel-deqp-runner repo.
- nir: Add a scheduler pass to reduce maximum register pressure.
- nir: Refactor algebraic's block walk
- nir: Make algebraic backtrack and reprocess after a replacement.
- freedreno: Introduce a fd_resource_layer_stride() helper.
- freedreno: Introduce a fd_resource_tile_mode() helper.
- freedreno: Introduce a resource layout header.
- freedreno: Convert the slice struct to the new resource header.
- freedreno/a6xx: Log the tiling mode in resource layout debug.
- turnip: Disable timestamp queries for now.
- turnip: Fix unused variable warnings.
- turnip: Drop redefinition of VALIDREG now that it's in ir3.h.
- turnip: Reuse tu6_stage2opcode() more.
- turnip: Add basic SSBO support.
- turnip: Refactor the graphics pipeline create implementation.
- turnip: Add a helper function for getting tu_buffer iovas.
- turnip: Sanity check that we're adding valid BOs to the list.
- turnip: Move pipeline BO list adding to BindPipeline.
- turnip: Add support for compute shaders.
- ci: Disable egl_ext_device_drm tests in piglit.
- freedreno: Enable texture upload memory throttling.
- freedreno: Stop forcing ALLOW_MAPPED_BUFFERS_DURING_EXEC off.
- freedreno: Track the set of UBOs to be uploaded in UBO analysis.
- freedreno: Drop the extra offset field for mipmap slices.
- freedreno: Refactor the UBWC flags registers emission.
- freedreno: Move UBWC layout into a slices array like the non-UBWC
slices.
- tu: Move our image layout into a freedreno_layout struct.
- freedreno: Move a6xx's setup_slices() to a shareable helper function.
- freedreno: Switch the 16-bit workaround to match what turnip does.
- tu: Move UBWC layout into fdl6_layout() and use that function.
- turnip: Lower usub_borrow.
- turnip: Drop unused variable.
- turnip: Add support for descriptor arrays.
- turnip: Fix support for immutable samplers.
- ci: Fix caselist results archiving after parallel-deqp-runner rename.
- mesa: Fix detection of invalidating both depth and stencil.
- mesa/st: Deduplicate the NIR uniform lowering code.
- mesa/st: Move the vec4 type size function into core GLSL types.
- mesa/prog: Reuse count_vec4_slots() from ir_to_mesa.
- mesa/st: Move the dword slot counting function to glsl_types as well.
- i965: Reuse the new core glsl_count_dword_slots().
- nir: Fix printing of ~0 .locations.
- turnip: Refactor linkage state setup.
- mesa: Make atomic lowering put atomics above SSBOs.
- gallium: Pack the atomic counters just above the SSBOs.
- nir: Drop the ssbo_offset to atomic lowering.
- compiler: Add a note about how num_ssbos works in the program info.
- freedreno: Stop scattered remapping of SSBOs/images to IBOs.
- radeonsi: Remove a bunch of default handling of pipe caps.
- r600: Remove a bunch of default handling of pipe caps.
- r300: Remove a bunch of default handling of pipe caps.
- radeonsi: Drop PIPE_CAP_TGSI_ANY_REG_AS_ADDRESS.
- turnip: Fix some whitespace around binary operators.
- turnip: Refactor the intrinsic lowering.
- turnip: Add limited support for storage images.
- turnip: Disable UBWC on images used as storage images.
- turnip: Add support for non-zero (still constant) UBO buffer indices.
- turnip: Add support for uniform texel buffers.
- freedreno/ir3: Plumb the ir3_shader_variant into legalize.
- turnip: Add support for fine derivatives.
- turnip: Fix execution of secondary cmd bufs with nothing in primary.
- freedreno: Add some missing a6xx address declarations.
- freedreno: Fix OUT_REG() on address regs without a .bo supplied.
- turnip: Port krh's packing macros from freedreno to tu.
- turnip: Convert renderpass setup to the new register packing macros.
- turnip: Convert the rest of tu_cmd_buffer.c over to the new pack
macros.
- vulkan/wsi: Fix compiler warning when no WSI platforms are enabled.
- iris: Silence warning about AUX_USAGE_MC.
- mesa/st: Fix compiler warnings from INTEL_shader_integer_functions.
- ci: Enable -Werror on the meson-i386 build.
- tu: Fix binning address setup after pack macros change.
- Revert "gallium: Fix big-endian addressing of non-bitmask array
formats."
Eric Engestrom (58):
- meson: split out idep_xmlconfig_headers from idep_xmlconfig
- anv: add missing xmlconfig headers dependency
- radv: drop unnecessary xmlpool_options_h
- pipe-loader: drop unnecessary xmlpool_options_h
- loader: replace xmlpool_options_h with idep_xmlconfig_headers
- targets/omx: replace xmlpool_options_h with idep_xmlconfig_headers
- targets/va: replace xmlpool_options_h with idep_xmlconfig_headers
- targets/vdpau: replace xmlpool_options_h with idep_xmlconfig_headers
- targets/xa: replace xmlpool_options_h with idep_xmlconfig_headers
- targets/xvmc: replace xmlpool_options_h with idep_xmlconfig_headers
- dri: replace xmlpool_options_h with idep_xmlconfig_headers
- i915: replace xmlpool_options_h with idep_xmlconfig_headers
- nouveau: replace xmlpool_options_h with idep_xmlconfig_headers
- r200: replace xmlpool_options_h with idep_xmlconfig_headers
- radeon: replace xmlpool_options_h with idep_xmlconfig_headers
- meson: move idep_xmlconfig_headers to xmlpool/
- gitlab-ci: build a recent enough version of GLVND (ie. 1.2.0)
- meson: require glvnd 1.2.0
- meson: revert glvnd workaround
- meson: add variable to control the symbols checks
- meson: move the generic symbols check arguments to a common variable
- meson: add windows support to symbols checks
- meson: require \`nm\` again on Unix systems
- mesa/imports: let the build system detect strtok_r()
- egl: fix \_EGL_NATIVE_PLATFORM fallback
- egl: move #include of local headers out of Khronos headers
- gitlab-ci: build libdrm using meson instead of autotools
- gitlab-ci: auto-cancel CI runs when a newer commit is pushed to the
same branch
- CL: sync C headers with Khronos
- CL: sync C++ headers with Khronos
- vulkan: delete typo'd header
- egl: use EGL_CAST() macro in eglmesaext.h
- anv: add missing "fall-through" annotation
- vk_util: drop duplicate formats in vk_format_map[]
- meson: drop duplicate \`lib\` prefix on libiris_gen\*
- meson: drop \`intel_\` prefix on imgui_core
- docs: reword a bit and list HTTPS before FTP
- intel: add mi_builder_test for gen12
- intel/compiler: add ASSERTED annotation to avoid "unused variable"
warning
- intel/compiler: replace \`0\` pointer with \`NULL\`
- util/simple_mtx: don't set the canary when it can't be checked
- anv: drop unused #include
- travis: autodetect python version instead of hard-coding it
- util/format: remove left-over util_format_description_table
declaration
- util/format: add PIPE_FORMAT_ASTC_*x*x*_SRGB to
util_format_{srgb,linear}()
- util/format: add trivial srgb<->linear conversion test
- u_format: move format tests to util/tests/
- amd: fix empty-body issues
- nine: fix empty-body-issues
- meson: simplify install_megadrivers.py invocation
- mesa: avoid returning a value in a void function
- meson: use github URL for wraps instead of completely unreliable
wrapdb
- egl: drop confusing mincore() error message
- llvmpipe: drop LLVM < 3.4 support
- util/atomic: fix return type of p_atomic_add_return() fallback
- util/os_socket: fix header unavailable on windows
- freedreno/perfcntrs: fix fd leak
- util/disk_cache: check for write() failure in the zstd path
Erico Nunes (17):
- lima: fix nir shader memory leak
- lima: fix bo submit memory leak
- lima/ppir: enable lower_fdph
- gallium/util: add alignment parameter to util_upload_index_buffer
- lima: allocate separate bo to store varyings
- lima: refactor indexed draw indices upload
- vc4: move the draw splitting routine to shared code
- lima: split draw calls on 64k vertices
- lima/ppir: fix lod bias src
- lima/ppir: remove assert on ppir_emit_tex unsupported feature
- lima: set shader caps to optimize control flow
- lima/ppir: remove orphan load node after cloning
- lima/ppir: implement full liveness analysis for regalloc
- lima/ppir: handle write to dead registers in ppir
- lima/ppir: fix ssa undef emit
- lima/ppir: split ppir_op_undef into undef and dummy again
- lima/ppir: fix src read mask swizzling
Erik Faye-Lund (82):
- zink: heap-allocate samplers objects
- zink: emit line-width when using polygon line-mode
- anv: remove incorrect polygonMode=point early-out
- zink: use actual format for render-pass
- zink: always allow mutating the format
- zink: do not advertize coherent mapping
- zink: disable fragment-shader texture-lod
- zink: transition resources before resolving
- zink: always allow sampling of images
- zink: use u_blitter when format-reinterpreting
- zink/spirv: drop temp-array for component-count
- zink/spirv: support loading bool constants
- zink/spirv: implement bany_fnequal[2-4]
- zink/spirv: implement bany_inequal[2-4]
- zink/spirv: implement ball_iequal[2-4]
- zink/spirv: implement ball_fequal[2-4]
- zink: do advertize integer support in shaders
- zink/spirv: add support for nir_op_flrp
- zink: correct depth-stencil format
- nir: patch up deref-vars when lowering clip-planes
- zink: always allow transfer to/from buffers
- zink: implement buffer-to-buffer copies
- zink: remove no-longer-needed hack
- zink: move format-checking to separate source
- zink: move filter-helper to separate helper-header
- zink: move blitting to separate source
- zink: move drawing separate source
- st/mesa: unmap pbo after updating cache
- zink: use true/false instead of TRUE/FALSE
- zink: reject invalid sample-counts
- zink: fix crash when restoring sampler-states
- zink: delete query rather than allocating a new one
- zink: do not try to destroy NULL-fence
- zink: handle calloc-failure
- zink: avoid NULL-deref
- zink: avoid NULL-deref
- zink: avoid NULL-deref
- zink: error-check right variable
- zink: silence coverity error
- zink: enable PIPE_CAP_MIXED_COLORBUFFER_FORMATS
- zink: implement nir_texop_txd
- zink: implement txf
- zink: implement some more trivial opcodes
- zink: simplify front-face type
- zink: factor out builtin-var creation
- zink: implement load_vertex_id
- zink: use nir_fmul_imm
- zink: remove unused code-path in lower_pos_write
- nir/zink: move clip_halfz-lowering to common code
- etnaviv: use nir_lower_clip_halfz instead of open-coding
- st/mesa: use uint-samplers for sampling stencil buffers
- zink: fixup initialization of operand_mask / num_extra_operands
- util: initialize float-array with float-literals
- st/wgl: eliminate implicit cast warning
- gallium: fix a warning
- mesa/st: use float literals
- docs: fix typo in html tag name
- docs: fix paragraphs
- docs: open paragraph before closing it
- docs: use code-tag instead of pre-tag
- docs: use code-tags instead of pre-tags
- docs: use code-tags instead of pre-tags
- docs: move paragraph closing tag
- docs: remove double-closed definition-list
- docs: do not double-close link tag
- docs: do not use definition-list for sub-topics
- docs: use figure/figcaption instead of tables
- docs: remove trailing header
- docs: remove leading spaces
- docs: remove trailing newlines
- docs: use [1] instead of asterisk for footnote
- docs: remove pointless, stray newline
- docs: fixup indentation
- zink: implement nir_texop_txs
- zink: support offset-variants of texturing
- zink: avoid incorrect vector-construction
- zink: store image-type per texture
- zink: support sampling non-float textures
- zink: support arrays of samplers
- zink: set compareEnable when setting compareOp
- st/mesa: use uint-result for sampling stencil buffers
- Revert "nir: Add a couple trivial abs optimizations"
Florian Will (1):
- radv/winsys: set IB flags prior to submit in the sysmem path
Francisco Jerez (26):
- glsl: Fix software 64-bit integer to 32-bit float conversions.
- intel/fs/gen11+: Handle ROR/ROL in lower_simd_width().
- intel/fs/gen8+: Fix r127 dst/src overlap RA workaround for EOT
message payload.
- intel/fs: Fix nir_intrinsic_load_barycentric_at_sample for SIMD32.
- intel/fs/cse: Fix non-deterministic behavior due to inaccurate
liveness calculation.
- intel/fs: Make implied_mrf_writes() an fs_inst method.
- intel/fs: Try to vectorize header setup in lower_load_payload().
- intel/fs: Generalize fs_reg::is_contiguous() to register files other
than VGRF.
- intel/fs: Rework fs_inst::is_copy_payload() into multiple
classification helpers.
- intel/fs: Extend copy propagation dataflow analysis to copies with
FIXED_GRF source.
- intel/fs: Add partial support for copy-propagating FIXED_GRFs.
- intel/fs: Add support for copy-propagating a block of multiple
FIXED_GRFs.
- intel/fs: Allow limited copy propagation of a LOAD_PAYLOAD into
another.
- intel/fs/gen4-6: Allocate registers from aligned_pairs_class based on
LINTERP use.
- intel/fs/gen6: Constrain barycentric source of LINTERP during bank
conflict mitigation.
- intel/fs/gen6: Generalize aligned_pairs_class to SIMD16 aligned
barycentrics.
- intel/fs/gen6: Use SEL instead of bashing thread payload for unlit
centroid workaround.
- intel/fs: Split fetch_payload_reg() into separate helper for
barycentrics.
- intel/fs: Introduce barycentric layout lowering pass.
- intel/fs: Switch to standard vector layout for barycentrics at
optimization time.
- intel/fs/cse: Make HALT instruction act as CSE barrier.
- intel/fs/gen7: Fix fs_inst::flags_written() for
SHADER_OPCODE_FIND_LIVE_CHANNEL.
- intel/fs: Add virtual instruction to load mask of live channels into
flag register.
- intel/fs/gen12: Workaround unwanted SEND execution due to broken
NoMask control flow.
- intel/fs/gen12: Fixup/simplify SWSB annotations of SIMD32 scratch
writes.
- intel/fs/gen12: Workaround data coherency issues due to broken NoMask
control flow.
Fritz Koenig (1):
- freedreno: reorder format check
Georg Lehmann (3):
- Correctly wait in the fragment stage until all semaphores are
signaled
- Vulkan Overlay: Don't try to change the image layout to present twice
- Vulkan overlay: use the corresponding image index for each swapchain
Gert Wollny (12):
- r600: Disable eight bit three channel formats
- virgl: Increase the shader transfer buffer by doubling the size
- gallium/tgsi_from_mesa: Add 'extern "C"' to be able to include from
C++
- nir: make nir_get_texture_size/lod available outside nir_lower_tex
- gallium: tgsi_from_mesa - handle VARYING_SLOT_FACE
- r600: Add functions to dump the shader info
- r600: Make it possible to include r600_asm.h in a C++ file
- r600/sb: Correct SB disassambler for better debugging
- r600: Fix maximum line width
- r600: Make SID and unsigned value
- r600: Delete vertex buffer only if there is actually a shader state
- mesa/st: glsl_to_nir: don't lower atomics to SSBOs if driver supports
HW atomics
Guido Günther (2):
- etnaviv: drm: Don't miscalculate timeout
- freedreno/drm: Don't miscalculate timeout
Gurchetan Singh (11):
- drirc: set allow_higher_compat_version for Faster Than Light
- virgl/drm: update UAPI
- teximage: split out helper from EGLImageTargetTexture2DOES
- glapi / teximage: implement EGLImageTargetTexStorageEXT
- dri_util: add driImageFormatToSizedInternalGLFormat function
- i965: track if image is created by a dmabuf
- i965: refactor intel_image_target_texture_2d
- i965: support EXT_EGL_image_storage
- st/dri: track if image is created by a dmabuf
- st/mesa: refactor egl image binding a bit
- st/mesa: implement EGLImageTargetTexStorage
Hyunjun Ko (7):
- freedreno/ir3: cleanup by removing repeated code
- freedreno: support 16b for the sampler opcode
- freedreno/ir3: fix printing output registers of FS.
- freedreno/ir3: fixup when changing to mad.f16
- freedreno/ir3: enable half precision for pre-fs texture fetch
- turnip: fix invalid VK_ERROR_OUT_OF_POOL_MEMORY
- freedreno/ir3: put the conversion back for half const to the right
place.
Iago Toral Quiroga (32):
- v3d: rename vertex shader key (num)_fs_inputs fields
- mesa/st: make sure we remove dead IO variables before handing NIR to
backends
- glsl: add missing initialization of the location path field
- v3d: fix indirect BO allocation for uniforms
- v3d: actually root the first BO in a command list in the job
- v3d: add missing plumbing for VPM load instructions
- v3d: add debug assert
- v3d: enable debug options for geometry shader dumps
- v3d: remove unused variable
- v3d: add initial compiler plumbing for geometry shaders
- v3d: fix packet descriptions for geometry and tessellation shaders
- v3d: emit geometry shader state commands
- v3d: implement geometry shader instancing
- v3d: add 1-way SIMD packing definition
- v3d: compute appropriate VPM memory configuration for geometry shader
workloads
- v3d: we always have at least one output segment
- v3d: add support for adjacency primitives
- v3d: don't try to render if shaders failed to compile
- v3d: predicate geometry shader outputs inside non-uniform control
flow
- v3d: save geometry shader state for blitting
- v3d: support transform feedback with geometry shaders
- v3d: remove obsolete assertion
- v3d: do not limit new CL space allocations with branch to 4096 bytes
- v3d: support rendering to multi-layered framebuffers
- v3d: move layer rendering to a separate helper
- v3d: handle writes to gl_Layer from geometry shaders
- v3d: fix primitive queries for geometry shaders
- v3d: disable lowering of indirect inputs
- v3d: support precompiling geometry shaders
- v3d: expose OES_geometry_shader
- u_vbuf: don't try to delete NULL driver CSO
- v3d: fix bug when checking result of syncobj fence import
Ian Romanick (39):
- intel/compiler: Report the number of non-spill/fill SEND messages on
vec4 too
- nir/algebraic: Add the ability to mark a replacement as exact
- nir/algebraic: Mark other comparison exact when removing a == a
- intel/fs: Disable conditional discard optimization on Gen4 and Gen5
- nir/range-analysis: Add pragmas to help loop unrolling
- nir/range_analysis: Make sure the table validation only occurs once
- nir/opt_peephole_select: Don't count some unary operations
- intel/compiler: Increase nir_opt_peephole_select threshold
- nir/algebraic: Simplify some Inf and NaN avoidance code
- nir/algebraic: Rearrange bcsel sequences generated by
nir_opt_peephole_select
- intel/compiler: Fix 'comparison is always true' warning
- mesa: Silence 'left shift of negative value' warning in BPTC
compression code
- mesa: Silence unused parameter warning
- anv: Fix error message format string
- mesa: Extension boilerplate for INTEL_shader_integer_functions2
- glsl: Add new expressions for INTEL_shader_integer_functions2
- glsl_types: Add function to get an unsigned base type from a signed
type
- glsl: Add built-in functions for INTEL_shader_integer_functions2
- nir: Add new instructions for INTEL_shader_integer_functions2
- nir/algebraic: Add lowering for uabs_usub and uabs_isub
- nir/algebraic: Add lowering for 64-bit hadd and rhadd
- nir/algebraic: Add lowering for 64-bit usub_sat
- nir/algebraic: Add lowering for 64-bit uadd_sat
- nir/algebraic: Add lowering for 64-bit iadd_sat and isub_sat
- compiler: Translate GLSL IR to NIR for new
INTEL_shader_integer_functions2 expressions
- intel/fs: Don't lower integer multiplies that don't need lowering
- intel/fs: Add SHADER_OPCODE_[IU]SUB_SAT pseudo-ops
- intel/fs: Implement support for NIR opcodes for
INTEL_shader_integer_functions2
- nir/spirv: Translate SPIR-V to NIR for new
INTEL_shader_integer_functions2 opcodes
- spirv: Silence a bunch of unused parameter warnings
- spirv: Add support for IntegerFunctions2INTEL capability
- i965: Enable INTEL_shader_integer_functions2 on Gen8+
- gallium: Add a cap bit for OpenCL-style extended integer functions
- gallium: Add a cap bit for integer multiplication between 32-bit and
16-bit
- iris: Enable INTEL_shader_integer_functions2
- anv: Enable SPV_INTEL_shader_integer_functions2 and
VK_INTEL_shader_integer_functions2
- nir/algebraic: Optimize some 64-bit integer comparisons involving
zero
- relnotes: Add GL_INTEL_shader_integer_functions2 and
VK_INTEL_shader_integer_functions2
- intel/fs: Don't count integer instructions as being possibly coissue
Icecream95 (16):
- gallium/auxiliary: Reduce conversions in
u_vbuf_get_minmax_index_mapped
- gallium/auxiliary: Handle count == 0 in
u_vbuf_get_minmax_index_mapped
- panfrost: Add negative lod bias support
- panfrost: Compact the bo_access readers array
- panfrost: Dynamically allocate shader variants
- panfrost: Add ETC1/ETC2 texture formats
- panfrost: Add ASTC texture formats
- pan/midgard: Fix bundle dynarray leak
- pan/midgard: Fix a memory leak in the disassembler
- pan/midgard: Support disassembling to a file
- pan/bifrost: Support disassembling to a file
- pan/decode: Support dumping to a file
- pan/decode: Dump to a file
- pan/decode: Rotate trace files
- panfrost: Don't copy uniforms when the size is zero
- pan/midgard: Fix a liveness info leak
Icenowy Zheng (2):
- lima: support indexed draw with bias
- lima: fix lima_set_vertex_buffers()
Ilia Mirkin (7):
- gm107/ir: fix loading z offset for layered 3d image bindings
- nv50/ir: mark STORE destination inputs as used
- nv50,nvc0: fix destination coordinates of blit
- nvc0: add dummy reset status support
- gm107/ir: avoid combining geometry shader stores at 0x60
- nvc0: treat all draws without color0 broadcast as MRT
- nvc0: disable xfb's which don't have a stride
Italo Nicola (1):
- intel/compiler: remove old comment
Iván Briano (4):
- intel/compiler: Don't change hstride if not needed
- anv: Export filter_minmax support only when it's really supported
- anv: Export VK_KHR_buffer_device_address only when really supported
- anv: Enable Vulkan 1.2 support
James Xiong (3):
- iris: try to set the specified tiling when importing a dmabuf
- gallium: dmabuf support for yuv formats that are not natively
supported
- gallium: let the pipe drivers decide the supported modifiers
Jan Vesely (2):
- clover: Initialize Asm Parsers
- clover: Use explicit conversion from llvm::StringRef to std::string
Jan Zielinski (8):
- gallium/swr: Fix depth values for blit scenario
- swr/rasterizer: Add tessellator implementation to the rasterizer
- gallium/swr: Fix Windows build
- gallium/gallivm/tgsi: enable tessellation shaders
- gallium/gallivm: enable linking lp_bld_printf function with C++ code
- gallium/swr: implementation of tessellation shaders compilation
- gallium/swr: fix tessellation state save/restore
- docs: Update SWR tessellation support
Jason Ekstrand (212):
- util: Add a util_sparse_array data structure
- anv: Move refcount to anv_bo
- anv: Use a util_sparse_array for the GEM handle -> BO map
- anv: Fix a relocation race condition
- anv: Stop storing the GEM handle in anv_reloc_list_add
- anv: Declare the bo in the anv_block_pool_foreach_bo loop
- anv: Inline anv_block_pool_get_bo
- anv: Replace ANV_BO_EXTERNAL with anv_bo::is_external
- anv: Handle state pool relocations using "wrapper" BOs
- anv: Fix a potential BO handle leak
- anv: Rework anv_block_pool_expand_range
- anv: Use anv_block_pool_foreach_bo in get_bo_from_pool
- anv: Rework the internal BO allocation API
- anv: Choose BO flags internally in anv_block_pool
- anv/tests: Zero-initialize instances
- anv/tests: Initialize the BO cache and device mutex
- anv: Allocate block pool BOs from the cache
- anv: Use the query_slot helper in vkResetQueryPoolEXT
- anv: Allocate query pool BOs from the cache
- anv: Set more flags on descriptor pool buffers
- anv: Allocate descriptor buffers from the BO cache
- util: Add a free list structure for use with util_sparse_array
- anv: Allocate batch and fence buffers from the cache
- anv: Allocate scratch BOs from the cache
- anv: Allocate misc BOs from the cache
- anv: Drop anv_bo_init and anv_bo_init_new
- anv: Add a device parameter to anv_execbuf_add_bo
- anv: Set the batch allocator for compute pipelines
- anv: Use a bitset for tracking residency
- anv: Zero released anv_bo structs
- anv: Use the new BO alloc API for Android
- anv: Don't delete fragment shaders that write sample mask
- anv: Don't claim the null RT as a valid color target
- anv: Stop compacting render targets in the binding table
- anv: Move the RT BTI flush workaround to begin_subpass
- spirv: Remove the type from sampled_image
- spirv: Add a vtn_decorate_pointer helper
- spirv: Sort out the mess that is sampled image
- nir/builder: Add a nir_extract_bits helper
- nir: Add tests for nir_extract_bits
- intel/nir: Use nir_extract_bits in lower_mem_access_bit_sizes
- intel/fs: Add DWord scattered read/write opcodes
- intel/fs: refactor surface header setup
- intel/nir: Plumb devinfo through lower_mem_access_bit_sizes
- intel/fs: Implement the new load/store_scratch intrinsics
- intel/fs: Lower large local arrays to scratch
- anv: Lock around fetching sync file FDs from semaphores
- anv: Plumb timeline semaphore signal/wait values through from the API
- spirv: Fix the MSVC build
- anv/pipeline: Assume layout != NULL
- genxml: Mark everything in genX_pack.h always_inline
- anv: Input attachments are always single-plane
- anv: Flatten descriptor bindings in anv_nir_apply_pipeline_layout
- anv: Delete dead shader constant pushing code
- anv: Stop bounds-checking pushed UBOs
- anv: Pre-compute push ranges for graphics pipelines
- intel/compiler: Add a flag to avoid compacting push constants
- anv: Re-arrange push constant data a bit
- anv: Rework push constant handling
- anv: Use a switch statement for binding table setup
- anv: More carefully dirty state in BindDescriptorSets
- anv: More carefully dirty state in BindPipeline
- anv: Use an anv_state for the next binding table
- anv: Emit a NULL vertex for zero base_vertex/instance
- nir: Validate that variables are in the right lists
- iris: Re-enable param compaction
- Revert "i965/fs: Merge CMP and SEL into CSEL on Gen8+"
- vulkan/enum_to_str: Handle out-of-order aliases
- anv/entrypoints: Better handle promoted extensions
- vulkan: Update the XML and headers to 1.1.129
- anv: Push constants are relative to dynamic state on IVB
- anv: Set up SBE_SWIZ properly for gl_Viewport
- anv: Respect the always_flush_cache driconf option
- iris: Stop setting up fake params
- anv: Drop bo_flags from anv_bo_pool
- anv: Add a has_softpin boolean
- blorp: Pass the VB size to the VF cache workaround
- anv: Always invalidate the VF cache in BeginCommandBuffer
- anv: Apply cache flushes after setting index/draw VBs
- anv: Use PIPE_CONTROL flushes to implement the gen8 VF cache WA
- anv: Don't leak when set_tiling fails
- util/atomic: Add a \_return variant of p_atomic_add
- anv: Disallow allocating above heap sizes
- anv: Stop tracking VMA allocations
- anv: Set up VMA heaps independently from memory heaps
- anv: Stop advertising two heaps just for the VF cache WA
- anv: Add an explicit_address parameter to anv_device_alloc_bo
- util/vma: Factor out the hole splitting part of util_vma_heap_alloc
- util/vma: Add a function to allocate a particular address range
- anv: Add allocator support for client-visible addresses
- anv: Use a pNext loop in AllocateMemory
- anv: Implement VK_KHR_buffer_device_address
- util/atomic: Add p_atomic_add_return for the unlocked path
- vulkan/wsi: Provide the implicitly synchronized BO to vkQueueSubmit
- vulkan/wsi: Add a hooks for signaling semaphores and fences
- anv: Always add in EXEC_OBJECT_WRITE when specified in extra_flags
- anv: Use submit-time implicit sync instead of allocate-time
- anv: Add a fence_reset_reset_temporary helper
- anv: Use BO fences/semaphores for AcquireNextImage
- anv: Return VK_ERROR_OUT_OF_DEVICE_MEMORY for too-large buffers
- anv: Re-capture all batch and state buffers
- anv: Re-emit all compute state on pipeline switch
- ANV: Stop advertising smoothLines support on gen10+
- anv: Flush the queue on DeviceWaitIdle
- anv: Unconditionally advertise Vulkan 1.1
- anv: Bump the advertised patch version to 129
- i965: Enable GL_EXT_gpu_shader4 on Gen6+
- anv: Properly advertise sampledImageIntegerSampleCounts
- anv: Drop unneeded struct keywords
- blorp: Stop whacking Z24 depth to BGRA8
- blorp: Allow reading with HiZ
- i965/blorp: Don't resolve HiZ unless we're reinterpreting
- intel/blorp: Use the source format when using blorp_copy with HiZ
- anv: Allow HiZ in TRANSFER_SRC_OPTIMAL on Gen8-9
- i965: Allow HiZ for glCopyImageSubData sources
- intel/nir: Add a memory barrier before barrier()
- intel/disasm: Fix decoding of src0 of SENDS
- genxml: Remove a non-existant HW bit
- anv: Don't add dynamic state base address to push constants on Gen7
- anv: Flag descriptors dirty when gl_NumWorkgroups is used
- anv: Re-use flush_descriptor_sets in flush_compute_state
- intel/vec4: Support scoped_memory_barrier
- nir: Handle more barriers in dead_write and copy_prop
- nir: Handle barriers with more granularity in combine_stores
- llmvpipe: No-op implement more barriers
- nir: Add a new memory_barrier_tcs_patch intrinsic
- spirv: Add a workaround for OpControlBarrier on old GLSLang
- spirv: Add output memory semantics to OpControlBarrier in TCS
- nir/glsl: Emit memory barriers as part of barrier()
- intel/nir: Stop adding redundant barriers
- nir: Rename nir_intrinsic_barrier to control_barrier
- nir/lower_atomics_to_ssbo: Also lower barriers
- anv: Drop an unused variable
- intel/blorp: Fill out all the dwords of MI_ATOMIC
- anv: Don't over-advertise descriptor indexing features
- anv: Memset array properties
- vulkan/wsi: Add a driconf option to force WSI to advertise
BGRA8_UNORM first
- vulkan: Update the XML and headers to 1.2.131
- turnip: Pretend to support Vulkan 1.2
- anv: Bump the patch version to 131
- anv,nir: Lower quad_broadcast with dynamic index in NIR
- anv: Implement the new core version feature queries
- anv: Implement the new core version property queries
- relnotes: Add Vulkan 1.2
- anv: Drop some VK_IMAGE_TILING_OPTIMAL checks
- anv: Support modifiers in GetImageFormatProperties2
- vulkan/wsi: Move the ImageCreateInfo higher up
- vulkan/wsi: Use the interface from the real modifiers extension
- vulkan/wsi: Filter modifiers with ImageFormatProperties
- vulkan/wsi: Implement VK_KHR_swapchain_mutable_format
- anv/blorp: Rename buffer image stride parameters
- anv: Canonicalize buffer formats for image/buffer copies
- anv: Add an anv_physical_device field to anv_device
- anv: Take an anv_device in vk_errorf
- anv: Take a device in anv_perf_warn
- anv: Stop allocating WSI event fences off the instance
- anv: Drop the instance pointer from anv_device
- anv: Move the physical device dispatch table to anv_instance
- anv: Drop separate chipset_id fields
- anv: Re-arrange physical_device_init
- anv: Allow enumerating multiple physical devices
- anv/apply_pipeline_layout: Initialize the nir_builder before use
- intel/blorp: resize src and dst surfaces separately
- anv: Use TRANSFER_SRC_OPTIMAL for depth/stencil MSAA resolves
- anv: Add a layout_to_aux_state helper
- anv: Use isl_aux_state for HiZ resolves
- anv: Add a usage parameter to anv_layout_to_aux_usage
- anv: Allow HiZ in read-only depth layouts
- anv: Improve BTI change cache flushing
- intel/fs: Don't unnecessarily fall back to indirect sends on Gen12
- intel/disasm: Properly disassemble indirect SENDs
- intel/isl: Plumb devinfo into isl_genX(buffer_fill_state_s)
- intel/isl: Add a hack for the Gen12 A0 texture buffer bug
- anv: Rework the meaning of anv_image::planes[]::aux_usage
- anv: Replace aux_surface.isl.size_B checks with aux_usage checks
- intel/aux-map: Add some #defines
- intel/aux-map: Factor out some useful helpers
- anv: Delete a redundant calculation
- isl: Add a helper for calculating subimage memory ranges
- anv: Add another align_down helper
- anv: Make AUX table invalidate a PIPE\_\* bit
- anv: Make anv_vma_alloc/free a lot dumber
- anv: Rework CCS memory handling on TGL-LP
- intel/blorp: Add support for CCS_E copies with UNORM formats
- intel/isl: Allow CCS_E on more formats
- intel/genxml: Make SO_DECL::"Hole Flag" a Boolean
- anv: Insert holes for non-existant XFB varyings
- intel/blorp: Handle bit-casting UNORM and BGRA formats
- anv: Replace one more aux_surface.isl.size_B check
- intel/mi_builder: Force write completion on Gen12+
- anv: Set actual state pool sizes when we have softpin
- anv: Re-use one old BT block in reset_batch_bo_chain
- anv/block_pool: Ensure allocations have contiguous maps
- anv: Rename a variable
- genxml: Add a new 3DSTATE_SF field on gen12
- anv,iris: Set 3DSTATE_SF::DerefBlockSize to per-poly on Gen12+
- intel/genxml: Drop SLMEnable from L3CNTLREG on Gen11
- iris: Set SLMEnable based on the L3$ config
- iris: Store the L3$ configs in the screen
- iris: Use the URB size from the L3$ config
- i965: Re-emit l3 state before BLORP executes
- intel: Take a gen_l3_config in gen_get_urb_config
- intel/blorp: Always emit URB config on Gen7+
- iris: Consolodate URB emit
- anv: Emit URB setup earlier
- intel/common: Return the block size from get_urb_config
- intel/blorp: Plumb deref block size through to 3DSTATE_SF
- anv: Plumb deref block size through to 3DSTATE_SF
- iris: Plumb deref block size through to 3DSTATE_SF
- anv: Always fill out the AUX table even if CCS is disabled
- intel/fs: Write the address register with NoMask for MOV_INDIRECT
- anv/blorp: Use the correct size for vkCmdCopyBufferToImage
Jonathan Gray (4):
- winsys/amdgpu: avoid double simple_mtx_unlock()
- i965: update Makefile.sources for perf changes
- util/futex: use futex syscall on OpenBSD
- util/u_thread: don't restrict u_thread_get_time_nano() to \__linux_\_
Jonathan Marek (98):
- freedreno: add Adreno 640 ID
- freedreno/ir3: disable texture prefetch for 1d array textures
- freedreno/registers: fix a6xx_2d_blit_cntl ROTATE
- etnaviv: blt: use only for tiling, and add missing formats
- etnaviv: separate PE and RS formats, use only RS only for tiling
- etnaviv: blt: set TS dirty after clear
- turnip: add display wsi
- turnip: add x11 wsi
- turnip: implement CmdClearColorImage/CmdClearDepthStencilImage
- turnip: fix sRGB GMEM clear
- util: add missing R8G8B8A8_SRGB format to vk_format_map
- freedreno/regs: update UBWC related bits
- turnip: implement UBWC
- etnaviv: avoid using RS for 64bpp formats
- etnaviv: implement 64bpp clear
- etnaviv: blt: fix partial ZS clears with TS
- etnaviv: support 3d/array/integer formats in texture descriptors
- turnip: fix integer render targets
- freedreno/registers: add missing MH perfcounter enum for a2xx
- freedreno/perfcntrs: add a2xx MH counters
- freedreno/perfcntrs/fdperf: fix u64 print on 32-bit builds
- freedreno/perfcntrs/fdperf: add missing a20x compatible
- freedreno/perfcntrs/fdperf: add missing a2xx case in select_counter
- turnip: fix display wsi fence timing out
- turnip: don't skip unused attachments when setting up tiling config
- turnip: implement CmdClearAttachments
- turnip: don't set unused BLIT_DST_INFO bits for GMEM clear
- turnip: MSAA resolve directly from GMEM
- turnip: allow writes to draw_cs outside of render pass
- turnip: add function to allocate aligned memory in a substream cs
- turnip: improve emit_textures
- turnip: implement border color
- turnip: add hw binning
- turnip: fix incorrectly failing assert
- freedreno/ir3: add GLSL_SAMPLER_DIM_SUBPASS to tex_info
- freedreno/registers: add a6xx texture format for stencil sampler
- turnip: fix hw binning render area
- turnip: fix tile layout logic
- turnip: update tile_align_w/tile_align_h
- turnip: set load_layer_id to zero
- turnip: set FRAG_WRITES_SAMPMASK bit
- turnip: fix VK_IMAGE_ASPECT_STENCIL_BIT image view
- turnip: no 8x msaa on 128bpp formats
- turnip: add dirty bit for push constants
- turnip: subpass rework
- turnip: CmdClearAttachments fixes
- turnip: implement subpass input attachments
- etnaviv: remove sRGB formats from format table
- etnaviv: sRGB render target support
- etnaviv: set output mode and saturate bits
- etnaviv: update INT_FILTER choice for GLES3 formats
- etnaviv: disable integer vertex formats on pre-HALTI2 hardware
- etnaviv: remove swizzle from format table
- etnaviv: add missing formats
- etnaviv: add missing vs_needs_z_div handling to NIR backend
- turnip: use single substream cs
- turnip: use common blit path for buffer copy
- turnip: don't require src image to be set for clear blits
- turnip: implement CmdFillBuffer/CmdUpdateBuffer
- freedreno/ir3: lower mul_2x32_64
- turnip: fix emit_textures for compute shaders
- turnip: remove compute emit_border_color
- turnip: fix emit_ibo
- turnip: change emit_ibo to be like emit_textures
- turnip: remove duplicate A6XX_SP_CS_CONFIG_NIBO
- nir: add option to lower half packing opcodes
- freedreno/ir3: lower pack/unpack ops
- turnip: don't set LRZ enable at end of renderpass
- freedreno/ir3: update prefetch input_offset when packing inlocs
- turnip: add cache invalidate to fix input attachment cases
- turnip: don't set SP_FS_CTRL_REG0_VARYING if only fragcoord is used
- freedreno/ir3: fix vertex shader sysvals with pre_assign_inputs
- freedreno/registers: document vertex/instance id offset bits
- freedreno/ir3: support load_base_instance
- turnip: emit base instance vs driver param
- turnip: emit_compute_driver_params fixes
- turnip: compute gmem offsets at renderpass creation time
- turnip: implement secondary command buffers
- nir: fix assign_io_var_locations for vertex inputs
- turnip: minor warning fixes
- util/format: add missing vulkan formats
- turnip: disable B8G8R8 vertex formats
- etnaviv: fix incorrectly failing vertex size assert
- etnaviv: update headers from rnndb
- etnaviv: HALTI2+ instanced draw
- etnaviv: implement gl_VertexID/gl_InstanceID
- etnaviv: remove unnecessary vertex_elements_state_create error
checking
- st/mesa: don't lower YUV when driver supports it natively
- st/mesa: run st_nir_lower_tex_src_plane for lowered xyuv/ayuv
- freedreno/ir3: allow inputs with the same location
- turnip: remove tu_sort_variables_by_location
- turnip: fix array/matrix varyings
- turnip: hook up GetImageDrmFormatModifierPropertiesEXT
- turnip: set linear tiling for scanout images
- vulkan/wsi: remove unused image_get_modifier
- turnip: simplify tu_physical_device_get_format_properties
- etnaviv: implement UBOs
- turnip: hook up cmdbuffer event set/wait
Jordan Justen (7):
- iris: Add IRIS_DIRTY_RENDER_BUFFER state flag
- iris/gen11+: Move flush for render target change
- iris: Allow max dynamic pool size of 2GB for gen12
- intel: Remove unused Tigerlake PCI ID
- iris: Fix some indentation in iris_init_render_context
- iris: Emit CS Stall before Instruction Cache flush for gen12 WA
- anv: Emit CS Stall before Instruction Cache flush for gen12 WA
Jose Maria Casanova Crespo (1):
- v3d: Fix predication with atomic image operations
Juan A. Suarez Romero (3):
- nir/lower_double_ops: relax lower mod()
- Revert "nir/lower_double_ops: relax lower mod()"
- nir/spirv: skip unreachable blocks in Phi second pass
Kai Wasserbäch (4):
- nir: fix unused variable warning in nir_lower_vars_to_explicit_types
- nir: fix unused variable warning in
find_and_update_previous_uniform_storage
- nir: fix unused function warning in src/compiler/nir/nir.c
- intel/gen_decoder: Fix unused-but-set-variable warning
Karol Herbst (14):
- nv50/ir: fix crash in isUniform for undefined values
- nir/validate: validate num_components on registers and intrinsics
- nir/serialize: fix vec8 and vec16
- nir/tests: add serializer tests
- nir/tests: MSVC build fix
- spirv: handle UniformConstant for OpenCL kernels
- clover/nir: treat UniformConstant as global memory
- clover/nir: set spirv environment to OpenCL
- clover/spirv: allow Int64 Atomics for supported devices
- nir: handle nir_deref_type_ptr_as_array in
rematerialize_deref_in_block
- nv50/ir: implement global atomics and handle it for nir
- nir/serialize: cast swizzle before shifting
- aco: use NIR_MAX_VEC_COMPONENTS instead of 4
- nv50ir/nir: support vec8 and vec16
Kenneth Graunke (57):
- iris: Fix "Force Zero RTA Index Enable" setting again
- nir: Handle image arrays when setting variable data
- Revert "intel/blorp: Fix usage of uninitialized memory in key
hashing"
- iris: Properly move edgeflag_out from output list to global list
- iris: Wrap iris_fix_edge_flags in NIR_PASS
- mesa: Handle GL_COLOR_INDEX in \_mesa_format_from_format_and_type().
- iris: Change keybox parenting
- iris: Stop mutating the resource in get_rt_read_isl_surf().
- iris: Drop 'old_address' parameter from iris_rebind_buffer
- iris: Create an "iris_surface_state" wrapper struct
- iris: Maintain CPU-side SURFACE_STATE copies for views and surfaces.
- iris: Update SURFACE_STATE addresses when setting sampler views
- iris: Disable VF cache partial address workaround on Gen11+
- driconf, glsl: Add a vs_position_always_invariant option
- drirc: Set vs_position_always_invariant for Shadow of Mordor on Intel
- st/mesa: Add GL_TDFX_texture_compression_FXT1 support
- iris: Map FXT1 texture formats
- meson: Add a "prefer_iris" build option
- main: Change u_mmAllocMem align2 from bytes (old API) to bits (new
API)
- meson: Include iris in default gallium-drivers for x86/x86_64
- util: Detect use-after-destroy in simple_mtx
- intel/genxml: Add a partial TCCNTLREG definition
- iris: Enable Gen11 Color/Z write merging optimization
- anv: Enable Gen11 Color/Z write merging optimization
- intel/decoder: Make get_state_size take a full 64-bit address and a
base
- iris: Create smaller program keys without legacy features
- iris: Default to X-tiling for scanout buffers without modifiers
- iris: Alphabetize source files after iris_perf.c was added
- drirc: Final Fantasy VIII: Remastered needs
allow_higher_compat_version
- iris: Make helper functions to turn iris shader keys into brw keys.
- iris: Fix shader recompile debug printing
- iris: Avoid replacing backing storage for buffers with no contents
- intel: Drop Gen11 WaBTPPrefetchDisable workaround
- st/nir: Optionally unify inputs_read/outputs_written when linking.
- iris: Set nir_shader_compiler_options::unify_interfaces.
- st/mesa: Allow ASTC5x5 fallbacks separately from other ASTC LDR
formats.
- iris: Disable ASTC 5x5 support on Gen9 for now.
- iris: Delete remnants of the unimplemented ASTC 5x5 workaround
- iris: Allow HiZ for copy_region sources
- anv: Only enable EWA LOD algorithm when doing anisotropic filtering.
- Revert "nir: assert that nir_lower_tex runs after lowering derefs"
- i965: Simplify brw_get_renderer_string()
- iris: Simplify iris_get_renderer_string()
- intel: Use similar brand strings to the Windows drivers
- intel/compiler: Fix illegal mutation in get_nir_image_intrinsic_image
- iris: Fix export of fences that have already completed.
- st/mesa: Allocate full miplevels if MaxLevel is explicitly set
- iris: Drop some workarounds which are no longer necessary
- anv: Drop some workarounds that are no longer necessary
- intel: Fix aux map alignments on 32-bit builds.
- meson: Prefer 'iris' by default over 'i965'.
- loader: Check if the kernel driver is i915 before loading iris
- iris: Drop 'engine' from iris_batch.
- iris: Make iris_emit_default_l3_config pull devinfo from the batch
- iris: Support multiple chained batches.
- i965: Use brw_batch_references in tex_busy check
- loader: Fix leak of kernel driver name
Kristian Høgsberg (62):
- freedreno/registers: Fix typo
- freedreno/registers: Move SP_PRIMITIVE_CNTL and SP_VS_VPC_DST
- freedreno/registers: Add comments about primitive counters
- freedreno/a6xx: Fix primitive counters again
- freedreno/a6xx: Clear sysmem with CP_BLIT
- freedreno: Add nogmem debug option to force bypass rendering
- freedreno/a6xx: Fix layered texture type enum
- freedreno/a6x: Rename z/s formats
- freedreno/a6xx: Add register offset for STG/LDG
- freedreno/ir3: Emit link map as byte or dwords offsets as needed
- freedreno/ir3: Add load and store intrinsics for global io
- freedreno: Don't count primitives for patches
- freedreno/ir3: Add ir3 intrinsics for tessellation
- freedreno/ir3: Use imul24 in offset calculations
- freedreno/ir3: Add tessellation field to shader key
- freedreno/ir3: Extend geometry lowering pass to handle tessellation
- freedreno/ir3: Add new synchronization opcodes
- freedreno/ir3: End TES with chsh when using GS
- freedreno/ir3: Implement tess coord intrinsic
- freedreno/ir3: Implement TCS synchronization intrinsics
- freedreno/ir3: Setup inputs and outputs for tessellation stages
- freedreno/ir3: Don't assume binning shader is always VS
- freedreno/ir3: Pre-color TCS header and primitive ID inputs
- freedreno/ir3: Allocate const space for tessellation parameters
- freedreno/a6xx: Build the right draw command for tessellation
- freedreno/a6xx: Allocate and program tessellation buffer
- freedreno/a6xx: Emit constant parameters for tessellation stages
- freedreno/a6xx: Program state for tessellation stages
- freedreno: Use bypass rendering for tessellation
- freedreno/a6xx: Only set emit.hs/ds when we're drawing patches
- freedreno/blitter: Save tessellation state
- freedreno/a6xx: Only use merged regs and four quads for VS+FS
- freedreno/a6xx: Turn on tessellation shaders
- freedreno/ir3: Use regid() helper when setting up precolor regs
- freedreno/registers: Remove duplicate register definitions
- freedreno: New struct packing macros
- freedreno/registers: Add 64 bit address registers
- freedreno/a6xx: Drop stale include
- freedreno/a6xx: Include fd6_pack.h in a few files
- freedreno/a6xx: Convert emit_mrt() to OUT_REG()
- freedreno/a6xx: Convert emit_zs() to OUT_REG()
- freedreno/a6xx: Convert VSC pipe setup to OUT_REG()
- freedreno/a6xx: Convert gmem blits to OUT_REG()
- freedreno/a6xx: Convert some tile setup to OUT_REG()
- freedreno/a6xx: Silence warning for unused perf counters
- freedreno/a6xx: Document the CP_SET_DRAW_STATE enable bits
- freedreno/a6xx: Make DEBUG_BLIT_FALLBACK only dump fallbacks
- freedreno: Add debug flag for forcing linear layouts
- freedreno/a6xx: Program sampler swap based on resource tiling
- freedreno/a6xx: Pick blitter swap based on resource tiling
- freedreno/a6xx: Add fd_resource_swap() helper
- freedreno/a6xx: Use blitter for resolve blits
- freedreno/a6xx: RB6_R8G8B8 is actually 32 bit RGBX
- freedreno/a6xx: Use A6XX_SP_2D_SRC_FORMAT_MASK macro
- freedreno/a6xx: Handle srgb blits on the blitter
- freedreno/a6xx: Move handle_rgba_blit() up
- freedreno/a6xx: Rewrite compressed blits in a helper function
- freedreno/a6xx: Set up multisample sysmem MRTs correctly
- st/mesa: Lower vars to ssa and constant prop before
gl_nir_lower_buffers
- ir3: Set up full/half register conflicts correctly
- iris: Advertise PIPE_CAP_NATIVE_FENCE_FD
- iris: Print warning and return \*out = NULL when fd to syncobj fails
Krzysztof Raszkowski (10):
- gallium/swr: Fix GS invocation issues - Fixed proper setting
gl_InvocationID. - Fixed GS vertices output memory overflow.
- gallium/swr: Enable some ARB_gpu_shader5 extensions Enable / add to
features.txt: - Enhanced textureGather. - Geometry shader instancing.
- Geometry shader multiple streams.
- gallium/swr: Fix crash when use GL_TDFX_texture_compression_FXT1
format.
- gallivm: add TGSI bit arithmetic opcodes support
- gallium/swr: Fix glVertexPointer race condition.
- gallium/swr: Disable showing detected arch message.
- docs/GL4: update gallium/swr features
- gallium/swr: add option for static link
- gallium/swr: Fix gcc 4.8.5 compile error
- gallium/swr: simplify environmental variabled expansion code
Lasse Lopperi (1):
- freedreno/drm: Fix memory leak in softpin implementation
Laurent Carlier (1):
- egl: avoid local modifications for eglext.h Khronos standard header
file
Leo Liu (1):
- ac: add missing Arcturus to the info of pc lines
Lepton Wu (2):
- gallium: dri2: Use index as plane number.
- android: mesa: Revert "android: mesa: revert "Enable asm
unconditionally""
Lionel Landwerlin (60):
- intel/dev: set default num_eu_per_subslice on gen12
- intel/perf: add TGL support
- intel/perf: fix Android build
- mesa: check draw buffer completeness on
glClearBufferfi/glClearBufferiv
- vulkan: bump headers/registry to 1.1.127
- anv: Properly handle host query reset of performance queries
- anv: implement VK_KHR_separate_depth_stencil_layouts
- mesa: check framebuffer completeness only after state update
- anv: invalidate file descriptor of semaphore sync fd at vkQueueSubmit
- anv: remove list items on batch fini
- anv: detach batch emission allocation from device
- anv: expose timeout helpers outside of anv_queue.c
- anv: move queue init/finish to anv_queue.c
- anv: allow NULL batch parameter to anv_queue_submit_simple_batch
- anv: prepare driver to report submission error through queues
- anv: refcount semaphores
- anv: prepare the driver for delayed submissions
- anv/wsi: signal the semaphore in the acquireNextImage
- anv: implement VK_KHR_timeline_semaphore
- intel/dev: flag the Elkhart Lake platform
- intel/perf: add EHL performance query support
- intel/perf: fix invalid hw_id in query results
- intel/perf: set read buffer len to 0 to identify empty buffer
- intel/perf: take into account that reports read can be fairly old
- intel/perf: simplify the processing of OA reports
- intel/perf: fix improper pointer access
- anv: fix missing gen12 handling
- anv: fix incorrect VMA alignment for CCS main surfaces
- anv: fix fence underlying primitive checks
- anv: fix assumptions about temporary fence payload
- intel/perf: drop batchbuffer flushing at query begin
- i965/iris: perf-queries: don't invalidate/flush 3d pipeline
- anv: constify pipeline layout in nir passes
- anv: drop unused parameter from apply layout pass
- vulkan/wsi: error out when image fence doesn't signal
- mesa: avoid triggering assert in implementation
- i965/iris/perf: factor out frequency register capture
- loader: fix close on uninitialized file descriptor value
- anv: don't close invalid syncfd semaphore
- anv: fix intel perf queries availability writes
- anv: set stencil layout for input attachments
- iris: Implement Gen12 workaround for non pipelined state
- anv: Implement Gen12 workaround for non pipelined state
- anv: only use VkSamplerCreateInfo::compareOp if enabled
- anv: fix pipeline switch back for non pipelined states
- genxml: add new Gen11+ PIPE_CONTROL field
- iris: handle new PIPE_CONTROL field
- iris: implement another workaround for non pipelined states
- anv: implement another workaround for non pipelined states
- intel/perf: expose timestamp begin for mdapi
- intel/perf: report query split for mdapi
- anv: enable VK_KHR_swapchain_mutable_format
- anv: don't report error with other vendor DRM devices
- anv: ensure prog params are initialized with 0s
- anv/iris: warn gen12 3DSTATE_HS restriction
- intel: Implement Gen12 workaround for array textures of size 1
- isl: drop CCS row pitch requirement for linear surfaces
- isl: add gen12 comment about CCS for linear tiling
- anv: implement gen9 post sync pipe control workaround
- anv: set MOCS on push constants
Luis Mendes (1):
- radv: fix radv secure compile feature breaks compilation on armhf
EABI and aarch64
Marco Felsch (1):
- etnaviv: Fix assert when try to accumulate an invalid fd
Marek Olšák (245):
- glsl: encode/decode types using a union with bitfields for
readability
- glsl: encode vector_elements and matrix_columns better
- glsl: encode explicit_stride for basic types better
- glsl: encode array types better
- glsl: encode struct/interface types better
- st/mesa: call nir_opt_access only once
- st/mesa: call nir_lower_flrp only once per shader
- compiler: make variable::data::binding unsigned
- nir: pack nir_variable::\data::stream
- nir: pack nir_variable::\data::xfb\_\*
- radeonsi: use IR SHA1 as the cache key for the in-memory shader cache
- radeonsi: don't keep compute shader IR after compilation
- radeonsi: keep serialized NIR instead of nir_shader in
si_shader_selector
- nir: pack the rest of nir_variable::data
- nir/serialize: don't expand 16-bit variable state slots to 32 bits
- nir/serialize: store 32-bit object IDs instead of 64-bit
- nir/serialize: pack nir_variable flags
- mesa: expose SPIR-V extensions in the Compatibility profile too
- util: add blob_finish_get_buffer
- radeonsi/nir: call nir_serialize only once per shader
- radeonsi/nir: fix compute shader crash due to nir_binary == NULL
- glsl/linker: pass shader_info to analyze_clip_cull_usage directly
- compiler: pack shader_info from 160 bytes to 96 bytes
- st/mesa: fix Sanctuary and Tropics by disabling ARB_gpu_shader5 for
them
- st/mesa: rename DEBUG_TGSI -> DEBUG_PRINT_IR
- st/mesa: remove \\n being only printed in debug builds after printed
TGSI
- st/mesa: print TCS/TES/GS/CS TGSI in the right place & keep disk
cache enabled
- st/mesa: add ST_DEBUG=nir to print NIR shaders
- st/mesa: remove unused TGSI-only debug printing functions
- gallium/noop: call finalize_nir
- radeonsi/nir: remove dead function temps
- radeonsi/nir: call nir_lower_flrp only once per shader
- radeonsi/nir: don't lower fma, instead, fuse fma
- mesa: enable glthread for 7 Days To Die
- st/mesa: rename delete_basic_variant -> delete_common_variant
- st/mesa: decrease the size of st_fp_variant_key from 48 to 40 bytes
- st/mesa: start deduplicating some program code
- st/mesa: initialize affected_states and uniform storage earlier in
deserialize
- st/mesa: consolidate and simplify code flagging
program::affected_states
- st/mesa: trivially merge st_vertex_program into st_common_program
- st/mesa: rename st_common_program to st_program
- st/mesa: cleanups after unification of st_vertex/common program
- st/mesa: rename occurences of stcp to stp to correspond to st_program
- st/mesa: more cleanups after unification of st_vertex/common_program
- st/mesa: subclass st_vertex_program for VP-specific members
- st/mesa: call nir_sweep in st_finalize_nir
- st/mesa: keep serialized NIR instead of nir_shader in st_program
- st/mesa: call nir_serialize only once per shader
- nir: move data.image.access to data.access
- nir/print: only print image.format for image variables
- glsl_to_nir: rename image_access to mem_access
- nir: move data.descriptor_set above data.index for better packing
- nir: don't use GLenum16 in nir.h
- ac: add radeon_info::num_rings and move ring_type to amd_family.h
- ac: fill num_rings for remaining IPs
- winsys/amdgpu: detect noop dependencies on the same ring correctly
- nir: strip as we serialize to remove the nir_shader_clone call
- nir/serialize: do ctx = {0} instead of manual initializations
- util/blob: add 8-bit and 16-bit reads and writes
- nir/serialize: pack instructions better
- nir/serialize: pack src better and limit the object count to 1M from
1G
- nir/serialize: don't serialize var->data for temporaries
- nir/serialize: deduplicate serialized var types by reusing the last
unique one
- nir/serialize: try to store a diff in var data locations instead of
var data
- nir/serialize: pack load_const with non-64-bit constants better
- nir/serialize: pack 1-component constants into 20 bits if possible
- nir/serialize: pack nir_intrinsic_instr::const_index[] better
- nir/serialize: try to pack two alu srcs into 1 uint32
- nir/serialize: don't store deref types if not needed
- nir/serialize: don't serialize mode for deref non-cast instructions
- nir/serialize: try to put deref->var index into the unused bits of
the header
- nir/serialize: cleanup - fold nir_deref_type_var cases into switches
- nir/serialize: try to pack both deref array src into 32 bits
- nir/serialize: remove up to 3 consecutive equal ALU instruction
headers
- nir/serialize: reuse the writemask field for 2 src X swizzles of SSA
ALU
- nir/serialize: serialize swizzles for vec8 and vec16
- nir/serialize: serialize writemask for vec8 and vec16
- nir/serialize: don't serialize redundant
nir_intrinsic_instr::num_components
- nir/serialize: use 3 unused bits in intrinsic for
packed_const_indices
- nir/serialize: support any num_components for remaining instructions
- ac: set swizzled bit in cache policy as a hint not to merge
loads/stores
- radeonsi: initialize the per-context compiler on demand
- radeonsi/nir: don't run si_nir_opts again if there is no change
- st/mesa: don't serialize all streamout state if there are no SO
outputs
- st/mesa: don't use redundant stp->state.ir.nir
- st/mesa: don't call ProgramStringNotify in glsl_to_nir
- st/mesa: propagate gl_PatchVerticesIn from TCS to TES before linking
for NIR
- st/mesa: simplify looping over linked shaders when linking NIR
- st/mesa: don't use \*\* in the st_nir_link_shaders signature
- st/mesa: add st_variant base class to simplify code for shader
variants
- ac/nir: don't rely on data.patch for tess factors
- radeonsi/nir: implement subgroup system values for SPIR-V
- radeonsi: simplify the interface of
get_dw_address_from_generic_indices
- radeonsi: simplify get_tcs_tes_buffer_address_from_generic_indices
- radeonsi/nir: validate is_patch because SPIR-V doesn't set it for
tess factors
- radeonsi/nir: don't rely on data.patch for tess factors
- radeonsi/nir: fix location_frac handling for TCS outputs
- radeonsi/nir: support interface output types to fix SPIR-V xfb
piglits
- radeonsi: enable SPIR-V and GL 4.6 for NIR
- util/driconfig: print ATTENTION if MESA_DEBUG=silent is not set
- radeonsi/gfx10: simplify some duplicated NGG GS code
- radeonsi/gfx10: fix the vertex order for triangle strips emitted by a
GS
- llvmpipe: implement TEX_LZ and TXF_LZ opcodes
- gallivm: implement LOAD with CONSTBUF but don't enable it for
llvmpipe
- st/mesa: support UBOs for Selection/Feedback/RasterPos
- st/mesa: save currently bound vertex samplers and sampler views in
st_context
- st/mesa: support samplers for Selection/Feedback/RasterPos
- st/mesa: support SSBOs for Selection/Feedback/RasterPos
- st/mesa: support shader images for Selection/Feedback/RasterPos
- st/mesa: use a separate VS variant for the draw module
- st/mesa: remove st_vp_variant::num_inputs
- st/mesa: remove struct st_vp_variant in favor of st_common_variant
- st/mesa: don't generate VS TGSI if NIR is enabled
- draw, st/mesa: generate TGSI for ffvp/ARB_vp if draw lacks LLVM
- st/mesa: release the draw shader properly to fix driver crashes
(iris)
- st/dri: assume external consumers of back buffers can write to the
buffers
- radeonsi: enable NIR by default and document GL 4.6 support
- radeonsi/gfx10: disable vertex grouping
- radeonsi/gfx10: simplify the tess_turns_off_ngg condition
- radeonsi: don't rely on CLEAR_STATE to set PA_SC_GENERIC_SCISSOR\_\*
- ac: fix ac_get_i1_sgpr_mask for Wave32
- ac: fix the return value in cull_bbox when bbox culling is disabled
- radeonsi: deduplicate ES and GS thread enablement code
- radeonsi: disallow compute-based culling if polygon mode is enabled
- radeonsi: set is_monolithic for VS prologs when the shader is really
monolithic
- radeonsi: don't wrap the VS prolog in if (ES thread) .. endif
- radeonsi/gfx10: don't insert NGG streamout atomics if they are never
used
- radeonsi: allow generating VS prologs with 0 inputs
- radeonsi: fix determining whether the VS prolog is needed
- radeonsi: reset more fields in si_llvm_context_set_ir to fix reusing
ctx
- radeonsi/gfx10: fix ngg_get_ordered_id
- amd/addrlib: update to the latest version
- ac/surface: fix an assertion failure on gfx9 in CMASK computation
- radeonsi/gfx10: don't declare any LDS for NGG if it's not used
- radeonsi/gfx10: enable NGG passthrough for eligible shaders
- radeonsi/gfx10: improve performance for TES using PrimID but not
exporting it
- Revert "u_vbuf: Regard non-constant vbufs with non-instance elements
as free"
- winsys/radeon: initialize pte_fragment_size
- radeonsi: preserve the scanout flag for shared resources on gfx9 and
gfx10
- radeonsi: ignore PIPE_BIND_SCANOUT for imported textures
- radeonsi: remove the "display_dcc_offset == 0" assertion
- radeonsi: rename SDMA debug flags
- radeonsi: remove broken and unused SI SDMA image copy code
- radeonsi: add AMD_DEBUG=nodmaclear for debugging
- radeonsi: add AMD_DEBUG=nodmacopyimage for debugging
- radeonsi: rename dma_cs -> sdma_cs
- radeonsi: move SI and CIK+ SDMA code into 1 common function for
cleanups
- radeonsi: disable SDMA on gfx8 to fix corruption on RX 580
- radeonsi: remove TGSI
- gallium: put u_vbuf_get_caps return values into u_vbuf_caps
- gallium/cso_context: move non-vbuf vertex buffer and element code
into helpers
- gallium: bypass u_vbuf if it's not needed (no fallbacks and no user
VBOs)
- ac/gpu_info: always use distributed tessellation on gfx10
- radeonsi: fix monolithic pixel shaders with two-sided colors and
SampleMaskIn
- radeonsi: fix context roll tracking in si_emit_shader_vs
- radeonsi: test polygon mode enablement accurately
- radeonsi: determine accurately if line stippling is enabled for
performance
- radeonsi: clean up messy si_emit_rasterizer_prim_state
- ac: unify build_sendmsg_gs_alloc_req
- ac: unify primitive export code
- ac/gpu_info: add pc_lines and use it in radeonsi
- ac: add 128-bit bitcount
- ac: add ac_build_s_endpgm
- radeonsi/gfx9: force the micro tile mode for MSAA resolve correctly
on gfx9
- radeonsi: rename desc_list_byte_size -> vb_desc_list_alloc_size
- radeonsi: add si_context::num_vertex_elements
- radeonsi: don't allow draw calls with uninitialized VS inputs
- radeonsi: simplify si_set_vertex_buffers
- ac,radeonsi: increase the maximum number of shader args and return
values
- radeonsi: put up to 5 VBO descriptors into user SGPRs
- radeonsi: don't enable VBOs in user SGPRs if compute-based culling
can be used
- radeonsi: fix assertion and other failures in
si_emit_graphics_shader_pointers
- radeonsi: actually enable VBOs in user SGPRs
- radeonsi: don't adjust depth and stencil PS output locations
- radeonsi: rename DBG_NO_TGSI -> DBG_NO_NIR
- radeonsi: remove TGSI from comments
- radeonsi: rename si_shader_info -> si_shader_binary_info
- radeonsi: fork tgsi_shader_info and tgsi_tessctrl_info
- radeonsi: merge si_tessctrl_info into si_shader_info
- radeonsi: clean up si_shader_info
- radeonsi: rename si_compile_tgsi_main -> si_build_main_function
- radeonsi: rename si_shader_create -> si_create_shader_variant for
clarity
- radeonsi: fold si_create_function into si_llvm_create_func
- radeonsi: remove always constant ballot_mask_bits from
si_llvm_context_init
- radeonsi: move PS LLVM code into si_shader_llvm_ps.c
- radeonsi: separate code computing info for small primitive culling
- ac/cull: don't read Position.Z if it's not needed for culling
- radeonsi: make si_insert_input\_\* functions non-static
- radeonsi: move VS_STATE.LS_OUT_PATCH_SIZE a few bits higher to make
space there
- radeonsi/gfx10: separate code for getting edgeflags from the
gs_invocation_id VGPR
- radeonsi/gfx10: separate code for determining the number of vertices
for NGG
- radeonsi: fix si_build_wrapper_function for compute-based primitive
culling
- radeonsi: work around an LLVM crash when using
llvm.amdgcn.icmp.i64.i1
- radeonsi: move si_insert_input\_\* functions
- radeonsi: move tessellation shader code into si_shader_llvm_tess.c
- radeonsi: remove llvm_type_is_64bit
- radeonsi: move geometry shader code into si_shader_llvm_gs.c
- radeonsi: move code for shader resources into
si_shader_llvm_resources.c
- radeonsi: remove useless #includes
- radeonsi: merge si_compile_llvm and si_llvm_compile functions
- gallium: add st_context_iface::flush_resource to call FLUSH_VERTICES
- st/dri: do FLUSH_VERTICES before calling flush_resource
- Revert "radeonsi: unbind image before compute clear"
- radeonsi: clean up how internal compute dispatches are handled
- radeonsi: don't invoke decompression inside internal launch_grid
- radeonsi: fix doubles and int64
- radeonsi: turn an assertion into return in si_nir_store_output_tcs
- ac: add prefix bitcount functions
- ac: add ac_build_readlane without optimization barrier
- radeonsi/gfx10: update comments and remove invalid TODOs
- radeonsi/gfx10: correct VS PrimitiveID implementation for NGG
- radeonsi/gfx10: move s_sendmsg gs_alloc_req to the beginning of
shaders
- radeonsi/gfx10: export primitives at the beginning of VS/TES
- radeonsi/gfx10: merge main and pos/param export IF blocks into one if
possible
- radeonsi/gfx10: don't initialize VGPRs not used by NGG passthrough
- radeonsi/gfx10: move GE_PC_ALLOC setting to shader states
- radeonsi/gfx10: implement NGG culling for 4x wave32 subgroups
- ac: add helper ac_build_triangle_strip_indices_to_triangle
- radeonsi/gfx10: rewrite late alloc computation
- radeonsi/gfx10: enable GS fast launch for triangles and strips with
NGG culling
- radeonsi: use ctx->ac. for types and integer constants
- radeonsi: move non-LLVM code out of si_shader_llvm.c
- radeonsi: move VS shader code into si_shader_llvm_vs.c
- radeonsi: move si_shader_llvm_build.c content into si_shader_llvm.c
- radeonsi: minor cleanup in si_shader_internal.h
- radeonsi: move si_nir_build_llvm into si_shader_llvm.c
- radeonsi: fold si_shader_context_set_ir into si_build_main_function
- radeonsi: move more LLVM functions into si_shader_llvm.c
- radeonsi: make si_compile_llvm return bool
- radeonsi: make si_compile_shader return bool
- radeonsi: change prototypes of si_is_multi_part_shader &
si_is_merged_shader
- radeonsi: separate LLVM compilation from non-LLVM code
- util/simple_mtx: add a missing include to get ASSERTED
- gallium/util: add a cache of live shaders for shader CSO
deduplication
- radeonsi: use the live shader cache
- radeonsi: restructure si_shader_cache_load_shader
- radeonsi: print shader cache stats with AMD_DEBUG=cache_stats
- radeonsi: expose shader cache stats to the HUD
- radeonsi: make screen available to shader part compilation
- radeonsi: fix a regression since the addition of si_shader_llvm_vs.c
- Revert "winsys/amdgpu: Close KMS handles for other DRM file
descriptions"
- Revert "winsys/amdgpu: Re-use amdgpu_screen_winsys when possible"
- radeonsi: don't report that multi-plane formats are supported
- radeonsi: fix the DCC MSAA bug workaround
- radeonsi: don't wait for shader compilation to finish when destroying
a context
Marek Vasut (5):
- etnaviv: Replace bitwise OR with logical OR
- etnaviv: tgsi: Fix gl_FrontFacing support
- etnaviv: Report correct number of vertex buffers
- etnaviv: Do not filter out PIPE_FORMAT_S8_UINT_Z24_UNORM on
pre-HALTI2
- etnaviv: Destroy rsc->pending_ctx set in etna_resource_destroy()
Mark Janes (3):
- Revert "st/mesa: call nir_serialize only once per shader"
- Revert "st/mesa: keep serialized NIR instead of nir_shader in
st_program"
- iris: separating out common perf code
Markus Wick (3):
- mapi/glapi: Generate sizeof() helpers instead of fixed sizes.
- mesa/glthread: Implement ARB_multi_bind.
- drirc: Enable glthread for dolphin/citra/yuzu.
Martin Fuzzey (1):
- etnaviv: update Android build files
Mathias Fröhlich (1):
- egl: Implement getImage/putImage on pbuffer swrast.
Matt Turner (19):
- intel/compiler: Use ARRAY_SIZE()
- intel/compiler: Extract GEN\_\* macros into separate file
- intel/compiler: Split has_64bit_types into float/int
- intel/compiler: Don't disassemble align1 3-src operands on Gen < 10
- intel/compiler: Limit compaction unit tests to specific gens
- intel/compiler: Add NF some more places
- intel/compiler: Add a INVALID_{,HW_}REG_TYPE macros
- intel/compiler: Split hw_type tables
- intel/compiler: Handle invalid inputs to brw_reg_type_to_*()
- intel/compiler: Handle invalid compacted immediates
- intel/compiler: Factor out brw_validate_instruction()
- intel/compiler: Validate some instruction word encodings
- intel/compiler: Add unit tests for new EU validation checks
- intel/compiler: Validate fuzzed instructions
- intel/compiler: Test compaction on Gen <= 12
- gitlab-ci: Skip ext_timer_query/time-elapsed
- intel/compiler: Move Gen4/5 rounding to visitor
- util: Explain BITSET_FOREACH_SET params
- util: Remove tmp argument from BITSET_FOREACH_SET macro
Mauro Rossi (9):
- android: aco: fix Lower to CSSA
- android: radeonsi: fix build error due to wrong u_format.csv file
path
- android: util/format: fix include path list
- android: radeonsi: fix build after vl refactoring (v2)
- android: nir: add a load/store vectorization pass
- android: util: Add a mapping from VkFormat to PIPE_FORMAT.
- android: radv: fix vk_format_table.c generated source build
- android: radeonsi,ac: fix building error due to ac changes
- android: radv: build radv_shader_args.c
Michel Dänzer (36):
- gitlab-ci: Set arm job CCACHE_DIR properly
- gitlab-ci: Use separate arm64 build/test docker images
- gitlab-ci: Don't build libdrm for ARM
- gitlab-ci: Use ninja -j4 for building dEQP
- gitlab-ci: Move artifact preparation to separate script
- gitlab-ci: Share dEQP build process between x86 & ARM test image
scripts
- gitlab-ci: Sort packages in debian-install.sh
- gitlab-ci: Run piglit tests with llvmpipe
- gitlab-ci: Use separate docker images for x86 build/test jobs
- gitlab-ci: Delete install/bin from artifacts as well
- gitlab-ci: Document that ci-templates refs must be in sync
- gitlab-ci: Use functional container job names
- gitlab-ci: Rename container install scripts to match job names
(better)
- gitlab-ci: Organize images using new REPO_SUFFIX templates feature
- gitlab-ci: Directly use host-mapped directory for ccache
- gitlab-ci: Stop reporting piglit test results via JUnit
- gitlab-ci: Stop storing piglit test results as JUnit
- gitlab-ci: Put HTML summary in artifacts for failed piglit jobs
- gitlab-ci: Update to current ci-templates master
- gitlab-ci: Run piglit glslparser & quick_shader tests separately
- glsl/tests: Use splitlines() instead of strip()
- gitlab-ci: Use the common run policy for LAVA jobs as well again
- gitlab-ci: Overhaul job run policy
- gitlab-ci: Don't exclude any piglit quick_shader tests
- gitlab-ci: Test against LLVM / clang 9 on x86
- gitlab-ci: Stop using manual jobs for merge requests
- gitlab-ci: Set GIT_STRATEGY to none for the dummy job
- gitlab-ci: Use single if for manual job rules entry
- winsys/amdgpu: Keep a list of amdgpu_screen_winsyses in amdgpu_winsys
- winsys/amdgpu: Keep track of retrieved KMS handles using hash tables
- winsys/amdgpu: Only re-export KMS handles for different DRM FDs
- util: Add os_same_file_description helper
- winsys/amdgpu: Re-use amdgpu_screen_winsys when possible
- winsys/amdgpu: Close KMS handles for other DRM file descriptions
- winsys/amdgpu: Re-use amdgpu_screen_winsys when possible
- winsys/amdgpu: Close KMS handles for other DRM file descriptions
Michel Zou (3):
- Meson: Check for dladdr with MinGW
- disk_cache_get_function_timestamp: check for dladdr
- Meson: Add llvm>=9 modules
Miguel Casas-Sanchez (1):
- i965: Ensure that all 2101010 image imports can pass framebuffer
completeness.
Nanley Chery (3):
- gallium/dri2: Fix creation of multi-planar modifier images
- gallium: Store the image format in winsys_handle
- iris: Fix import of multi-planar surfaces with modifiers
Nataraj Deshpande (1):
- egl/android: Restrict minimum triple buffering for android
color_buffers
Nathan Kidd (1):
- llvmpipe: Check thread creation errors
Neha Bhende (3):
- st/mesa: release tgsi tokens for shader states
- svga: fix size of format_conversion_table[]
- svga: Use pipe_shader_state_from_tgsi to set shader state
Neil Armstrong (3):
- Add support for T820 CI Jobs
- ci: Remove T820 from CI temporarily
- gitlab-ci/lava: add pipeline information in the lava job name
Neil Roberts (9):
- nir/opcodes: Add a helper function to generate the comparison binops
- nir/opcodes: Add a helper function to generate reduce opcodes
- nir: Add a 16-bit bool type
- nir: Add a 8-bit bool type
- nir/lower_alu_to_scalar: Support lowering 8- and 16-bit reduce ops
- freedreno/ir3: Support 16-bit comparison instructions
- freedreno/ir3: Add implementation of nir_op_b16csel
- freedreno/ir3: Implement f2b16 and i2b16
- freedreno/ir3: Enabling lowering 16-bit flrp
Paul Cercueil (5):
- kmsro: Extend to include ingenic-drm
- u_vbuf: Mark vbufs incompatible if more were requested than HW
supports
- u_vbuf: Only create driver CSO if no incompatible elements
- u_vbuf: Regard non-constant vbufs with non-instance elements as free
- u_vbuf: Return true in u_vbuf_get_caps if nb of vbufs is below
minimum
Paul Gofman (1):
- state_tracker: Handle texture view min level in st_generate_mipmap()
Paulo Zanoni (2):
- intel/compiler: remove the operand restriction for src1 on GLK
- intel/compiler: fix nir_op_{i,u}*32 on ICL
Peng Huang (1):
- radeonsi: make si_fence_server_signal flush pipe without work
Philipp Sieweck (1):
- svga: check return value of define_query_vgpu{9,10}
Pierre Moreau (4):
- compiler/spirv: Fix uses of gnu struct = {} extension
- include/CL: Update OpenCL headers to latest
- clover: Use the dispatch table type from the OpenCL headers
- clover/meson: Define OpenCL header macros
Pierre-Eric Pelloux-Prayer (54):
- radeonsi: tell the shader disk cache what IR is used
- mesa: enable msaa in clear_with_quad if needed
- mesa: pass vao as a function paramter
- mesa: add EXT_dsa glVertexArray\* functions declarations
- mesa: rework \_mesa_lookup_vao_err to allow usage from EXT_dsa
- mesa: add vao/vbo lookup helper for EXT_dsa
- mesa: add EXT_dsa glVertexArray\* functions implementation
- mesa: add gl_vertex_array_object parameter to client state helpers
- mesa: add EXT_dsa glEnableVertexArrayEXT / glDisableVertexArrayEXT
- mesa: add EXT_dsa EnableVertexArrayAttribEXT /
DisableVertexArrayAttribEXT
- mesa: extract helper function from \_mesa_GetPointerv
- mesa: add EXT_dsa glGetVertexArray\* 4 functions
- mesa: fix call to \_mesa_lookup_vao_err
- radeonsi: fix shader disk cache key
- radeonsi: enable mesa_glthread for GfxBench
- mesa: update features.txt to reflect EXT_dsa status
- mesa: add ARB_framebuffer_no_attachments named functions
- mesa: add ARB_vertex_attrib_64bit VertexArrayVertexAttribLOffsetEXT
- mesa: add ARB_clear_buffer_object named functions
- mesa: add ARB_gpu_shader_fp64 selector-less functions
- mesa: add ARB_instanced_arrays EXT_dsa function
- mesa: add ARB_texture_buffer_range glTextureBufferRangeEXT function
- mesa: implement ARB_texture_storage_multisample + EXT_dsa functions
- mesa: extend vertex_array_attrib_format to support EXT_dsa
- mesa: add ARB_vertex_attrib_binding glVertexArray\* functions
- mesa: add ARB_sparse_buffer NamedBufferPageCommitmentEXT function
- mesa: enable EXT_direct_state_access
- mesa: fix warning in 32 bits build
- radeonsi: implement sdma for GFX9
- radeonsi: display cs blit count for AMD_DEBUG=testdma
- radeonsi: use gfx9.surf_offset to compute texture offset
- radeonsi: fix multi plane buffers creation
- radeonsi: dcc dirty flag
- st/mesa: add a notify_before_flush callback param to flush
- st/dri: use st->flush callback to flush the backbuffer
- radeonsi: disable dcc for 2x MSAA surface and bpe < 4
- gallium: refuse to create buffers larger than UINT32_MAX
- radeon/vcn2: enable rate control for hevc encoding
- radeonsi: check ctx->sdma_cs before using it
- radeonsi: release saved resources in si_retile_dcc
- radeonsi: release saved resources in si_compute_expand_fmask
- radeonsi: release saved resources in si_compute_clear_render_target
- radeonsi: release saved resources in si_compute_copy_image
- radeonsi: release saved resources in si_compute_clear_12bytes_buffer
- radeonsi: release saved resources in si_compute_do_clear_or_copy
- radeonsi: fix fmask expand compute shader
- radeonsi: make sure fmask expand is done if needed
- radeonsi: unbind image before compute clear
- radeonsi: drop the negation from fmask_is_not_identity
- util: call bind_sampler_states before setting sampler_views
- radeonsi: move AMD_DEBUG tests to AMD_TEST
- docs: document AMD_DEBUG variable
- radeonsi: stop using the VM_ALWAYS_VALID flag
- radeonsi/ngg: add VGT_FLUSH when enabling fast launch
Prodea Alexandru-Liviu (2):
- Meson: Remove lib prefix from graw and osmesa when building with
Mingw. Also remove version sufix from osmesa swrast on Windows.
- Appveyor: Quickly fix meson build. As this required use of Python
3.8, mako module also had to be updated.
Qiang Yu (3):
- lima: sync lima_drm.h with kernel
- lima: create heap buffer with new interface if available
- lima: add noheap debug option
Rafael Antognolli (23):
- intel/isl: Add MOCS settings to isl_device.
- anv: Use mocs settings from isl_dev.
- iris: Use mocs from isl_dev.
- intel: Add workaround for stencil state.
- intel/genxml: Add 3DSTATE_CONSTANT_ALL packet.
- intel/aubinator: Decode 3DSTATE_CONSTANT_ALL.
- intel/blorp: Use 3DSTATE_CONSTANT_ALL to setup push constants.
- iris: Rework push constants emitting code.
- iris: Use 3DSTATE_CONSTANT_ALL when possible.
- anv: Move gen8+ push constant packet workaround.
- anv: Add get_push_range_address() helper.
- anv: Move code for emitting push constants into its own function.
- anv: Use 3DSTATE_CONSTANT_ALL when possible.
- iris: Add restriction to 3DSTATE_CONSTANT\_ packets.
- util/os_socket: Add socket related functions.
- vulkan/overlay: Add a control socket.
- vulkan/overlay: Add support for a control socket.
- vulkan/overlay: Add a command to start capturing data to a file.
- vulkan/overlay: Add basic overlay control script.
- vulkan/overlay: Update docs.
- iris: Implement WA for push constants.
- utils/os_socket: Define ssize_t on windows.
- intel: Load the driver even if I915_PARAM_REVISION is not found.
Rhys Perry (131):
- radv: adjust loop unrolling heuristics for int64
- aco: add Instruction::usesModifiers() and add more checks in the
optimizer
- radv: fix radv_nir_get_max_workgroup_size when nir=NULL
- aco: use DPP instead of exec modification when lowering GFX10
shuffles
- aco: fix shuffle with uniform operands
- nir/divergence: improve DA of shuffle
- aco: fix read_invocation with VGPR lane index
- aco: don't propagate vgprs into v_readlane/v_writelane
- aco: combine read_invocation and shuffle implementations
- radv: enable FP16/FP64 denormals earlier and only for LLVM
- aco: don't combine literals into v_cndmask_b32/v_subb/v_addc
- aco: fix 64-bit fsign with 0
- aco: implement VK_KHR_shader_float_controls
- aco: refactor reduction lowering helpers
- aco: implement 64-bit integer reductions
- radv/aco: enable VK_KHR_shader_subgroup_extended_types
- nir: make nir_variable::{num_members,num_state_slots} a uint16_t
- nir: add nir_variable::index and nir_index_vars
- nir/large_constants: use nir_index_vars and nir_variable::index
- docs: update features.txt for RADV
- aco: improve waitcnt insertion around loops
- aco: fix copy+paste error
- aco: fix waitcnts for barriers at block ends
- nir: add nir_num_variable_modes and nir_var_mem_push_const
- radv: set alignment for load_ssbo/store_ssbo in meta shaders
- nir: add a load/store vectorization pass
- nir: add load/store vectorizer tests
- aco: enable load/store vectorizer
- aco: allow constant offsets for global/scratch instructions on GFX10
- aco: set dlc/glc correctly for image loads
- aco: propagate p_wqm on an image_sample's coordinate p_create_vector
- aco: fix i2i64
- aco: fix incorrect cast in parse_wait_instr()
- aco: add v_nop inbetween exec write and VMEM/DS/FLAT
- aco: improve WAR hazard workaround with >64bit stores
- aco: fix GFX10 opcodes for some global/flat atomics
- aco: fix assembly of FLAT/GLOBAL atomics
- aco: fix SADDR with FLAT on GFX10
- aco: don't enable store_global for helper invocations
- aco: improve FLAT/GLOBAL scheduling
- aco: implement global atomics
- ac/llvm: fix pointer type for global atomics
- ac/llvm: improve sync scope for global atomics
- radv: set writes_memory for global memory stores/atomics
- aco: validate the CFG
- aco: handle loop exit and IF merge phis with break/discard
- aco: fix block_kind_discard s_andn2 definition to exec
- nir/lower_io_to_vector: don't create arrays when not needed
- nir/load_store_vectorize: fix combining stores with aliasing loads
between
- aco/wave32: fix comparison optimizations
- aco: improve jump threading with wave32
- aco: fix vgpr alloc granule with wave32
- aco: limit register usage for large work groups
- aco: set vm for pos0 exports on GFX10
- aco: fix imageSize()/textureSize() with large buffers on GFX8
- aco: fix uninitialized data in the binary
- aco: handle VOP3 modifiers when combining a constant comparison's NaN
test
- aco: handle omod successors with the constant in the first operand
- aco: check usesModifiers() when identifying a neg/abs
- aco: better handle neg/abs of sgprs
- aco: set exec_potentially_empty for demotes
- aco: don't DCE atomics with return values
- aco: disable add combining for ds_swizzle_b32
- aco: check if multiplication/clamp is live when applying output
modifier
- nir/divergence: handle load_primitive_id in GS
- nir/lower_gs_intrinsics: add option for per-stream counts
- aco: update IR validator
- aco: apply literals to split mads
- aco: combine two sgprs into a VALU if they're the same
- aco: improve can_use_VOP3()
- aco: rewrite literal combining
- aco: rewrite apply_sgprs()
- aco: add check_vop3_operands()
- aco: be more careful with literals in combine_salu_{n2,lshl_add}
- aco: follow through temporary when merging tests into constant
comparisons
- aco: allow applying two sgprs to an instruction
- aco: allow an extra SGPR with multiple uses to be applied to VOP3
- aco: take advantage of GFX10's constant bus limit and VOP3 literals
- aco: improve creation of v_madmk_f32/v_madak_f32
- aco: fix clamp optimization
- aco: improve clamp optimization
- aco: add min(-max(), ) and max(-min(), ) optimization
- aco: don't move literal to reg when making an instruction VOP3 on
GFX10
- aco: allow input modifiers on v_cndmask_b32
- aco: replace extract_vector with copies
- aco: improve readfirstlane after uniform LDS loads
- aco: add integer min/max to can_swap_operands
- nir/sink,nir/move: move/sink load_per_vertex_input
- nir/sink,nir/move: move/sink nir_op_mov
- nir/algebraic: a & ~(a >> 31) -> imax(a, 0)
- aco: fix stack buffer overflow in apply_sgprs()
- aco: fix fall-through test in try_remove_simple_block() with
back-edges
- aco: fix operand kill flags when a temporary is used more than once
- aco: fix off-by-one error when initializing sgpr_live_in
- radv: move gs copy shader creation before other variants
- aco: improve support for s_sendmsg
- radv/aco,aco: implement GS on GFX9+
- aco: implement GS on GFX7-8
- radv/aco: allow ACO for GS
- aco: explicitly mark end blocks for exports
- aco: remove needs_instance_id
- aco: implement GS copy shaders
- radv/aco: use ACO for GS copy shaders
- aco: use nir_move_copies
- aco: fix WaR check for >64-bit FLAT/GLOBAL instructions
- aco: fix operand to scc when selecting SGPR ufind_msb/ifind_msb
- aco: always add sgprs to sgpr_ids when choosing literals
- aco: fix literal application with v_cndmask_b32/v_addc_co_u32/etc
- amd/common,radv: move vertex_format_table to ac_shader_util.{h,c}
- aco: rework vertex fetching a bit
- aco: skip unused channels at the start when fetching vertices
- aco: handle unaligned vertex fetch on GFX10
- aco: value-number MUBUF instructions
- aco: use MUBUF in some situations instead of splitting vertex fetches
- aco: fix rebase error from GS copy shader support
- aco: ensure predecessors' p_logical_end is in WQM when a p_phi is in
WQM
- aco: run p_wqm instructions in WQM
- nir/algebraic: add patterns for a >> #b << #b
- nir/algebraic: add some half packing optimizations
- aco: fix target calculation when vgpr spilling introduces sgpr
spilling
- aco: don't consider loop header blocks branch blocks in
add_coupling_code
- aco: don't update demand in add_coupling_code() for loop headers
- aco: only create parallelcopy to restore exec at loop exit if needed
- aco: don't always add logical edges from continue_break blocks to
headers
- aco: error when block has no logical preds but VGPRs are live at the
start
- aco: set exec_potentially_empty after continues/breaks in nested IFs
- aco: improve assertion at the end of spiller
- aco: fill reg_demand with sensible information in add_coupling_code()
- aco: parallelcopy exec mask before s_wqm
- aco: fix exec mask consistency issues
- aco: fix gfx10_wave64_bpermute
Ricardo Garcia (1):
- anv: Unify GetDeviceQueue and GetDeviceQueue2
Rob Clark (89):
- freedreno/ir3: split pre-coloring to it's own function
- freedreno/ir3: use SSA flag on dest register too
- freedreno/ir3: ir3_print tweaks
- freedreno/ir3/ra: move regs_count==0 check
- freedreno/ir3/ra: remove ir print after livein/out
- freedreno/ir3: remove obsolete comment
- freedreno/a3xx: fix SP_FS_MRT_REG.HALF_PRECISION
- freedreno/a4xx: fix SP_FS_MRT_REG.HALF_PRECISION
- freedreno/ir3: sync disasm changes from envytools
- freedreno/ir3: also track # of nops for shader-db
- freedreno: fix eglDupNativeFenceFD error
- freedreno/ir3: fix valgrind complaint with STLW
- freedreno/ir3: remove half-precision output
- freedreno/ir3: rename fanin/fanout to collect/split
- freedreno/ir3: remove impossible condition
- freedreno/ir3: add input/output iterators
- freedreno/ir3: show input/output wrmask's in disasm
- freedreno/ir3: helper to print ir if debug enabled
- freedreno/ir3: remove first-vertex sysval
- freedreno/ir3: simplify creating sysval inputs
- freedreno/ir3: re-work shader inputs/outputs
- freedreno/ir3: only tex instructions have wrmask
- freedreno/ir3: fix gpu hang with pre-fs-tex-fetch
- freedreno/ir3: legalize cleanups
- freedreno/ir3: remove unused parameter
- freedreno/perfcntrs: small cleanup
- freedreno/perfcntrs: remove gallium dependencies
- freedreno/perfcntrs: move to shared location
- freedreno/perfcntrs: add accessor to get per-gen tables
- freedreno/perfctrs/a2xx: move CP to be first group
- freedreno/perfcntrs/a6xx: remove RBBM counters
- freedreno/perfcntrs: add fdperf
- freedreno/perfctrs/fdperf: periodically restore counters
- gitlab-ci: update deqp build so we can generate xml
- gitlab-ci/deqp: preserve full list of unexpected results
- gitlab-ci/deqp: preserve caselists for blocks with fails
- gitlab-ci/deqp: detect and report flakes
- gitlab-ci: bump arm test container
- gitlab-ci/deqp: generate xml results for fails/flakes
- gitlab-ci/deqp: generate junit results
- gitlab-ci/freedreno/a6xx: remove most of the flakes
- freedreno: use rsc->slice accessor everywhere
- freedreno: switch to layout helper
- gitlab-ci: disable junit results for deqp
- freedreno/ir3: remove store_output lowered to store_shared_ir3
- freedreno/ir3: fix neverball assert in case of unused VS inputs
- nir/lower_clip: Fix incorrect driver loc for clipdist outputs
- freedreno/fdperf: use drmOpen()
- freedreno/a6xx: disable LRZ when blending
- freedreno/a5xx+a6xx: split LRZ layout to per-gen
- freedreno/a6xx: fix LRZ layout
- freedreno/a6xx: fix LRZ logic
- freedreno/a6xx: enable LRZ by default
- spirv: add OpLifetime\*
- freedreno/ir3: add last-baryf shaderdb stat
- freedreno/ir3: add scheduler traces
- freedreno/ir3: add iterator macros
- freedreno/a6xx: fix OUT_REG() vs growable cmdstream
- nir+vtn: vec8+vec16 support
- freedreno/ir3: fix flat shading again
- nir: assert that nir_lower_tex runs after lowering derefs
- mesa/st: lower samplers before nir_lower_tex
- freedreno/ir3: rename instructions
- gitlab-ci: fix missing caselist.css/xsl
- freedreno/a6xx: limit scratch/debug markers to debug builds
- freedreno/a6xx: cleanup rasterizer state
- freedreno/a6xx: separate rast stateobj for prim restart
- freedreno/a6xx: drop a few more per-draw registers
- freedreno/a6xx: move dynamic program state to streaming stateobj
- freedreno/a6xx: add PROG_FB_RAST stateobj
- freedreno/drm: fix invalid-cmdstream-size with older kernels
- freedreno: use PIPE_CAP_RGB_OVERRIDE_DST_ALPHA_BLEND
- mesa/st: random whitespace cleanup
- freedreno/a6xx: remove special handling based on MRT format
- freedreno/a6xx: convert blend state to stateobj
- freedreno: extract vsc pipe bo from GMEM state
- freedreno: consolidate GMEM state
- freedreno: constify fd_tile
- freedreno: constify fd_vsc_pipe
- freedreno/a6xx: constify gmem state
- freedreno/a5xx: constify gmem state
- freedreno/a4xx: constify gmem state
- freedreno/a3xx: constify gmem state
- freedreno/a2xx: constify gmem state
- freedreno: get GMEM state from batch
- freedreno: add gmem state cache
- freedreno: add gmem_lock
- freedreno: remove flush-queue
- freedreno: allow ctx->batch to be NULL
Robert Foss (5):
- nir: Build nir_lower_point_size.c in libmesa_nir
- android: Add panfrost support to build scripts
- android: Fix u_format_table.c being generated twice
- panfrost: Prefix schedule_program to prevent collision
- android: Fix whitespace issue
Rohan Garg (1):
- gitlab-ci: Use lavacli from packages
Roland Scheidegger (3):
- gallium/scons: fix graw_gdi build
- util/atomic: Fix p_atomic_add for unlocked and msvc paths
- winsys/svga: use new ioctl for logging
Roman Stratiienko (2):
- Android: Fix build issue without LLVM
- panfrost: Fix Android build
Ross Zwisler (1):
- intel: limit shader geometry on BDW GT1
Sagar Ghuge (1):
- intel/compiler: Clear accumulator register before EOT
Samuel Iglesias Gonsálvez (1):
- main: fix coverity error in \_mesa_program_resource_find_name()
Samuel Pitoiset (202):
- radv: declare NGG scratch for VS or TES and only on GFX10
- radv: fix compute pipeline keys when optimizations are disabled
- docs: document all RADV environment variables
- radv: add a note about perftest/debug options
- radv: fix 32-bit compiler warnings
- nir: fix packing of nir_variable
- radv/gfx10: enable wave32 for compute based on shader's wavesize
- radv: hardcode the number of waves for the GFX6 LS-HS bug
- radv: determine shaders wavesize at pipeline level
- radv: rely on shader's wavesize when computing NGG info
- radv: implement VK_EXT_subgroup_size_control
- radv/gfx10: fix primitive indices orientation for NGG GS
- ac: handle pointer types to LDS in ac_get_elem_bits()
- gitlab-ci: build a specific libdrm version for ARM64
- gitlab-ci: build RADV on ARM64
- ac: fix build with recent LLVM
- radv: remove useless RADV_DEBUG=unsafemath debug option
- radv: make sure to not clear the ds attachment after resolves
- ac: add radeon_info::has_l2_uncached
- radv: implement VK_AMD_device_coherent_memory
- spirv: fix lowering of OpGroupNonUniformAllEqual
- ac: remove useless cast in ac_build_set_inactive()
- ac: add 8-bit and 16-bit supports to ac_build_shuffle()
- ac: add 8-bit and 16-bit supports to ac_build_readlane()
- ac: add 8-bit and 16-bit supports to ac_build_set_inactive()
- ac: add 8-bit and 16-bit supports to ac_build_dpp()
- ac: add 8-bit and 16-bit supports to ac_build_swizzle()
- ac: add 8-bit and 16-bit supports to get_reduction_identity()
- ac: add 8-bit and 16-bit supports to ac_build_wwm()
- ac: add 8-bit and 16-bit supports to ac_build_optimization_barrier()
- ac: add 16-bit float support to ac_build_alu_op()
- radv: advertise VK_KHR_shader_subgroup_extended_types on GFX8-GFX9
- radv: enable VK_KHR_shader_subgroup_extended_types on GFX6-GFX7
- docs: add missing new features for RADV
- pipe-loader: check that the pointer to driconf_xml isn't NULL
- gitlab-ci: move building piglit into a separate script
- gitlab-ci: fix ldd check for Vulkan drivers
- gitlab-ci: add a job that only build things needed for testing
- gitlab-ci: do not build with debugoptimized for meson-main
- gitlab-ci: build swr in meson-main
- gitlab-ci: build GLVND in meson-clang
- gitlab-ci: remove now useless meson-swr-glvnd build job
- gitlab-ci: reduce the number of scons build
- radv: disable subgroup shuffle operations on GFX10
- ac/llvm: fix the local invocation index for wave32
- meson: only build imgui when needed
- radv: set the image view aspect mask during subpass transitions
- radv: set the image view aspect mask before resolves
- radv: rework creation of decompress/resummarize meta pipelines
- radv: create decompress pipelines for separate depth/stencil layouts
- radv: select the depth decompress path based on the aspect mask
- ac/llvm: fix warning in ac_build_canonicalize()
- radv: fix reporting subgroup size with
VK_KHR_pipeline_executable_properties
- radv: fix enabling sample shading with SampleID/SamplePosition
- radv/gfx10: fix implementation of exclusive scans
- ac: add 8-bit and 16-bit supports to ac_build_permlane16()
- radv: enable VK_KHR_shader_subgroup_extended_types on GFX10
- ac/llvm: convert src operands to pointers if necessary
- radv: add more constants to avoid using magic numbers
- radv,ac/nir: lower deref operations for shared memory
- aco: drop useless lowering of deref operations for shared memory
- ac/llvm: fix atomic var operations if source isn't a deref
- radv: remove dead shader input/output variables
- radv: simplify a check in radv_fixup_vertex_input_fetches()
- radv/gfx10: fix the vertex order for triangle strips emitted by a GS
- gitlab-ci: rename build-deqp.sh to build-deqp-gl.sh
- gitlab-ci: add a gl suffix to the x86 test image and all test jobs
- gitlab-ci: add a new job that builds a base test image for VK
- gitlab-ci: build cts_runner in the x86 test image for VK
- gitlab-ci: build dEQP VK 1.1.6 in the x86 test image for VK
- gitlab-ci: add a new base test job for VK
- gitlab-ci: allow to run dEQP Vulkan with DEQP_VER
- gitlab-ci: configure the Vulkan ICD export with VK_DRIVER
- gitlab-ci: build RADV in meson-testing
- gitlab-ci: add a job that runs Vulkan CTS with RADV conditionally
- radv: do not use VK_TRUE/VK_FALSE
- radv: move emission of two PA_SC\_\* registers to the pipeline CS
- radv: fix possibly wrong PA_SC_AA_CONFIG value for conservative rast
- radv: synchronize after performing a separate depth/stencil fast
clears
- radv: do not init HTILE as compressed state when dst layout allows it
- radv: initialize HTILE for separate depth/stencil aspects
- radv: implement VK_KHR_separate_depth_stencil_layouts
- gitlab-ci: set RADV_DEBUG=checkir for RADV test jobs
- ac/nir: fix out-of-bound access when loading constants from global
- radv: enable SpvCapabilityImageMSArray
- radv: handle unaligned vertex fetches on GFX6/GFX10
- radv/gfx10: fix ngg_get_ordered_id
- radv/gfx10: fix the out-of-bounds check for vertex descriptors
- ac: declare an enum for the OOB select field on GFX10
- radv: init a default multisample state for the resolve FS path
- radv: ignore pMultisampleState if rasterization is disabled
- radv: ignore pTessellationState if the pipeline doesn't use tess
- radv: ignore pDepthStencilState if rasterization is disabled
- radv: tidy up radv_pipeline_init_blend_state()
- radv: ignore pColorBlendState if rasterization is disabled
- radv: rely on pipeline layout when creating push descriptors with
template
- radv: return the correct pitch for linear mipmaps on GFX10
- radv: record number of color/depth samples for each subpass
- radv: implement VK_AMD_mixed_attachment_samples
- ac/surface: use uint16_t for mipmap level pitches
- radv: do not fill keys from fragment shader twice
- spirv: add SpvCapabilityImageReadWriteLodAMD
- spirv,nir: add new lod parameter to image_{load,store} intrinsics
- amd/llvm: handle nir_intrinsic_image_deref_{load,store} with lod
- aco: handle nir_intrinsic_image_deref_{load,store} with lod
- radv: advertise VK_AMD_shader_image_load_store_lod
- radv/gfx10: disable vertex grouping
- radv/gfx10: determine if a pipeline is eligible for NGG passthrough
- radv/gfx10: do not declare LDS for NGG if useless
- radv/gfx10: add support for NGG passthrough mode
- radv/gfx10: improve performance for TES using PrimID but not
exporting it
- radv: only use VkSamplerCreateInfo::compareOp if enabled
- radv/gfx10: enable all CUs if NGG is never used
- radv/gfx10: simplify some duplicated NGG GS code
- vulkan/overlay: Fix for Vulkan 1.2
- radv: update VK_EXT_descriptor_indexing for Vulkan 1.2
- radv: update VK_EXT_host_query_reset for Vulkan 1.2
- radv: update VK_EXT_sampler_filter_minmax for Vulkan 1.2
- radv: update VK_EXT_scalar_block_layout for Vulkan 1.2
- radv: update VK_KHR_8bit_storage for Vulkan 1.2
- radv: update VK_KHR_buffer_device_address for Vulkan 1.2
- radv: update VK_KHR_create_renderpass2 for Vulkan 1.2
- radv: update VK_KHR_depth_stencil_resolve for Vulkan 1.2
- radv: update VK_KHR_draw_indirect_count for Vulkan 1.2
- radv: update VK_KHR_driver_properties for Vulkan 1.2
- radv: update VK_KHR_image_format_list for Vulkan 1.2
- radv: update VK_KHR_imageless_framebuffer for Vulkan 1.2
- radv: update VK_KHR_shader_atomic_int64 for Vulkan 1.2
- radv: update VK_KHR_shader_float16_int8 for Vulkan 1.2
- radv: update VK_KHR_shader_float_controls for Vulkan 1.2
- radv: update VK_KHR_shader_subgroup_extended_types for Vulkan 1.2
- radv: update VK_KHR_uniform_buffer_standard_layout for Vulkan 1.2
- radv: update VK_KHR_timeline_semaphore for Vulkan 1.2
- radv: implement Vulkan 1.1 features and properties
- radv: implement Vulkan 1.2 features and properties
- radv: enable Vulkan 1.2
- aco: fix emitting SMEM instructions with no operands on GFX6-GFX7
- aco: do not select 96-bit/128-bit variants for ds_read/ds_write on
GFX6
- aco: do not combine additions of DS instructions on GFX6
- aco: implement stream output with vec3 on GFX6
- aco: fix emitting slc for MUBUF instructions on GFX6-GFX7
- aco: print assembly with CLRXdisasm for GFX6-GFX7 if found on the
system
- aco: fix constant folding of SMRD instructions on GFX6
- aco: do not use the vec3 variant for stores on GFX6
- aco: do not use the vec3 variant for loads on GFX6
- aco: add new addr64 bit to MUBUF instructions on GFX6-GFX7
- aco: implement nir_intrinsic_load_barycentric_at_sample on GFX6
- radv: fix double free corruption in radv_alloc_memory()
- radv: add explicit external subpass dependencies to meta operations
- radv: handle missing implicit subpass dependencies
- spirv: add SpvCapabilityFragmentMaskAMD
- nir: add two new texture ops for multisample fragment color/mask
fetches
- spirv: add support for SpvOpFragment{Mask}FetchAMD operations
- nir/lower_input_attachments: lower nir_texop_fragment_{mask}_fetch
- ac/nir: add support for nir_texop_fragment_{mask}_fetch
- aco: add support for nir_texop_fragment_{mask}_fetch
- radv: advertise VK_AMD_shader_fragment_mask
- aco: fix printing assembly with CLRXdisasm on GFX6
- aco: fix wrong IR in nir_intrinsic_load_barycentric_at_sample
- aco: implement nir_intrinsic_store_global on GFX6
- aco: implement nir_intrinsic_load_global on GFX6
- aco: implement nir_intrinsic_global_atomic\_\* on GFX6
- aco: implement 64-bit nir_op_ftrunc on GFX6
- aco: implement 64-bit nir_op_fceil on GFX6
- aco: implement 64-bit nir_op_fround_even on GFX6
- aco: implement 64-bit nir_op_ffloor on GFX6
- aco: implement nir_op_f2i64/nir_op_f2u64 on GFX6
- ac/llvm: fix missing casts in ac_build_readlane()
- aco: combine MRTZ (depth, stencil, sample mask) exports
- aco: fix a hardware bug for MRTZ exports on GFX6
- aco: fix a hazard with v_interp\_\* and v_{read,readfirst}lane\_\* on
GFX6
- aco: copy the literal offset of SMEM instructions to a temporary
- radv: enable ACO support for GFX6
- radv: print NIR shaders after lowering FS inputs/outputs
- radv: do not allow sparse resources with multi-planar formats
- radv: enable VK_AMD_shader_fragment_mask on GFX6-GFX7
- compiler: add a new explicit interpolation mode
- spirv: add support for SpvDecorationExplicitInterpAMD
- compiler: add PERSP to the existing barycentric system values
- compiler: add new SYSTEM_VALUE_BARYCENTRIC\_\*
- spirv: add support for SpvBuiltInBaryCoord\*
- nir: add nir_intrinsic_load_barycentric_model
- nir: lower SYSTEM_VALUE_BARYCENTRIC\_\* to nir_load_barycentric()
- nir: add nir_intrinsic_interp_deref_at_vertex
- nir: lower interp_deref_at_vertex to load_input_vertex
- spirv: implement SPV_AMD_shader_explicit_vertex_parameter
- ac/llvm: implement VK_AMD_shader_explicit_vertex_parameter
- aco: implement VK_AMD_shader_explicit_vertex_parameter
- radv: gather which input PS variables use an explicit interpolation
mode
- radv: implement VK_AMD_shader_explicit_vertex_parameter
- radv: bump conformance version to 1.2.0.0
- radv: remove the non conformant VK implementation warning on GFX10
- aco: fix VS input loads with MUBUF on GFX6
- radv/gfx10: add a separate flag for creating a GDS OA buffer
- radv/gfx10: implement NGG GS queries
- radv/gfx10: re-enable NGG GS
- radv: refactor physical device properties
- aco: fix MUBUF VS input loads when expanding vec3 to vec4 on GFX6
- aco: do not use ds_{read,write}2 on GFX6
- aco: fix waiting for scalar stores before "writing back" data on
GFX8-GFX9
- aco: fix creating v_madak if v_mad_f32 has two sgpr literals
- nir: do not use De Morgan's Law rules for flt and fge
Samuel Thibault (3):
- loader: #define PATH_MAX when undefined (eg. Hurd)
- util: Do not fail to build on unknown pthread_setname_np
- meson: Do not require libdrm for DRI2 on hurd
Satyajit Sahu (1):
- radeon/vcn: Handle crop parameters for encoder
Sonny Jiang (1):
- radeonsi: use compute shader for clear 12-byte buffer
Stephan Gerhold (1):
- kmsro: Add "mcde" entry point
Tapani Pälli (33):
- nir: fix couple of compile warnings
- util/android: fix android build errors
- Revert "egl: implement new functions from
EGL_EXT_image_flush_external"
- Revert "egl: handle EGL_IMAGE_EXTERNAL_FLUSH_EXT"
- Revert "st/dri: add support for EGL_EXT_image_flush_external"
- Revert "st/dri: assume external consumers of back buffers can write
to the buffers"
- Revert "dri_interface: add interface for
EGL_EXT_image_flush_external"
- mesa: allow bit queries for EXT_disjoint_timer_query
- Revert "mesa: allow bit queries for EXT_disjoint_timer_query"
- mesa: allow bit queries for EXT_disjoint_timer_query
- gitlab-ci: update Piglit commit, update skips
- mapi: add GetInteger64vEXT with EXT_disjoint_timer_query
- glsl: handle max uniform limits with lower_const_arrays_to_uniforms
- gitlab-ci: bump piglit checkout commit
- glsl: additional interface redeclaration check for SSO programs
- intel/compiler: add newline to limit_dispatch_width message
- intel/compiler: force simd8 when dual src blending on gen8
- dri: add \__DRI_IMAGE_FORMAT_SXRGB8
- i965: expose MESA_FORMAT_B8G8R8X8_SRGB visual
- mesa/st/i965: add a ProgramResourceHash for quicker resource lookup
- mesa: create program resource hash in a single place
- iris: set depth stall enabled when depth flush enabled on gen12
- anv: set depth stall enabled when depth flush enabled on gen12
- isl/gen12: add reminder comment about missing WA with 3D surfaces
- anv: fix assert in GetImageDrmFormatModifierPropertiesEXT
- anv: add assert for isl_mod_info in choose_isl_tiling_flags
- anv: initialize clear_color_is_zero_one
- egl/android: fix buffer_count for applications setting max count
- anv/android: setup gralloc1 usage from gralloc0 usage manually
- anv/android: make format_supported_with_usage static
- intel/vec4: fix valgrind errors with vf_values array
- glsl: fix a memory leak with resource_set
- iris: fix aux buf map failure in 32bits app on Android
Thomas Hellstrom (4):
- winsys/svga: Enable transhuge pages for buffer objects
- svga: Avoid discard DMA uploads
- gallium/util: Increase the debug_flush map depth
- svga: Fix banded DMA upload
Thong Thai (8):
- st/va: Convert interlaced NV12 to progressive
- util/format: Add the P010 format used for 10-bit videos
- gallium: Add PIPE_FORMAT_P010 support
- st/va: Add support for P010, used for 10-bit videos
- radeon: Use P010 for decoding of 10-bit videos
- r600: Remove HEVC related code since HEVC is not supported
- mesa: Prevent \_MaxLevel from being less than zero
- Revert "st/va: Convert interlaced NV12 to progressive"
Timothy Arceri (66):
- glsl: just use NIR to lower outputs when driver can't read outputs
- glsl: disable lower_fragdata_array() for NIR drivers
- mesa: add ARB_shading_language_include stubs
- glsl: add infrastructure for ARB_shading_language_include
- mesa: add ARB_shading_language_include infrastructure to
gl_shared_state
- mesa: add helper to validate tokenise shader include path
- mesa: add \_mesa_lookup_shader_include() helper
- mesa: add copy_string() helper
- mesa: add glNamedStringARB() support
- mesa: implement glGetNamedStringARB()
- mesa: make error checking optional in \_mesa_lookup_shader_include()
- mesa: implement glIsNamedStringARB()
- mesa: implement glGetNamedStringivARB()
- mesa: split \_mesa_lookup_shader_include() in two
- mesa: implement glDeleteNamedStringARB()
- glsl: add ARB_shading_language_include support to #line
- glsl: pass gl_context to glcpp_parser_create()
- glsl: add preprocessor #include support
- glsl: error if #include used while extension is disabled
- glsl: add can_skip_compile() helper
- glsl: delay compilation skip if shader contains an include
- mesa: add support cursor support for relative path shader includes
- mesa: add shader include lookup support for relative paths
- mesa: implement glCompileShaderIncludeARB()
- mesa: enable ARB_shading_language_include
- gitlab-ci: bump piglit checkout commit
- gitlab-ci: update for arb_shading_language_include
- compiler: move build definition of pp_standalone_scaffolding.c
- radv: add some infrastructure for fresh forks for each secure compile
- radv: add a secure_compile_open_fifo_fds() helper
- radv: create a fresh fork for each pipeline compile
- docs: update source code repository documentation
- glsl: move calculate_array_size_and_stride() to link_uniforms.cpp
- glsl: don't set uniform block as used when its not
- glsl: make use of active_shader_mask when building resource list
- glsl/nir: iterate the system values list when adding varyings
- docs: remove mailing list as way of submitting patches
- glsl: move nir_remap_dual_slot_attributes() call out of glsl_to_nir()
- glsl: copy the how_declared field when converting to nir
- nir: add some fields to nir_variable_data
- glsl: copy the new data fields when converting to nir
- glsl: add support for named varyings in
nir_build_program_resource_list()
- glsl: add subroutine support to nir_build_program_resource_list()
- st/glsl_to_nir: call gl_nir_lower_buffers() a little later
- st/glsl_to_nir: use nir based program resource list builder
- st/glsl_to_nir: fix SSO validation regression
- glsl: rename gl_nir_link() to gl_nir_link_spirv()
- glsl: add gl_nir_link_check_atomic_counter_resources()
- glsl: add new gl_nir_link_glsl() helper
- glsl: reorder link_and_validate_uniforms() calls
- mesa: add new UseNIRGLSLLinker constant
- glsl: use nir linker to link atomics
- glsl: add check_image_resources() for the nir linker
- glsl: use nir version of check_image_resources() for nir linker
- glsl: move check_subroutine_resources() into the shared util code
- glsl: call check_subroutine_resources() from the nir linker
- glsl: move uniform resource checks into the common linker code
- glsl: call uniform resource checks from the nir linker
- glsl: move calculate_subroutine_compat() to shared linker code
- glsl: call calculate_subroutine_compat() from the nir linker
- glsl: fix potential bug in nir uniform linker
- glsl: remove bogus assert in nir uniform linking
- glsl: fix check for matrices in blocks when using nir uniform linker
- glsl: count uniform components and storage better in nir linking
- glsl_to_nir: update interface type properly
- glsl: fix gl_nir_set_uniform_initializers() for image arrays
Timur Kristóf (39):
- ac: Handle invalid GFX10 format correctly in ac_get_tbuffer_format.
- aco: Make sure not to mistakenly propagate 64-bit constants.
- aco: Treat all booleans as per-lane.
- aco: Optimize out trivial code from uniform bools.
- aco: Fix operand of s_bcnt1_i32_b64 in emit_boolean_reduce.
- aco: Remove superfluous argument from emit_boolean_logic.
- aco: Remove lower_linear_bool_phi, it is not needed anymore.
- aco: Optimize load_subgroup_id to one bit field extract instruction.
- aco/wave32: Change uniform bool optimization to work with wave32.
- aco/wave32: Replace hardcoded numbers in spiller with wave size.
- aco/wave32: Introduce emit_mbcnt which takes wave size into account.
- aco/wave32: Add wave size specific opcodes to aco_builder.
- aco/wave32: Use lane mask regclass for exec/vcc.
- aco/wave32: Fix load_local_invocation_index to support wave32.
- aco/wave32: Use wave_size for barrier intrinsic.
- aco/wave32: Allow setting the subgroup ballot size to 64-bit.
- aco/wave32: Fix reductions.
- aco: Fix uniform i2i64.
- ac/llvm: Fix ac_build_reduce in wave32 mode.
- aco/wave32: Set the definitions of v_cmp instructions to the lane
mask.
- aco: Implement 64-bit constant propagation.
- aco: Allow optimizing vote_all and nir_op_iand.
- aco: Don't skip combine_instruction when definitions[1] is used.
- aco: Optimize out s_and with exec, when used on uniform bitwise
values.
- aco: Flip s_cbranch / s_cselect to optimize out an s_not if possible.
- nouveau/nvc0: add extern keyword to nvc0_miptree_vtbl.
- intel/compiler: Fix array bounds warning on GCC 10.
- radeon: Move si_get_pic_param to radeon_vce.c
- r600: Move get_pic_param to radeon_vce.c
- gallium: Fix a couple of multiple definition warnings.
- radeon: Fix multiple definition error with radeon_debug
- aco: Fix -Wstringop-overflow warnings in aco_span.
- aco: Fix maybe-uninitialized warnings.
- aco: Fix signedness compare warning.
- aco: Make a better guess at which instructions need the VCC hint.
- aco: Transform uniform bitwise instructions to 32-bit if possible.
- aco/gfx10: Fix VcmpxExecWARHazard mitigation.
- aco: Fix the meaning of is_atomic.
- aco/optimizer: Don't combine uniform bool s_and to s_andn2.
Tomasz Pyra (1):
- gallium/swr: Fix arb_transform_feedback2
Tomeu Vizoso (38):
- gitlab-ci: Disable lima jobs
- gitlab-ci: Run only LAVA jobs in special-named branches
- panfrost: Add checksum fields to SFBD descriptor
- panfrost: Set 0x10 bit on mali_shader_meta.unknown2_4 on T720
- panfrost: Rework format encoding on SFBD
- panfrost: Take into account texture layers in SFBD
- panfrost: Decode blend shaders for SFBD
- panfrost: Generate polygon list manually for SFBD
- panfrost: Print the right zero field
- panfrost: Pipe the GPU ID into compiler and disassembler
- panfrost: Set depth and stencil for SFBD based on the format
- panfrost: Multiply offset_units by 2
- panfrost: Make sure the shader descriptor is in sync with the GL
state
- gitlab-ci: Remove limit on kernel logging
- panfrost: Just print tiler fields as-is for Tx20
- panfrost: Rework buffers in SFBD
- gitlab-ci: Fix dir name for VK-GL-CTS sources
- panfrost: Don't print the midgard_blend_rt structs on SFBD
- panfrost: Add quirks system to cmdstream
- panfrost: Simplify shader patching
- panfrost: White list the Mali T720
- gitlab-ci: Test Panfrost on T720 GPUs
- panfrost: Add PAN_MESA_DEBUG=sync
- panfrost: Hold a reference to sampler views
- pan/midgard: Remove undefined behavior
- nir: Don't copy empty array
- util: Don't access members of NULL pointers
- panfrost: Don't lose bits!
- st/mesa: Don't access members of NULL pointers
- panfrost: Handle Z24_UNORM_S8_UINT as MALI_Z32_UNORM
- panfrost: Increase PIPE_SHADER_CAP_MAX_OUTPUTS to 16
- panfrost: Dynamically allocate array of texture pointers
- panfrost: Map with size of first layer for 3D textures
- panfrost: Store internal format
- gitlab-ci: Update kernel for LAVA to 5.5-rc1 plus fixes
- gitlab-ci: Switch LAVA jobs to use shared dEQP runner
- gitlab-ci: Upgrade kernel for LAVA jobs to v5.5-rc5
- gitlab-ci: Consolidate container and build stages for LAVA
Urja Rannikko (4):
- panfrost: free last_read/write tables in mir_create_dependency_graph
- panfrost: free allocations in schedule_block
- panfrost: add lcra_free() to free lcra state
- panfrost: free spill cost table in mir_spill_register
Vasily Khoruzhick (31):
- lima: add debug prints for BO cache
- lima: align size before trying to fetch BO from cache
- lima: ignore flags while looking for BO in cache
- lima: set dithering flag when necessary
- lima: add support for gl_PointSize
- lima: enable tiling
- lima: handle DRM_FORMAT_MOD_INVALID in resource_from_handle()
- lima: expose tiled format modifier in query_dmabuf_modifiers()
- lima: use single BO for GP outputs
- lima: drop suballocator
- lima: fix allocation of GP outputs storage for indexed draw
- lima: postpone PP stream generation
- lima: don't reload and redraw tiles that were not updated
- lima: fix PP stream terminator size
- lima: use linear layout for shared buffers if modifier is not
specified
- lima: add debug flag to disable tiling
- lima: drop support for R8G8B8 format
- lima: fix PLBU_CMD_PRIMITIVE_SETUP command
- lima: fix viewport clipping
- lima: implement polygon offset
- lima: fix PIPE_CAP\_\* to mark features that aren't supported yet
- lima: add new findings to texture descriptor
- lima: fix handling of reverse depth range
- ci: lava: pass CI_NODE_INDEX and CI_NODE_TOTAL to lava jobs
- ci: Re-enable CI for lima on mali450
- lima: implement invalidate_resource()
- nir: don't emit ishl in \_nir_mul_imm() if backend doesn't support
bitops
- lima: use imul for calculations with intrinsic src
- lima: ppir: don't delete root ld_tex nodes without successors in
current block
- lima: ppir: always create move and update ld_tex successors for all
blocks
- lima: disable early-z if fragment shader uses discard
Vinson Lee (9):
- swr: Fix build with llvm-10.0.
- panfrost: Fix gnu-empty-initializer build errors.
- scons: Bump C standard to gnu11 on macOS 10.15.
- util/u_thread: Restrict u_thread_get_time_nano on macOS.
- swr: Fix build with llvm-10.0.
- swr: Fix build with llvm-10.0.
- lima: Fix build with GCC 10.
- swr: Fix GCC 4.9 checks.
- panfrost: Remove unused anonymous enum variables.
Wladimir J. van der Laan (2):
- u_vbuf: add logic to use a limited number of vbufs
- u_vbuf: use single vertex buffer if it's not possible to have
multiple
X512 (1):
- util/u_thread: Fix build under Haiku
Yevhenii Kolesnikov (5):
- glsl: Enable textureSize for samplerExternalOES
- meson: Fix linkage of libgallium_nine with libgalliumvl
- meta: Cleanup function for DrawTex
- main: allow external textures for BindImageTexture
- meta: Add cleanup function for Bitmap
Zebediah Figura (1):
- Revert "draw: revert using correct order for prim decomposition."
luc (1):
- zink: confused compilation macro usage for zink in target helpers.
|