Difference between revisions of "SourcePawn Transitional Syntax"

From AlliedModders Wiki
Jump to: navigation, search
m
m (Add quantifier to methodmap property declGrammar)
 
(42 intermediate revisions by 7 users not shown)
Line 1: Line 1:
 
__FORCETOC__
 
__FORCETOC__
<b>EXPERIMENTAL</b>: This document is a prototype and not yet complete.
+
We would like to give our users a more modern language. Pawn is showing its age; manual memory management, buffers, tags, and lack of object-oriented API are very frustrating. We can't solve everything all at once, but we can begin to take steps in the right direction.
  
We would like to provide our users a more modern language. Pawn is showing its age; manual memory management, buffers, tags, and lack of object-oriented API are very frustrating for developers. We can't solve everything all at once, but we can begin to take steps in the right direction.
+
SourceMod 1.7 introduces a ''Transitional API''. It is built on a new ''Transitional Syntax'' in SourcePawn, which is a set of language tools to make Pawn feel more modern. In particular, it allows developers to use older APIs in an object-oriented manner, without breaking compatibility. Someday, if and when SourcePawn can become a full-fledged modern language, the transitional API will making porting efforts very minimal.
  
SourceMod 1.7 introduces what we call the "Transitional API". It is built on a new "Transitional Syntax" in SourcePawn, which is a set of language tools to make Pawn feel more modern. In particular, it allows developers to use older APIs in an object-oriented manner, without breaking compatibility. Someday, if and when SourcePawn can become a full-fledged modern language, the transitional API will making porting efforts very minimal.
+
The transitional API has the following major features and changes:
 +
* New Declarators - A cleaner way of declaring variables, similar to Java and C#.
 +
* Methodmaps - Object-oriented wrappers around older APIs.
 +
* Real Types - SourcePawn now has "int", "float", "bool", "void", and "char" as real types.
 +
* <tt>null</tt> - A new, general keyword to replace <tt>INVALID_HANDLE</tt>.
  
The transitional API, so far, has the following major features and changes:
+
=New Declarators=
* New Declarators - A cleaner way of declaring variables.
+
Developers familiar with pawn will recognize Pawn's original declaration style:
* Methodmaps - Object-oriented wrappers around existing API.
+
<sourcepawn>
* Classes - Stricter API that eliminates the Handle system.
+
new Float:x = 5.0;
 +
new y = 7;
 +
</sourcepawn>
 +
 
 +
In the transitional syntax, this can be reworded as:
 +
<sourcepawn>
 +
float x = 5.0;
 +
int y = 7;
 +
</sourcepawn>
 +
 
 +
The following builtin tags now have types:
 +
* <tt>Float:</tt> is <tt>float</tt>.
 +
* <tt>bool:</tt> is <tt>bool</tt>.
 +
* <tt>_:</tt> (or no tag) is <tt>int</tt>.
 +
* <tt>String:</tt> is <tt>char</tt>.
 +
* <tt>void</tt> can now be used as a return type for functions.
 +
 
 +
===Rationale===
 +
In the old style, tagged variables are not real types. <tt>Float:x</tt> does not indicate a float-typed variable, it indicates a 32-bit "cell" <i>tagged</i> as a float. It is possible to remove the tag or change the tag, which while flexible, is dangerous and confusing. The syntax itself is also problematic. The parser does not have the ability to identify characters in between the tag name and the colon. Internally, the compiler cannot represent values that are not exactly 32-bit.
 +
 
 +
The takeaway message is: there is no sensible way to represent a concept like "int64" or "X is a type that represents an array of floats". The tagging grammar makes it too awkward, and the compiler itself is incapable of attaching such information to a tag. We can't fix that right away, but we can begin to deprecate the tag system via a more normal declaration syntax.
 +
 
 +
An easy example to see why this is necessary, is the weirdness in expressing something like this with tags:
 +
<sourcepawn>
 +
native float[3] GetEntOrigin();
 +
</sourcepawn>
 +
 
 +
''Note on the <tt>String</tt> tag:'' The introduction of <tt>char</tt> might initially seem confusing. The reason for this is that in the future, we would like to introduce a true "string" type that acts as an object, like strings in most other languages. The existing behavior in Pawn is an array of characters, which is much lower-level. The renaming clarifies what the type really is, and leaves the door open for better types in the future.
 +
 
 +
==Arrays==
 +
The new style of declaration disambiguates between two kinds of arrays. Pawn has ''indeterminate arrays'', where the size is not known, and ''determinate arrays'', where the size is known. We refer to these as "dynamic" and "fixed-length" arrays, respectively.
 +
 
 +
'''A fixed-length array is declared by placing brackets after a variable name'''. For example:
 +
<sourcepawn>
 +
int CachedStuff[1000];
 +
int PlayerData[MAXPLAYERS + 1] = { 0, ... };
 +
int Weapons[] = { WEAPON_AK47, WEAPON_GLOCK, WEAPON_KNIFE };
 +
</sourcepawn>
 +
 
 +
In these examples, the array size is fixed. The size is known ahead of time and cannot change. When using brackets in this position, the array size must be specified, either via an explicit size or inferred from an initial value.
 +
 
 +
A dynamic-length array has the brackets '''before the variable name''', that is, '''after the type'''. The most common case is when specifying functions that take an array as input:
 +
<sourcepawn>
 +
native void SetPlayerName(int player, const char[] name);
 +
</sourcepawn>
 +
 
 +
Here, we are specifying that the length of <tt>name</tt> is not always known - it could be anything.
 +
 
 +
Dynamic arrays can also be created in local scopes. For example,
 +
<sourcepawn>
 +
void FindPlayers()
 +
{
 +
  int[] players = new int[MaxClients + 1];
 +
}
 +
</sourcepawn>
 +
 
 +
This allocates a new array of the given size and places a reference in <tt>players</tt>. The memory is automatically freed when no longer in use.
 +
 
 +
It is illegal to initialize a fixed-length array with an indeterminate array, and it is illegal to initialize a dynamic array with a fixed-array. It is also illegal to specify a fixed size on a dynamic length array.
  
Methodmaps and classes are very similar, and to the user will look almost identical. The major difference is that methodmaps are an extension of the existing API and tag system. Classes actually introduce new types that have strict behavior. All new APIs will be introduced as classes, and our goal is to migrate older APIs into methodmaps.
+
For the most part, this does not change existing Pawn semantics. It is simply new syntax intended to clarify the way arrays work.
  
=New Declarators=
+
===Rationale===
Pawn was designed for embedded CPUs and microcontrollers. It does not have a formal type system, and the way variables are declared reflects this. For example,
+
In the original syntax, there was a subtle difference in array declaration:
 +
<sourcepawn>
 +
new array1[MAXPLAYERS + 1];
 +
new array2[MaxClients + 1];
 +
</sourcepawn>
  
<pawn>
+
Here, <tt>array1</tt> and <tt>array2</tt> are very different types: the first is an <tt>int[65]</tt> and the latter is an <tt>int[]</tt>. However, there is no syntactic difference: the compiler has to deduce whether the size expression is constant to determine the type. The new syntax clearly and explicitly disambiguates these cases, so in the future when we introduce fully dynamic and flexible arrays, we're less likely to break existing code.
new Float:x = 5.0;
 
</pawn>
 
  
In Pawn, this is not a variable <tt>x</tt> of type "Float". It is a 32-bit "cell" <i>tagged</i> as a float. It is possible to remove the tag or change the tag, which is almost unheard of in programming languages. While tagging is flexible, it is dangerous and confusing. The syntax itself is also problematic. The parser does not have the ability to identify characters in between the tag name and the colon. Internally, the compiler cannot represent values that are not exactly 32-bit.
+
This may result in some confusion. Hopefully won't matter long term, as once we have true dynamic arrays, fixed arrays will become much less useful and more obscure.
  
The takeaway message is: there is no sensible way to represent a concept like "int64" or "X is a type that represents an array of floats". The tagging grammar makes it too awkward, and the compiler itself is incapable of attaching such information to a tag. We can't fix that right away, but we can begin to deprecate the tag system via a more normal declaration syntax:
+
The rationale for restricting initializers is similar. Once we have true dynamic arrays, the restrictions will be lifted. In the meantime, we need to make sure we're limited to semantics that won't have subtle differences in the future.
  
<pawn>
+
==Examples==
float x = 5.0;    // Replacement for Float
+
<sourcepawn>
int y = 4;        // Replacement for _
+
float x = 5.0;    // Replacement for "new Float:x = 5.0;"
char name[32];    // Replacement for String
+
int y = 4;        // Replacement for "new y = 4;"
 +
char name[32];    // Replacement for "new String:name[32];" and "decl String:name[32];"
  
void DoStuff(float x, int y, char name[]) {
+
void DoStuff(float x, int y, char[] name, int length) { //Replacement for "DoStuff(Float:x, y, String:name[], length)"
 +
  Format(name, length, "%f %d", x, y); //No replacement here
 
}
 
}
</pawn>
+
</sourcepawn>
 +
 
 +
==View As==
 +
A new operator is available for reinterpreting the bits in a value as another type. This operator is called <tt>view_as</tt>. It is not a safe cast, in that, it can transform one type to another even if the actual value does not conform to either.
 +
 
 +
In pre-transitional syntax, this was called "retagging". Retagging does not support new-style types, which is why this operator has been introduced. Example of before and after code:
 +
 
 +
<sourcepawn>
 +
// Before:
 +
float x = Float:array.Get(i);
  
Note the crucial difference in this syntax: it will, in the future, allow us to have method signatures like this:
+
// After:
<pawn>
+
float y = view_as<float>(array.Get(i));
float[3] GetVector();
+
</sourcepawn>
</pawn>
 
  
There is no sane way to express this signature with tags. This syntax will also free up the "new" keyword, and bring Pawn closer in line with languages like C# and Java.
+
It is worth reiterating that this is not a cast. If the value in the array is not a float, then when "viewed as" a float it will probably look very odd.
  
 
==Grammar==
 
==Grammar==
Line 82: Line 156:
 
</pre>
 
</pre>
  
Note that arrays can either be declared next to the type (like "int[]"), or after the variable name (like "int x[]"). The latter style is deeply embedded in the Pawn community, so it will continue to work. However, associating the dimensions with the type is another important step forward. First, it regularizes the positioning when a name isn't needed. Second, another long-term goal is eliminating the need for fixed-length, pre-allocated arrays in the API, and those are one of the only compelling reasons to use the existing syntax.
+
Also note, there is no equivalent of <tt>decl</tt> in the new declarator syntax. <tt>decl</tt> is considered to be dangerous and unnecessary. If an array's zero initialization is too costly, consider making it static or global.
 
 
However, it is illegal to have dimensions in both positions.
 
 
 
Also note, there is no equivalent of <tt>decl</tt> in the new declarator syntax. <tt>decl</tt> is considered to be dangerous and unnecessary.
 
  
 
=Methodmaps=
 
=Methodmaps=
 
==Introduction==
 
==Introduction==
Methodmaps are simple: they attach methods onto a tag. For example, here is our legacy API for Handles:
+
Methodmaps are simple: they attach methods onto an enum. For example, here is our legacy API for Handles:
  
<pawn>
+
<sourcepawn>
native CloneHandle(Handle handle);
+
native Handle CloneHandle(Handle handle);
native CloseHandle(Handle handle);
+
native void CloseHandle(Handle handle);
</pawn>
+
</sourcepawn>
  
 
This is a good example of our legacy API. Using it generally looks something like:
 
This is a good example of our legacy API. Using it generally looks something like:
<pawn>
+
<sourcepawn>
 
Handle array = CreateAdtArray();
 
Handle array = CreateAdtArray();
 
PushArrayCell(array, 4);
 
PushArrayCell(array, 4);
 
CloseHandle(array);
 
CloseHandle(array);
</pawn>
+
</sourcepawn>
  
Gross! A Methodmap can clean it up by attach functions to the <tt>Handle</tt> tag, like this:
+
Gross! A Methodmap can clean it up by attaching functions to the <tt>Handle</tt> tag, like this:
<pawn>
+
<sourcepawn>
 
methodmap Handle {
 
methodmap Handle {
     public Clone = CloneHandle;
+
     public native Handle Clone() = CloneHandle;
     public Close = CloseHandle;
+
     public native void Close() = CloseHandle;
 
};
 
};
</pawn>
+
</sourcepawn>
  
 
Now, our earlier array code can start to look object-oriented:
 
Now, our earlier array code can start to look object-oriented:
<pawn>
+
<sourcepawn>
 
Handle array = CreateAdtArray();
 
Handle array = CreateAdtArray();
 
PushArrayCell(array, 4);
 
PushArrayCell(array, 4);
 
array.Close();
 
array.Close();
</pawn>
+
</sourcepawn>
 +
 
 +
With a full methodmap for Arrays, for example,
 +
<sourcepawn>
 +
methodmap ArrayList < Handle
 +
{
 +
  public native ArrayList(); // constructor
 +
  public native void Push(any value);
 +
};
 +
</sourcepawn>
 +
 
 +
We can write even more object-like code:
 +
<sourcepawn>
 +
ArrayList array = new ArrayList();
 +
array.Push(4);
 +
delete array;
 +
</sourcepawn>
 +
 
 +
(Note: the official API does not expose methods on raw Handles.)
  
 
==Inheritance==
 
==Inheritance==
Line 124: Line 212:
 
For example, here is a transitional API for arrays:
 
For example, here is a transitional API for arrays:
  
<pawn>
+
<sourcepawn>
 
native AdtArray CreateAdtArray();
 
native AdtArray CreateAdtArray();
  
 
methodmap AdtArray < Handle {
 
methodmap AdtArray < Handle {
     public PushCell = PushArrayCell;
+
     public native void PushCell(any value) = PushArrayCell;
 
};
 
};
</pawn>
+
</sourcepawn>
  
 
Note that <tt>CreateAdtArray</tt> now returns <tt>AdtArray</tt> instead of <tt>Handle</tt>. Normally that would break older code, but since <tt>AdtArray</tt> inherits from <tt>Handle</tt>, there is a special rule in the type system that allows coercing an <tt>AdtArray</tt> to a <tt>Handle</tt> (but not vice versa).
 
Note that <tt>CreateAdtArray</tt> now returns <tt>AdtArray</tt> instead of <tt>Handle</tt>. Normally that would break older code, but since <tt>AdtArray</tt> inherits from <tt>Handle</tt>, there is a special rule in the type system that allows coercing an <tt>AdtArray</tt> to a <tt>Handle</tt> (but not vice versa).
  
 
Now, the API looks much more object-oriented:
 
Now, the API looks much more object-oriented:
<pawn>
+
<sourcepawn>
 
AdtArray array = CreateAdtArray();
 
AdtArray array = CreateAdtArray();
 
array.PushCell(4);
 
array.PushCell(4);
 
array.Close();
 
array.Close();
</pawn>
+
</sourcepawn>
  
 
==Inline Methods==
 
==Inline Methods==
 
Methodmaps can declare inline methods and accessors. Inline methods can be either natives or Pawn functions. For example:
 
Methodmaps can declare inline methods and accessors. Inline methods can be either natives or Pawn functions. For example:
  
<pawn>
+
<sourcepawn>
 
methodmap AdtArray {
 
methodmap AdtArray {
     public native void PushCell(value);
+
     public native void PushCell(any value);
 
};
 
};
</pawn>
+
</sourcepawn>
  
 
This example requires that an "AdtArray.PushCell" native exists somewhere in SourceMod. It has a magic initial parameter called "this", so the signature will look something like:
 
This example requires that an "AdtArray.PushCell" native exists somewhere in SourceMod. It has a magic initial parameter called "this", so the signature will look something like:
<pawn>
+
<sourcepawn>
native void AdtArray.PushCell(AdtArray this, value);
+
native void AdtArray.PushCell(AdtArray this, any value);
</pawn>
+
</sourcepawn>
  
 
(Of course, this exact signature will not appear in an include file - it's the signature that the C++ implementation should expect, however.)
 
(Of course, this exact signature will not appear in an include file - it's the signature that the C++ implementation should expect, however.)
  
 
It's also possible to define new functions without a native:
 
It's also possible to define new functions without a native:
<pawn>
+
<sourcepawn>
 
methodmap AdtArray {
 
methodmap AdtArray {
     public native void PushCell(value);
+
     public native void PushCell(any value);
  
     public void PushCells(list[], count) {
+
     public void PushCells(any[] list, int count) {
 
         for (int i = 0; i < count; i++) {
 
         for (int i = 0; i < count; i++) {
 
             this.PushCell(i);
 
             this.PushCell(i);
Line 168: Line 256:
 
     }
 
     }
 
};
 
};
</pawn>
+
</sourcepawn>
  
Lastly, we can also define accessors. Currently only "get" accessors are available. For example,
+
Lastly, we can also define accessors. or example,
<pawn>
+
<sourcepawn>
 
methodmap AdtArray {
 
methodmap AdtArray {
 
     property int Size {
 
     property int Size {
         public get = GetArraySize;
+
         public native get() = GetArraySize;
 
     }
 
     }
 
     property bool Empty {
 
     property bool Empty {
Line 182: Line 270:
 
     }
 
     }
 
     property int Capacity {
 
     property int Capacity {
         public native get;
+
         public native get();
 
     }
 
     }
 
};
 
};
</pawn>
+
</sourcepawn>
  
 
The first accessor simply assigns an existing function as an accessor for "Size". The second accessor is an inline method with an implicit "this" parameter. The third accessor will bind to a native with the following name and signature:
 
The first accessor simply assigns an existing function as an accessor for "Size". The second accessor is an inline method with an implicit "this" parameter. The third accessor will bind to a native with the following name and signature:
  
<pawn>
+
<sourcepawn>
 
native int AdtArray.Capacity.get(AdtArray this);
 
native int AdtArray.Capacity.get(AdtArray this);
</pawn>
+
</sourcepawn>
 +
 
 +
Setters are also supported. For example:
 +
 
 +
<sourcepawn>
 +
methodmap Player {
 +
    property int Health {
 +
        public native get();
 +
        public native set(int health);
 +
    }
 +
}
 +
</sourcepawn>
  
 
==Custom Tags==
 
==Custom Tags==
 
Methodmaps don't have to be used with Handles. It is possible to define custom methodmaps on new or existing tags. For example:
 
Methodmaps don't have to be used with Handles. It is possible to define custom methodmaps on new or existing tags. For example:
  
<pawn>
+
<sourcepawn>
 
methodmap AdminId {
 
methodmap AdminId {
 
     public int Rights() {
 
     public int Rights() {
Line 202: Line 301:
 
     }
 
     }
 
};
 
};
</pawn>
+
</sourcepawn>
  
 
Now, for example, it is possible to do:
 
Now, for example, it is possible to do:
  
<pawn>GetPlayerAdmin(id).Rights()</pawn>
+
<sourcepawn>GetPlayerAdmin(id).Rights()</sourcepawn>
  
 
==Constructors and Destructors==
 
==Constructors and Destructors==
Methodmaps can also define constructors and destructors, which is useful if they are intended to behave like actual objects. For example,
+
Methodmaps can also define constructors, which is useful if they are intended to behave like actual objects. For example,
  
<pawn>
+
<sourcepawn>
 
methodmap AdtArray {
 
methodmap AdtArray {
     public AdtArray(blocksize = 1);
+
     public native AdtArray(int blocksize = 1);
     public ~AdtArray();
+
     public native void PushCell(any value);
    public void PushCell(value);
 
 
};
 
};
</pawn>
+
</sourcepawn>
  
 
Now AdtArrays can be used in a fully object-oriented style:
 
Now AdtArrays can be used in a fully object-oriented style:
<pawn>
+
<sourcepawn>
AdtArray array = AdtArray();
+
AdtArray array = new AdtArray();
 
array.PushCell(10);
 
array.PushCell(10);
 
array.PushCell(20);
 
array.PushCell(20);
 
array.PushCell(30);
 
array.PushCell(30);
 
delete array;
 
delete array;
</pawn>
+
</sourcepawn>
 +
 
 +
[https://forums.alliedmods.net/showpost.php?p=2332183&postcount=1 Support for defining destructors on methodmaps was removed in SourcePawn 1.8.]  Handle types can implement destructors as part of native code in SourceMod's core or as an extension.
  
 
==Caveats==
 
==Caveats==
Line 238: Line 338:
 
  <li>It is not possible to inherit from anything other than another previously declared methodmap.</li>
 
  <li>It is not possible to inherit from anything other than another previously declared methodmap.</li>
 
  <li>Methodmaps can only be defined over scalars - that is, the "this" parameter can never be an array. This means they cannot be used for enum-structs.</li>
 
  <li>Methodmaps can only be defined over scalars - that is, the "this" parameter can never be an array. This means they cannot be used for enum-structs.</li>
  <li>Destructors can only be native.</li>
+
  <li>Destructors can only be native. When we are able to achieve garbage collection, destructors will be removed.</li>
 
  <li>The signatures of methodmaps must use the new declaration syntax.</li>
 
  <li>The signatures of methodmaps must use the new declaration syntax.</li>
 +
<li>Methodmaps must be declared before they are used.</li>
 
</ol>
 
</ol>
  
Line 248: Line 349:
 
method-args ::= arg-new* "..."?
 
method-args ::= arg-new* "..."?
  
methodmap ::= "methodmap" symbol? { methodmap-item* } term
+
methodmap ::= "methodmap" symbol methodmap-inheritance? "{" methodmap-item* "}" term
 +
methodmap-inheritance ::= "<" symbol
 
methodmap-item ::=
 
methodmap-item ::=
           visibility "~"? symbol "=" symbol term
+
           visibility "native" "~"? symbol "(" method-args* ")" ("=" symbol)? term
         | visibility "native" type-expr "~"? symbol methodmap-symbol "(" method-args ")" term
+
         | visibility "~"? symbol "(" method-args* ")" func-body newline
         | visibility type-expr symbol "(" method-args ")" func-body term
+
        | visibility "static"? "native" type-expr symbol "(" method-args* ")" ("=" symbol)? term
         | "property" type-expr symbol { property-decl } term
+
         | visibility "static"? type-expr symbol "(" method-args* ")" func-body newline
property-func ::= "get"
+
         | "property" type-expr symbol "{" property-decl* "}" newline
 
property-decl ::= visibility property-impl
 
property-decl ::= visibility property-impl
 
property-impl ::=
 
property-impl ::=
           "native" property-func "(" ")" term
+
           "native" "get" "(" ")" ("=" symbol)? term
         | property-func "(" ")" func-body term
+
         | "get" "(" ")" func-body newline
         | property-func "=" symbol
+
         | "native" "set" "(" type-expr symbol ")" ("=" symbol)? term
 +
        | "set" "(" type-expr symbol ")" func-body newline
 
</pre>
 
</pre>
  
=Classes=
+
=Typedefs=
 +
 
 +
Function tags and function enums have been deprecated in favor of a more modern syntax. Currently, they can still only create tag names for functions. Future versions will support arbitrary types.
 +
 
 +
Upgrading both functags and funcenums is simple. Below are two examples:
 +
 
 +
<sourcepawn>
 +
functag public Action:SrvCmd(args);
  
Classes are almost identical to methodmaps, with one very important distinction: they are actual types, rather than an attachment to a tag. In fact, classes form an entirely new type system in Pawn. For example, the following is possible with methodmaps:
+
funcenum Timer {
 +
  Action:public(Handle:Timer, Handle:hndl),
 +
  Action:public(Handle:timer),
 +
};
 +
</sourcepawn>
  
<pawn>
+
Now, this becomes:
native Handle CreateTimer(float interval, Timer func, any data = 0);
+
<sourcepawn>
 +
typedef SrvCmd = function Action (int args);
  
methodmap AdtArray {
+
typeset Timer {
    public AdtArray(blocksize = 1);
+
  function Action (Handle timer, Handle hndl);
    public ~AdtArray();
+
  function Action (Handle timer);
 
};
 
};
 +
</sourcepawn>
 +
 +
The grammar for the new syntax is:
 +
 +
<pre>
 +
typedef ::= "typedef" symbol "=" full-type-expr term
 +
full-type-expr ::= "(" type-expr ")"
 +
                | type-expr
 +
type-expr ::= "function" type-name "(" typedef-args? ")"
 +
typedef-args ::= "..."
 +
              | typedef-arg (", " "...")?
 +
</pre>
 +
 +
Note that typedefs only support new-style types.
 +
 +
=Enum Structs=
  
public CheckPlayers(Handle timer, any data)
+
Enum structs were a previously unsupported mechanism for emulating structs through arrays. As of SourceMod 1.10, this mechanism is now fully supported through Transitional Syntax. Here is an example of enum struct syntax:
{
+
 
    ....
+
<sourcepawn>
 +
enum struct Rectangle {
 +
  int x;
 +
  int y;
 +
  int width;
 +
  int height;
 +
 
 +
  int Area() {
 +
    return this.width * this.height;
 +
  }
 +
}
 +
 
 +
void DoStuff(Rectangle r) {
 +
  PrintToServer("%d, %d, %d, %d", r.x, r.y, r.width, r.height);
 +
}
 +
</sourcepawn>
 +
 
 +
Enum structs are syntactic sugar and are internally represented as arrays. This means they pass by-reference to function arguments, and the "&" token is not required (nor is it allowed).
 +
 
 +
Note that even though enum structs are actually arrays, for the most part they cannot be used as arrays. The exception is when interacting with opaque data structures like <tt>ArrayList</tt>. For example, this is considered valid:
 +
 
 +
<sourcepawn>
 +
void SaveRectangle(ArrayList list, const Rectangle r) {
 +
  list.PushArray(r, sizeof(r));
 +
}
 +
 
 +
void PopArray(ArrayList list, Rectangle r) {
 +
  list.GetArray(list.Length - 1, r, sizeof(r));
 +
  list.Erase(list.Length - 1);
 
}
 
}
 +
</sourcepawn>
 +
 +
But this is not allowed:
  
AdtArray array = AdtArray();
+
<sourcepawn>
CreateTimer(2.0, CheckPlayers, array);
+
Rectangle r;
</pawn>
+
PrintToServer("%d", r[0]);
 +
</sourcepawn>
 +
 
 +
The grammar for enum structs is as follows:
 +
 
 +
<pre>
 +
enum-struct ::= "enum" "struct" symbol "{" newline enum-struct-entry enum-struct-entry* "}" term
 +
enum-struct-entry ::= enum-struct-field
 +
                    | enum-struct-method
 +
enum-struct-field ::= type-expr symbol old-dims? term
 +
enum-struct-method ::= type-expr symbol "(" method-args ")" func-body term
 +
</pre>
  
If <tt>AdtArray</tt> was created via <tt>class</tt> instead of <tt>methodmap</tt>, this code would not compile. The reason is that <tt>methodmap</tt> creates a tag that can coerce to <tt>any</tt>. However, <tt>class</tt> creates a '''type''' that coerces '''only to related types'''. Those types are <tt>AdtArray</tt>, and a special builtin type called <tt>Object</tt>. The type of a class does not coerce via <tt>any:</tt> or <tt>_:</tt>.
+
=Enforcing new syntax=
  
==Explanation==
+
You can enforce the new syntax in 1.7 by using <pawn>#pragma newdecls required</pawn> ontop of your code, after the includes (or else current 1.7 includes which contain old syntax will be read with new-syntax rules).
Why does it work this way? The reason is a technical deficiency in the existing Pawn compiler. When you use "any", Pawn does not tell SourceMod what the exact types of its variables are. SourceMod just sees a bunch of bits. It doesn't know whether it's a float, an integer, an array pointer, a handle, or an object. This prevents SourceMod from implementing garbage collection* - SourceMod can't automatically free memory if it can't tell where all its objects are.
 
  
Ultimately, we want garbage collection, but it's a very big and complex task. In the meantime, it's important that the transitional API make garbage collection a future possibility. To this end, "class"-based objects do not have observable values. They cannot be passed into variadic functions or functions that would coerce them into an unrelated or non-object type - including the <tt>any</tt> type. Eventually, we will introduce a new variant of <tt>any</tt> that is safe to use with objects.
 
  
<nowiki>*</nowiki>''Well, decent garbage collection. We could implement "conservative" GC, but it's very unappealing.''
+
[[Category:SourceMod_Scripting]]

Latest revision as of 15:20, 26 May 2022

We would like to give our users a more modern language. Pawn is showing its age; manual memory management, buffers, tags, and lack of object-oriented API are very frustrating. We can't solve everything all at once, but we can begin to take steps in the right direction.

SourceMod 1.7 introduces a Transitional API. It is built on a new Transitional Syntax in SourcePawn, which is a set of language tools to make Pawn feel more modern. In particular, it allows developers to use older APIs in an object-oriented manner, without breaking compatibility. Someday, if and when SourcePawn can become a full-fledged modern language, the transitional API will making porting efforts very minimal.

The transitional API has the following major features and changes:

  • New Declarators - A cleaner way of declaring variables, similar to Java and C#.
  • Methodmaps - Object-oriented wrappers around older APIs.
  • Real Types - SourcePawn now has "int", "float", "bool", "void", and "char" as real types.
  • null - A new, general keyword to replace INVALID_HANDLE.

New Declarators

Developers familiar with pawn will recognize Pawn's original declaration style:

new Float:x = 5.0;
new y = 7;

In the transitional syntax, this can be reworded as:

float x = 5.0;
int y = 7;

The following builtin tags now have types:

  • Float: is float.
  • bool: is bool.
  • _: (or no tag) is int.
  • String: is char.
  • void can now be used as a return type for functions.

Rationale

In the old style, tagged variables are not real types. Float:x does not indicate a float-typed variable, it indicates a 32-bit "cell" tagged as a float. It is possible to remove the tag or change the tag, which while flexible, is dangerous and confusing. The syntax itself is also problematic. The parser does not have the ability to identify characters in between the tag name and the colon. Internally, the compiler cannot represent values that are not exactly 32-bit.

The takeaway message is: there is no sensible way to represent a concept like "int64" or "X is a type that represents an array of floats". The tagging grammar makes it too awkward, and the compiler itself is incapable of attaching such information to a tag. We can't fix that right away, but we can begin to deprecate the tag system via a more normal declaration syntax.

An easy example to see why this is necessary, is the weirdness in expressing something like this with tags:

native float[3] GetEntOrigin();

Note on the String tag: The introduction of char might initially seem confusing. The reason for this is that in the future, we would like to introduce a true "string" type that acts as an object, like strings in most other languages. The existing behavior in Pawn is an array of characters, which is much lower-level. The renaming clarifies what the type really is, and leaves the door open for better types in the future.

Arrays

The new style of declaration disambiguates between two kinds of arrays. Pawn has indeterminate arrays, where the size is not known, and determinate arrays, where the size is known. We refer to these as "dynamic" and "fixed-length" arrays, respectively.

A fixed-length array is declared by placing brackets after a variable name. For example:

int CachedStuff[1000];
int PlayerData[MAXPLAYERS + 1] = { 0, ... };
int Weapons[] = { WEAPON_AK47, WEAPON_GLOCK, WEAPON_KNIFE };

In these examples, the array size is fixed. The size is known ahead of time and cannot change. When using brackets in this position, the array size must be specified, either via an explicit size or inferred from an initial value.

A dynamic-length array has the brackets before the variable name, that is, after the type. The most common case is when specifying functions that take an array as input:

native void SetPlayerName(int player, const char[] name);

Here, we are specifying that the length of name is not always known - it could be anything.

Dynamic arrays can also be created in local scopes. For example,

void FindPlayers()
{
  int[] players = new int[MaxClients + 1];
}

This allocates a new array of the given size and places a reference in players. The memory is automatically freed when no longer in use.

It is illegal to initialize a fixed-length array with an indeterminate array, and it is illegal to initialize a dynamic array with a fixed-array. It is also illegal to specify a fixed size on a dynamic length array.

For the most part, this does not change existing Pawn semantics. It is simply new syntax intended to clarify the way arrays work.

Rationale

In the original syntax, there was a subtle difference in array declaration:

new array1[MAXPLAYERS + 1];
new array2[MaxClients + 1];

Here, array1 and array2 are very different types: the first is an int[65] and the latter is an int[]. However, there is no syntactic difference: the compiler has to deduce whether the size expression is constant to determine the type. The new syntax clearly and explicitly disambiguates these cases, so in the future when we introduce fully dynamic and flexible arrays, we're less likely to break existing code.

This may result in some confusion. Hopefully won't matter long term, as once we have true dynamic arrays, fixed arrays will become much less useful and more obscure.

The rationale for restricting initializers is similar. Once we have true dynamic arrays, the restrictions will be lifted. In the meantime, we need to make sure we're limited to semantics that won't have subtle differences in the future.

Examples

float x = 5.0;    // Replacement for "new Float:x = 5.0;"
int y = 4;        // Replacement for "new y = 4;"
char name[32];    // Replacement for "new String:name[32];" and "decl String:name[32];"
 
void DoStuff(float x, int y, char[] name, int length) { //Replacement for "DoStuff(Float:x, y, String:name[], length)"
  Format(name, length, "%f %d", x, y); //No replacement here
}

View As

A new operator is available for reinterpreting the bits in a value as another type. This operator is called view_as. It is not a safe cast, in that, it can transform one type to another even if the actual value does not conform to either.

In pre-transitional syntax, this was called "retagging". Retagging does not support new-style types, which is why this operator has been introduced. Example of before and after code:

// Before:
float x = Float:array.Get(i);
 
// After:
float y = view_as<float>(array.Get(i));

It is worth reiterating that this is not a cast. If the value in the array is not a float, then when "viewed as" a float it will probably look very odd.

Grammar

The new and old declaration grammar is below.

return-type ::= return-old | return-new
return-new ::= type-expr new-dims?        // Note, dims not yet supported.
return-old ::= old-dims? label?

argdecl ::= arg-old | arg-new
arg-new ::= "const"? type-expr '&'? symbol old-dims? ('=' arg-init)?
arg-old ::= "const"? tags? '&'? symbol old-dims? ('=' arg-init)?

vardecl ::= var-old | var-new
var-new ::= var-new-prefix type-expr symbol old-dims?
var-new-prefix ::= "static" | "const"
var-old ::= var-old-prefix tag? symbol old-dims?
var-old-prefix ::= "new" | "decl" | "static" | "const"

global ::= global-old | global-new
global-new ::= storage-class* type-expr symbol old-dims?
global-old ::= storage-class* tag? symbol old-dims?

storage-class ::= "public" | "static" | "const" | "stock"

type-expr ::= (builtin-type | symbol) new-dims?
builtin-type ::= "void"
               | "int"
               | "float"
               | "char"
               | "bool"

tags ::= tag-vector | tag
tag-vector ::= '{' symbol (',' symbol)* '}' ':'
tag ::= label

new-dims ::= ('[' ']')*
old-dims ::= ('[' expr? ']')+

label ::= symbol ':'
symbol ::= [A-Za-z_]([A-Za-z0-9_]*)

Also note, there is no equivalent of decl in the new declarator syntax. decl is considered to be dangerous and unnecessary. If an array's zero initialization is too costly, consider making it static or global.

Methodmaps

Introduction

Methodmaps are simple: they attach methods onto an enum. For example, here is our legacy API for Handles:

native Handle CloneHandle(Handle handle);
native void CloseHandle(Handle handle);

This is a good example of our legacy API. Using it generally looks something like:

Handle array = CreateAdtArray();
PushArrayCell(array, 4);
CloseHandle(array);

Gross! A Methodmap can clean it up by attaching functions to the Handle tag, like this:

methodmap Handle {
    public native Handle Clone() = CloneHandle;
    public native void Close() = CloseHandle;
};

Now, our earlier array code can start to look object-oriented:

Handle array = CreateAdtArray();
PushArrayCell(array, 4);
array.Close();

With a full methodmap for Arrays, for example,

methodmap ArrayList < Handle
{
  public native ArrayList(); // constructor
  public native void Push(any value);
};

We can write even more object-like code:

ArrayList array = new ArrayList();
array.Push(4);
delete array;

(Note: the official API does not expose methods on raw Handles.)

Inheritance

The Handle system has a "weak" hierarchy. All handles can be passed to CloseHandle, but only AdtArray handles can be passed to functions like PushArrayCell. This hierarchy is not enforced via tags (unfortunately), but instead by run-time checks. Methodmaps allow us to make individual handle types object-oriented, while also moving type-checks into the compiler.

For example, here is a transitional API for arrays:

native AdtArray CreateAdtArray();
 
methodmap AdtArray < Handle {
    public native void PushCell(any value) = PushArrayCell;
};

Note that CreateAdtArray now returns AdtArray instead of Handle. Normally that would break older code, but since AdtArray inherits from Handle, there is a special rule in the type system that allows coercing an AdtArray to a Handle (but not vice versa).

Now, the API looks much more object-oriented:

AdtArray array = CreateAdtArray();
array.PushCell(4);
array.Close();

Inline Methods

Methodmaps can declare inline methods and accessors. Inline methods can be either natives or Pawn functions. For example:

methodmap AdtArray {
    public native void PushCell(any value);
};

This example requires that an "AdtArray.PushCell" native exists somewhere in SourceMod. It has a magic initial parameter called "this", so the signature will look something like:

native void AdtArray.PushCell(AdtArray this, any value);

(Of course, this exact signature will not appear in an include file - it's the signature that the C++ implementation should expect, however.)

It's also possible to define new functions without a native:

methodmap AdtArray {
    public native void PushCell(any value);
 
    public void PushCells(any[] list, int count) {
        for (int i = 0; i < count; i++) {
            this.PushCell(i);
        }
    }
};

Lastly, we can also define accessors. or example,

methodmap AdtArray {
    property int Size {
        public native get() = GetArraySize;
    }
    property bool Empty {
        public get() {
            return this.Size == 0;
        }
    }
    property int Capacity {
        public native get();
    }
};

The first accessor simply assigns an existing function as an accessor for "Size". The second accessor is an inline method with an implicit "this" parameter. The third accessor will bind to a native with the following name and signature:

native int AdtArray.Capacity.get(AdtArray this);

Setters are also supported. For example:

methodmap Player {
    property int Health {
        public native get();
        public native set(int health);
    }
}

Custom Tags

Methodmaps don't have to be used with Handles. It is possible to define custom methodmaps on new or existing tags. For example:

methodmap AdminId {
    public int Rights() {
        return GetAdminFlags(this);
    }
};

Now, for example, it is possible to do:

GetPlayerAdmin(id).Rights()

Constructors and Destructors

Methodmaps can also define constructors, which is useful if they are intended to behave like actual objects. For example,

methodmap AdtArray {
    public native AdtArray(int blocksize = 1);
    public native void PushCell(any value);
};

Now AdtArrays can be used in a fully object-oriented style:

AdtArray array = new AdtArray();
array.PushCell(10);
array.PushCell(20);
array.PushCell(30);
delete array;

Support for defining destructors on methodmaps was removed in SourcePawn 1.8. Handle types can implement destructors as part of native code in SourceMod's core or as an extension.

Caveats

There are a few caveats to methodmaps:

  1. CloseHandle() is not yet gone. It is required to call delete on any object that previously would have required CloseHandle().
  2. There can be only one methodmap for a tag.
  3. When using existing natives, the first parameter of the native must coerce to the tag of the methodmap. Tag mismatches of the "this" parameter will result in an error. Not a warning!
  4. Methodmaps can only be defined on tags. Pawn has some ways of creating actual types (like via struct or class). Methodmaps cannot be created on those types.
  5. Methodmaps do not have strong typing. For example, it is still possible to perform "illegal" casts like Float:CreateAdtArray(). This is necessary for backwards compatibility, so methodmap values can flow into natives like PrintToServer or CreateTimer.
  6. It is not possible to inherit from anything other than another previously declared methodmap.
  7. Methodmaps can only be defined over scalars - that is, the "this" parameter can never be an array. This means they cannot be used for enum-structs.
  8. Destructors can only be native. When we are able to achieve garbage collection, destructors will be removed.
  9. The signatures of methodmaps must use the new declaration syntax.
  10. Methodmaps must be declared before they are used.

Grammar

The grammar for methodmaps is:

visibility ::= "public"
method-args ::= arg-new* "..."?

methodmap ::= "methodmap" symbol methodmap-inheritance? "{" methodmap-item* "}" term
methodmap-inheritance ::= "<" symbol
methodmap-item ::=
           visibility "native" "~"? symbol "(" method-args* ")" ("=" symbol)? term
         | visibility "~"? symbol "(" method-args* ")" func-body newline
         | visibility "static"? "native" type-expr symbol "(" method-args* ")" ("=" symbol)? term
         | visibility "static"? type-expr symbol "(" method-args* ")" func-body newline
         | "property" type-expr symbol "{" property-decl* "}" newline
property-decl ::= visibility property-impl
property-impl ::=
           "native" "get" "(" ")" ("=" symbol)? term
         | "get" "(" ")" func-body newline
         | "native" "set" "(" type-expr symbol ")" ("=" symbol)? term
         | "set" "(" type-expr symbol ")" func-body newline

Typedefs

Function tags and function enums have been deprecated in favor of a more modern syntax. Currently, they can still only create tag names for functions. Future versions will support arbitrary types.

Upgrading both functags and funcenums is simple. Below are two examples:

functag public Action:SrvCmd(args);
 
funcenum Timer {
  Action:public(Handle:Timer, Handle:hndl),
  Action:public(Handle:timer),
};

Now, this becomes:

typedef SrvCmd = function Action (int args);
 
typeset Timer {
  function Action (Handle timer, Handle hndl);
  function Action (Handle timer);
};

The grammar for the new syntax is:

typedef ::= "typedef" symbol "=" full-type-expr term
full-type-expr ::= "(" type-expr ")"
                 | type-expr
type-expr ::= "function" type-name "(" typedef-args? ")"
typedef-args ::= "..."
               | typedef-arg (", " "...")?

Note that typedefs only support new-style types.

Enum Structs

Enum structs were a previously unsupported mechanism for emulating structs through arrays. As of SourceMod 1.10, this mechanism is now fully supported through Transitional Syntax. Here is an example of enum struct syntax:

enum struct Rectangle {
  int x;
  int y;
  int width;
  int height;
 
  int Area() {
    return this.width * this.height;
  }
}
 
void DoStuff(Rectangle r) {
  PrintToServer("%d, %d, %d, %d", r.x, r.y, r.width, r.height);
}

Enum structs are syntactic sugar and are internally represented as arrays. This means they pass by-reference to function arguments, and the "&" token is not required (nor is it allowed).

Note that even though enum structs are actually arrays, for the most part they cannot be used as arrays. The exception is when interacting with opaque data structures like ArrayList. For example, this is considered valid:

void SaveRectangle(ArrayList list, const Rectangle r) {
  list.PushArray(r, sizeof(r));
}
 
void PopArray(ArrayList list, Rectangle r) {
  list.GetArray(list.Length - 1, r, sizeof(r));
  list.Erase(list.Length - 1);
}

But this is not allowed:

Rectangle r;
PrintToServer("%d", r[0]);

The grammar for enum structs is as follows:

enum-struct ::= "enum" "struct" symbol "{" newline enum-struct-entry enum-struct-entry* "}" term
enum-struct-entry ::= enum-struct-field
                    | enum-struct-method
enum-struct-field ::= type-expr symbol old-dims? term
enum-struct-method ::= type-expr symbol "(" method-args ")" func-body term

Enforcing new syntax

You can enforce the new syntax in 1.7 by using

#pragma newdecls required

ontop of your code, after the includes (or else current 1.7 includes which contain old syntax will be read with new-syntax rules).