There are situations that we may have large number of input columns and we may need to Trim the data in those. Normally we can use Derived Column component to get done this, but in this scenario it is very difficult due to the large number of columns.
In such scenario it is easy to use Script Component and it can be configured to trim data for any number of input columns.
Solution is as below.
We can have a Script Component to trim data and there are some steps to follow.
We need to select on the required input columns and change the Usage Type of those columns to ReadWrite as below image.
Then click on Edit Script and follow the below steps to add the required code in C#
Add the using System.Reflection; namespace
Then modify the Input0_ProcessInputRow method as below
public override void Input0_ProcessInputRow(Input0Buffer Row)
{
/*
* Add your code here
*/
foreach (PropertyInfo p in Row.GetType().GetProperties())
{
if (object.ReferenceEquals(p.PropertyType, typeof(string)))
{
if (p.GetValue(Row, null) != null)
{
p.SetValue(Row, TrimData(p.GetValue(Row, null).ToString()), null);
}
}
}
}
The TrimData method is below
public string TrimData(string ValueOfProperty)
{
// Uppercase the value
ValueOfProperty = ValueOfProperty.TrimStart().TrimEnd();
return ValueOfProperty;
}
The complete code after those modifications is as below
#region Namespaces
using System;
using System.Data;
using Microsoft.SqlServer.Dts.Pipeline.Wrapper;
using Microsoft.SqlServer.Dts.Runtime.Wrapper;
using System.Reflection;
#endregion
///
/// This is the class to which to add your code. Do not change the name, attributes, or parent
/// of this class.
///
[Microsoft.SqlServer.Dts.Pipeline.SSISScriptComponentEntryPointAttribute]In such scenario it is easy to use Script Component and it can be configured to trim data for any number of input columns.
Solution is as below.
We can have a Script Component to trim data and there are some steps to follow.
We need to select on the required input columns and change the Usage Type of those columns to ReadWrite as below image.
Then click on Edit Script and follow the below steps to add the required code in C#
Add the using System.Reflection; namespace
Then modify the Input0_ProcessInputRow method as below
public override void Input0_ProcessInputRow(Input0Buffer Row)
{
/*
* Add your code here
*/
foreach (PropertyInfo p in Row.GetType().GetProperties())
{
if (object.ReferenceEquals(p.PropertyType, typeof(string)))
{
if (p.GetValue(Row, null) != null)
{
p.SetValue(Row, TrimData(p.GetValue(Row, null).ToString()), null);
}
}
}
}
The TrimData method is below
public string TrimData(string ValueOfProperty)
{
// Uppercase the value
ValueOfProperty = ValueOfProperty.TrimStart().TrimEnd();
return ValueOfProperty;
}
The complete code after those modifications is as below
#region Namespaces
using System;
using System.Data;
using Microsoft.SqlServer.Dts.Pipeline.Wrapper;
using Microsoft.SqlServer.Dts.Runtime.Wrapper;
using System.Reflection;
#endregion
///
/// This is the class to which to add your code. Do not change the name, attributes, or parent
/// of this class.
///
public class ScriptMain : UserComponent
{
///
/// This method is called once, before rows begin to be processed in the data flow.
///
/// You can remove this method if you don't need to do anything here.
///
public override void PreExecute()
{
base.PreExecute();
/*
* Add your code here
*/
}
///
/// This method is called after all the rows have passed through this component.
///
/// You can delete this method if you don't need to do anything here.
///
public override void PostExecute()
{
base.PostExecute();
/*
* Add your code here
*/
}
///
/// This method is called once for every row that passes through the component from Input0.
///
/// The row that is currently passing through the component
public override void Input0_ProcessInputRow(Input0Buffer Row)
{
/*
* Add your code here
*/
foreach (PropertyInfo p in Row.GetType().GetProperties())
{
if (object.ReferenceEquals(p.PropertyType, typeof(string)))
{
if (p.GetValue(Row, null) != null)
{
p.SetValue(Row, TrimData(p.GetValue(Row, null).ToString()), null);
}
}
}
}
// New function that you can adjust to suit your needs
public string TrimData(string ValueOfProperty)
{
// Uppercase the value
ValueOfProperty = ValueOfProperty.TrimStart().TrimEnd();
return ValueOfProperty;
}
}
After that Build the code, save and close it. Then connect the Script Component with the Output source.
This can be used for any number of columns and it will loop though the available input columns and trim the data on those.
What do you mean by "trim" the data? are you referring to removing leading and trailing spaces? or control characters?
ReplyDelete